There is a challenge being faced by all to manage the tremendous growth of data volume and
complexity. Hence, there needs to be a strategy to apply big data techniques. Now a days, the analytics based on big data. The data is increasing and is in the form of any structured, semi structured and unstructured data taken from online or offline applications including social media. In this article we will demonstrate how Pega 7 can help in integration with big data.
What is Hadoop ?
Hadoop is an open-source software framework for storing huge data and running applications on clusters
of commodity hardware. Hadoop and its ecosystem provides massive storage, processing power and handles many concurrent tasks or jobs. Hadoop and its ecosystem to store and process structured, semi structured and unstructured data.
What is Pega ?
Pega is BPM Tool. Pega is a Business Process Management(BPM) tool that primarily focuses on workflow or integration capabilities. It is commonly used in Banking and Insurance domains.
Big Data Integration for PEGA
Pega 7.2.x version supports big data Hadoop integration to enhance its predictive analytics. The integration involves Pega Platform directly integrating with big data in HDFS or Hbase using HDFS/Hbase connector and building predictive models on very large data sets using Hadoop’s Map Reduce framework.
The Pega Platform can read data from both HDFS file system and HBase database or write data to both HDFS file system and HBase database using HDFS/Hbase connector. The big data Components on the Pega 7.2.x Platform are
• Hadoop Record – All configuration of Hadoop clusters can be done using this to connect to Hadoop through HDFS connector of Hbase connector
• HDFS Data Set- To access an external HDFS to read or write the data we use this on Pega platform.
Pega Platform interacts with Hadoop store through a firewall using HDFS/Hbase connector.