Hadoop Admin


Health Care Data Analytics

This proof-of-concept project performs ETL, data aggregation, and data analysis on health care data from Merck, one of the top drug manufacturers in the USA. The data set contains millions of records of patient demographics, prescription, and observation data. All of this data resided in an Oracle system. We migrated the data from Oracle to Hadoop using Sqoop, set up a multi-node Hadoop cluster, created Hive and HBase tables, and loaded the data into them.