Background

Big Data is a collection of data so large, so complex, so distributed, and growing so fast (or 5Vs- volume, variety, velocity, veracity, and value). It has been known for unlocking new sources of economic value, providing fresh insights into sciences, and assisting on policy making. Healthcare and life science is the most data intensive industry in the world. Huge volumes of very heterogeneous raw data are generated daily by a variety of modern clinical information systems, such as Electronic Health Records (EHRs), Computerized Physician Order Entry (CPOE), Laboratory Information Systems, and Picture Archiving and Communications System (PACS), Medical sensors can generate unimaginable volumes of patient data, per year. These information systems are utilized for functionalities in many healthcare settings such as physician offices and hospitals. Several published studies have asserted that Big Data managed efficiently can improve care delivery while reducing healthcare costs. A number of real world practices and cases also reported using Big Data to improve Healthcare, Life Science, and better Health Policy Decision Making.