Abstract
According to the Centers for Disease Control, in the United States there are 6.8 million children living with asthma. Despite the importance of the disease, the available prognostic tools are not sufficient for biomedical researchers to thoroughly investigate the potential risks of the disease at scale. To overcome these challenges we present a big data integration and analysis infrastructure developed by our Data and Software Coordination and Integration Center (DSCIC) of the NIBIB-funded Pediatric Research using Integrated Sensor Monitoring Systems (PRISMS) program. Our goal is to help biomedical researchers to efficiently predict and prevent asthma attacks. The PRISMS-DSCIC is responsible for collecting, integrating, storing, and analyzing realtime environmental, physiological and behavioral data obtained from heterogeneous sensor and traditional data sources. Our architecture is based on the Apache Kafka, Spark and Hadoop frameworks and PostgreSQL DBMS. A main contribution of this work is extending the Spark framework with a mediation layer, based on logical schema mappings and query rewriting, to facilitate data analysis over a consistent harmonized schema. The system provides both batch and stream analytic capabilities over the massive data generated by wearable and fixed sensors.
Original language | English (US) |
---|---|
Title of host publication | Proceedings - 2017 IEEE 33rd International Conference on Data Engineering, ICDE 2017 |
Publisher | IEEE Computer Society |
Pages | 1407-1408 |
Number of pages | 2 |
ISBN (Electronic) | 9781509065431 |
DOIs | |
State | Published - May 16 2017 |
Externally published | Yes |
Event | 33rd IEEE International Conference on Data Engineering, ICDE 2017 - San Diego, United States Duration: Apr 19 2017 → Apr 22 2017 |
Publication series
Name | Proceedings - International Conference on Data Engineering |
---|---|
ISSN (Print) | 1084-4627 |
Other
Other | 33rd IEEE International Conference on Data Engineering, ICDE 2017 |
---|---|
Country/Territory | United States |
City | San Diego |
Period | 4/19/17 → 4/22/17 |
Bibliographical note
Funding Information:This work was supported by NIH grant 1U24EB021996-01.
Publisher Copyright:
© 2017 IEEE.