A scalable data integration and analysis architecture for sensor data of pediatric asthma

Dimitris Stripelis, José Luis Ambite, Yao Yi Chiang, Sandrah P. Eckel, Rima Habre

Research output: Chapter in Book/Report/Conference proceedingConference contribution

14 Scopus citations

Abstract

According to the Centers for Disease Control, in the United States there are 6.8 million children living with asthma. Despite the importance of the disease, the available prognostic tools are not sufficient for biomedical researchers to thoroughly investigate the potential risks of the disease at scale. To overcome these challenges we present a big data integration and analysis infrastructure developed by our Data and Software Coordination and Integration Center (DSCIC) of the NIBIB-funded Pediatric Research using Integrated Sensor Monitoring Systems (PRISMS) program. Our goal is to help biomedical researchers to efficiently predict and prevent asthma attacks. The PRISMS-DSCIC is responsible for collecting, integrating, storing, and analyzing realtime environmental, physiological and behavioral data obtained from heterogeneous sensor and traditional data sources. Our architecture is based on the Apache Kafka, Spark and Hadoop frameworks and PostgreSQL DBMS. A main contribution of this work is extending the Spark framework with a mediation layer, based on logical schema mappings and query rewriting, to facilitate data analysis over a consistent harmonized schema. The system provides both batch and stream analytic capabilities over the massive data generated by wearable and fixed sensors.

Original languageEnglish (US)
Title of host publicationProceedings - 2017 IEEE 33rd International Conference on Data Engineering, ICDE 2017
PublisherIEEE Computer Society
Pages1407-1408
Number of pages2
ISBN (Electronic)9781509065431
DOIs
StatePublished - May 16 2017
Externally publishedYes
Event33rd IEEE International Conference on Data Engineering, ICDE 2017 - San Diego, United States
Duration: Apr 19 2017Apr 22 2017

Publication series

NameProceedings - International Conference on Data Engineering
ISSN (Print)1084-4627

Other

Other33rd IEEE International Conference on Data Engineering, ICDE 2017
Country/TerritoryUnited States
CitySan Diego
Period4/19/174/22/17

Bibliographical note

Funding Information:
This work was supported by NIH grant 1U24EB021996-01.

Publisher Copyright:
© 2017 IEEE.

Fingerprint

Dive into the research topics of 'A scalable data integration and analysis architecture for sensor data of pediatric asthma'. Together they form a unique fingerprint.

Cite this