H-FUSE: Efficient fusion of aggregated historical data

Zongge Liu, Hyun Ah Song, Vladimir Zadorozhny, Christos Faloutsos, Nicholas Sidiropoulos

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

12 Scopus citations

Abstract

In this paper, we address the challenge of recovering a time sequence of counts from aggregated historical data. For example, given a mixture of monthly and weekly sums, how can we find the daily counts of people infected with flu? In general, what is the best way to recover historical counts from aggregated, possibly overlapping historical reports, in the presence of missing values? Equally importantly, how much should we trust this reconstruction? We propose H-FUSE, a novel method that solves the above problems by allowing injection of domain knowledge in a principled way, and turning the task into a well-defined optimization problem. H-FUSE has the following desirable properties: (a) Effectiveness, recovering historical data from aggregated reports with high accuracy; (b) Self-awareness, providing an assessment of when the recovery is not reliable; (c) Scalability, with computation linear in the size of the input data. Experiments on real data (epidemiology counts from the Tycho project [13]) demonstrate that H-FUSE reconstructs the original data 30-81% better than the least squares method.
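To make the problem setting concrete, the following is a minimal sketch, not the authors' implementation. It assumes each report is the sum of daily counts over a contiguous window, and it assumes a first-difference smoothness penalty as one plausible form of domain knowledge; the abstract does not specify the constraint used. The names build_report_matrix and reconstruct, the synthetic data, and the weight smoothness=1.0 are all hypothetical.

```python
import numpy as np

def build_report_matrix(n_days, reports):
    """Row r has ones on the days covered by report r = (start, length)."""
    A = np.zeros((len(reports), n_days))
    for row, (start, length) in enumerate(reports):
        A[row, start:start + length] = 1.0
    return A

def reconstruct(A, y, smoothness=0.0):
    """Minimize ||A x - y||^2 + smoothness * ||D x||^2 (D = first differences).

    With smoothness == 0 this is plain least squares; np.linalg.lstsq then
    returns the minimum-norm solution of the under-determined system.
    The smoothness penalty is an assumption made for this sketch.
    """
    n = A.shape[1]
    if smoothness == 0.0:
        return np.linalg.lstsq(A, y, rcond=None)[0]
    D = np.diff(np.eye(n), axis=0)  # (n-1) x n first-difference operator
    A_aug = np.vstack([A, np.sqrt(smoothness) * D])
    y_aug = np.concatenate([y, np.zeros(n - 1)])
    return np.linalg.lstsq(A_aug, y_aug, rcond=None)[0]

# Synthetic daily counts over four weeks (illustrative data, not from the paper).
days = np.arange(28)
truth = 50 + 30 * np.sin(2 * np.pi * days / 28)

# Overlapping reports: four weekly sums plus three two-week sums.
reports = [(s, 7) for s in range(0, 28, 7)] + [(s, 14) for s in (0, 7, 14)]
A = build_report_matrix(28, reports)
y = A @ truth  # the aggregated reports we get to observe

x_ls = reconstruct(A, y)                      # least squares baseline
x_smooth = reconstruct(A, y, smoothness=1.0)  # with the assumed smoothness prior

def rmse(x):
    """Root-mean-square error against the known synthetic truth."""
    return float(np.sqrt(np.mean((x - truth) ** 2)))

print("least squares RMSE:", rmse(x_ls))
print("smoothed RMSE:     ", rmse(x_smooth))
```

In this toy setup the two-week sums are implied by the weekly sums, so the minimum-norm least squares answer is constant within each week. The penalized version trades a small mismatch in the reported sums for a reconstruction whose day-to-day variation is no larger than that of the baseline.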

Original language: English (US)
Title of host publication: Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017
Editors: Nitesh Chawla, Wei Wang
Publisher: Society for Industrial and Applied Mathematics Publications
Pages: 786-794
Number of pages: 9
ISBN (Electronic): 9781611974874
State: Published - 2017
Event: 17th SIAM International Conference on Data Mining, SDM 2017 - Houston, United States
Duration: Apr 27 2017 → Apr 29 2017

Publication series

Name: Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017

Other

Other: 17th SIAM International Conference on Data Mining, SDM 2017
Country/Territory: United States
City: Houston
Period: 4/27/17 → 4/29/17

Bibliographical note

Publisher Copyright:
Copyright © by SIAM.
