Abstract
The autonomous identification of time-steps where the behavior of a time-series significantly deviates from a predefined model, or time-series change point detection, is an active field of research with notable applications in finance, health, and advertising. One family of time-series change detection algorithms, referred to as "model-based methods", although useful for many applications, performs poor when the data are noisy and have outliers. We introduce a new framework that enables existing model-based methods to be more robust to these data challenges. We demonstrate the effectiveness of our approach on remote sensing and mobile health data. Our method introduces two new concepts: (i) a random sampling procedure allows us to overcome outliers, and (ii) a matrix-based representation of anomaly scores provides a flexible and intuitive way to identify multiple types of changes and test their significance. We show that our method performs better than several baseline methods, including application-specific algorithms, and provide all data and open-source code.
| Original language | English (US) |
|---|---|
| Title of host publication | 16th SIAM International Conference on Data Mining 2016, SDM 2016 |
| Editors | Sanjay Chawla Venkatasubramanian, Wagner Meira |
| Publisher | Society for Industrial and Applied Mathematics Publications |
| Pages | 162-170 |
| Number of pages | 9 |
| ISBN (Electronic) | 9781510828117 |
| DOIs | |
| State | Published - 2016 |
| Event | 16th SIAM International Conference on Data Mining 2016, SDM 2016 - Miami, United States Duration: May 5 2016 → May 7 2016 |
Publication series
| Name | 16th SIAM International Conference on Data Mining 2016, SDM 2016 |
|---|
Other
| Other | 16th SIAM International Conference on Data Mining 2016, SDM 2016 |
|---|---|
| Country/Territory | United States |
| City | Miami |
| Period | 5/5/16 → 5/7/16 |
Bibliographical note
Publisher Copyright:Copyright © by SIAM.