TY - JOUR
T1 - Mining novel multivariate relationships in time series data using correlation networks
AU - Agrawal, Saurabh
AU - Steinbach, Michael
AU - Boley, Daniel
AU - Chatterjee, Snigdhansu
AU - Atluri, Gowtham
AU - Dang, Anh The
AU - Liess, Stefan
AU - Kumar, Vipin
N1 - Publisher Copyright:
© 1989-2012 IEEE.
PY - 2020/9/1
Y1 - 2020/9/1
N2 - In many domains, there is significant interest in capturing novel relationships between time series that represent activities recorded at different nodes of a highly complex system. In this paper, we introduce multipoles, a novel class of linear relationships between more than two time series. A multipole is a set of time series that have strong linear dependence among themselves, with the requirement that each time series makes a significant contribution to the linear dependence. We demonstrate that most interesting multipoles can be identified as cliques of negative correlations in a correlation network. Such cliques are typically rare in a real-world correlation network, which allows us to find almost all multipoles efficiently using a clique-enumeration approach. Using our proposed framework, we demonstrate the utility of multipoles in discovering new physical phenomena in two scientific domains: climate science and neuroscience. In particular, we discovered several multipole relationships that are reproducible in multiple other independent datasets and lead to novel domain insights.
AB - In many domains, there is significant interest in capturing novel relationships between time series that represent activities recorded at different nodes of a highly complex system. In this paper, we introduce multipoles, a novel class of linear relationships between more than two time series. A multipole is a set of time series that have strong linear dependence among themselves, with the requirement that each time series makes a significant contribution to the linear dependence. We demonstrate that most interesting multipoles can be identified as cliques of negative correlations in a correlation network. Such cliques are typically rare in a real-world correlation network, which allows us to find almost all multipoles efficiently using a clique-enumeration approach. Using our proposed framework, we demonstrate the utility of multipoles in discovering new physical phenomena in two scientific domains: climate science and neuroscience. In particular, we discovered several multipole relationships that are reproducible in multiple other independent datasets and lead to novel domain insights.
KW - Multivariate linear patterns
KW - climate teleconnections
KW - correlation mining
KW - fMRI
KW - spatio-temporal
UR - https://www.scopus.com/pages/publications/85090326965
UR - https://www.scopus.com/pages/publications/85090326965#tab=citedBy
U2 - 10.1109/TKDE.2019.2911681
DO - 10.1109/TKDE.2019.2911681
M3 - Article
AN - SCOPUS:85090326965
SN - 1041-4347
VL - 32
SP - 1798
EP - 1811
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 9
M1 - 8693798
ER -