COMPRESSED SPECTRAL SCREENING FOR LARGE-SCALE DIFFERENTIAL CORRELATION ANALYSIS WITH APPLICATION IN SELECTING GLIOBLASTOMA GENE MODULES

Tianxi Li, Xiwei Tang, Ajay Chatrath

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Differential coexpression analysis has been widely applied by scientists in understanding the biological mechanisms of diseases. However, the unknown differential patterns are often complicated; thus, models based on simplified parametric assumptions can be ineffective in identifying the dif-ferences. Meanwhile, the gene expression data involved in such analysis are in extremely high dimensions by nature, whose correlation matrices may not even be computable. Such a large scale seriously limits the application of most well-studied statistical methods. This paper introduces a simple yet powerful approach to the differential correlation analysis problem called compressed spectral screening. By leveraging spectral structures and random sampling techniques, our approach could achieve a highly accurate screening of features with complicated differential patterns while maintaining the scalability to analyze correlation matrices of 104 –105 variables within a few minutes on a standard personal computer. We have applied this screening approach in comparing a TCGA data set about Glioblastoma with normal subjects. Our analysis successfully identifies multiple functional modules of genes that exhibit different coexpression patterns. The findings reveal new insights about Glioblastoma’s evolving mechanism. The validity of our approach is also justified by a theoretical analysis, showing that the compressed spectral analysis can achieve variable screening consistency.

Original languageEnglish (US)
Pages (from-to)3450-3475
Number of pages26
JournalAnnals of Applied Statistics
Volume17
Issue number4
DOIs
StatePublished - Dec 2023
Externally publishedYes

Bibliographical note

Publisher Copyright:
© Institute of Mathematical Statistics, 2023.

Keywords

  • Differential correlation analysis
  • gene coexpression
  • high-dimensional correlation matrices
  • spectral methods

Fingerprint

Dive into the research topics of 'COMPRESSED SPECTRAL SCREENING FOR LARGE-SCALE DIFFERENTIAL CORRELATION ANALYSIS WITH APPLICATION IN SELECTING GLIOBLASTOMA GENE MODULES'. Together they form a unique fingerprint.

Cite this