De novo prediction of cell-type complexity in single-cell RNA-seq and tumor microenvironments

Research output: Contribution to journalArticlepeer-review

7 Scopus citations


Recent single-cell transcriptomic studies revealed new insights into cell-type heterogeneities in cellular microenvironments unavailable from bulk studies. A significant drawback of currently available algorithms is the need to use empirical parameters or rely on indirect quality measures to estimate the degree of complexity, i.e., the number of subgroups present in the sample. We fill this gap with a single-cell data analysis procedure allowing for unambiguous assessments of the depth of heterogeneity in subclonal compositions supported by data. Our approach combines nonnegative matrix factorization, which takes advantage of the sparse and nonnegative nature of single-cell RNA count data, with Bayesian model comparison enabling de novo prediction of the depth of heterogeneity. We show that the method predicts the correct number of subgroups using simulated data, primary blood mononuclear cell, and pancreatic cell data. We applied our approach to a collection of single-cell tumor samples and found two qualitatively distinct classes of cell-type heterogeneity in cancer microenvironments.

Original languageEnglish (US)
Article numbere201900443
JournalLife Science Alliance
Issue number4
StatePublished - Aug 2019

Bibliographical note

Publisher Copyright:
© 2019 Woo et al.

PubMed: MeSH publication types

  • Journal Article


Dive into the research topics of 'De novo prediction of cell-type complexity in single-cell RNA-seq and tumor microenvironments'. Together they form a unique fingerprint.

Cite this