Proteome-wide analysis of protein function composition reveals the clustering and phylogenetic properties of organisms

Lunjiang Ling, Jinhua Wang, Yan Cui, Wei Li, Runsheng Chen

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

A 17-dimensional vector named the proteome vector is defined to represent an organism. The components of the vector reflect the relative contents of protein-encoding genes of the 17 cluster of orthologous groups of proteins (COGs) classes in the whole genome of the relevant organism. Based on the definition of this proteome vector, the fuzzy clustering of 36 completely sequenced organisms (8 archaea, 24 bacteria, and 4 eukarya) was performed and a proteome tree was constructed. Our results show that (1) the 36 organisms can be 100% correctly classified into three clusters corresponding to the three primary kingdoms, (2) our proteome tree is remarkably similar to that derived from 16S rRNA, and (3) the chromosomes and/or plasmids belonging to the same organism have very similar gene composition. Based on these results, we argue that the 17-dimensional proteome vector could be a good criterion for clustering approaches and to a large extent reveals the phylogenetic properties of organisms; the Three Primary Kingdoms Hypothesis is trustworthy although the existence of lateral gene transfer (LGT) brings controversy to the construction of the "universal tree of life."

Original languageEnglish (US)
Pages (from-to)101-111
Number of pages11
JournalMolecular Phylogenetics and Evolution
Volume25
Issue number1
DOIs
StatePublished - Oct 29 2002

Fingerprint Dive into the research topics of 'Proteome-wide analysis of protein function composition reveals the clustering and phylogenetic properties of organisms'. Together they form a unique fingerprint.

Cite this