Identification of hepatocellular carcinoma-related genes with a machine learning and network analysis

Tuantuan Gui, Xiao Dong, Rudong Li, Yixue Li, Zhen Wang

Research output: Contribution to journalArticlepeer-review

52 Scopus citations

Abstract

Liver cancer is one of the leading causes of cancer mortality worldwide. Hepatocellular carcinoma (HCC) is the main type of liver cancer. We applied a machine learning approach with maximum-relevance-minimum-redundancy (mRMR) algorithm followed by incremental feature selection (IFS) to a set of microarray data generated from 43 tumor and 52 nontumor samples. With the machine learning approach, we identified 117 gene probes that could optimally separate tumor and nontumor samples. These genes not only include known HCC-relevant genes such as MT1X, BMI1, and CAP2, but also include cancer genes that were not found previously to be closely related to HCC, such as TACSTD2. Then, we constructed a molecular interaction network based on the protein-protein interaction (PPI) data from the STRING database and identified 187 genes on the shortest paths among the genes identified with the machine learning approach. Network analysis reveals new potential roles of ubiquitin C in the pathogenesis of HCC. Based on gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis, we showed that the identified subnetwork is significantly enriched in biological processes related to cell death. These results bring new insights of understanding the process of HCC.

Original languageEnglish (US)
Pages (from-to)63-71
Number of pages9
JournalJournal of Computational Biology
Volume22
Issue number1
DOIs
StatePublished - Jan 1 2015
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2015 Mary Ann Liebert, Inc.

Keywords

  • Hepatocellular carcinoma
  • maximum relevance minimum redundancy
  • protein-protein interaction.

Fingerprint

Dive into the research topics of 'Identification of hepatocellular carcinoma-related genes with a machine learning and network analysis'. Together they form a unique fingerprint.

Cite this