Bridging the Chromosome-centric and Biology/Disease-driven Human Proteome Projects: Accessible and Automated Tools for Interpreting the Biological and Pathological Impact of Protein Sequence Variants Detected via Proteogenomics

Ray Sajulga, Subina Mehta, Praveen Kumar, James E Johnson, Candace R Guerrero, Michael C. Ryan, Rachel Karchin, Pratik D Jagtap, Timothy J Griffin

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

The Chromosome-centric Human Proteome Project (C-HPP) seeks to comprehensively characterize all protein products coded by the genome, including those expressed sequence variants confirmed via proteogenomics methods. The closely related Biology/Disease-driven Human Proteome Project (B/D-HPP) seeks to understand the biological and pathological associations of expressed protein products, especially those carrying sequence variants that may be drivers of disease. To achieve these objectives, informatics tools are required that interpret potential functional or disease implications of variant protein sequence detected via proteogenomics. Toward this end, we have developed an automated workflow within the Galaxy for Proteomics (Galaxy-P) platform, which leverages the Cancer-Related Analysis of Variants Toolkit (CRAVAT) and makes it interoperable with proteogenomic results. Protein sequence variants confirmed by proteogenomics are assessed for potential structure-function effects as well as associations with cancer using CRAVAT's rich suite of functionalities, including visualization of results directly within the Galaxy user interface. We demonstrate the effectiveness of this workflow on proteogenomic results generated from an MCF7 breast cancer cell line. Our free and open software should enable improved interpretation of the functional and pathological effects of protein sequence variants detected via proteogenomics, acting as a bridge between the C-HPP and B/D-HPP.

Original languageEnglish (US)
Pages (from-to)4329-4336
Number of pages8
JournalJournal of Proteome Research
Volume17
Issue number12
DOIs
StatePublished - Dec 7 2018

Fingerprint

Proteome
Chromosomes
Galaxies
Proteins
Workflow
Human Chromosomes
Informatics
Proteomics
Proteogenomics
Neoplasms
Software
User interfaces
Genome
Breast Neoplasms
Visualization
Genes
Cell Line
Cells

Keywords

  • Biology/Disease-driven Human Proteome Project
  • CRAVAT
  • Chromosome-centric Human Proteome Project
  • Galaxy-P
  • bioinformatics
  • cancer
  • multiomics
  • proteogenomics

Cite this

@article{6bc4bd973a5b4a978544582d3bd62cbc,
title = "Bridging the Chromosome-centric and Biology/Disease-driven Human Proteome Projects: Accessible and Automated Tools for Interpreting the Biological and Pathological Impact of Protein Sequence Variants Detected via Proteogenomics",
abstract = "The Chromosome-centric Human Proteome Project (C-HPP) seeks to comprehensively characterize all protein products coded by the genome, including those expressed sequence variants confirmed via proteogenomics methods. The closely related Biology/Disease-driven Human Proteome Project (B/D-HPP) seeks to understand the biological and pathological associations of expressed protein products, especially those carrying sequence variants that may be drivers of disease. To achieve these objectives, informatics tools are required that interpret potential functional or disease implications of variant protein sequence detected via proteogenomics. Toward this end, we have developed an automated workflow within the Galaxy for Proteomics (Galaxy-P) platform, which leverages the Cancer-Related Analysis of Variants Toolkit (CRAVAT) and makes it interoperable with proteogenomic results. Protein sequence variants confirmed by proteogenomics are assessed for potential structure-function effects as well as associations with cancer using CRAVAT's rich suite of functionalities, including visualization of results directly within the Galaxy user interface. We demonstrate the effectiveness of this workflow on proteogenomic results generated from an MCF7 breast cancer cell line. Our free and open software should enable improved interpretation of the functional and pathological effects of protein sequence variants detected via proteogenomics, acting as a bridge between the C-HPP and B/D-HPP.",
keywords = "Biology/Disease-driven Human Proteome Project, CRAVAT, Chromosome-centric Human Proteome Project, Galaxy-P, bioinformatics, cancer, multiomics, proteogenomics",
author = "Ray Sajulga and Subina Mehta and Praveen Kumar and Johnson, {James E} and Guerrero, {Candace R} and Ryan, {Michael C.} and Rachel Karchin and Jagtap, {Pratik D} and Griffin, {Timothy J}",
year = "2018",
month = "12",
day = "7",
doi = "10.1021/acs.jproteome.8b00404",
language = "English (US)",
volume = "17",
pages = "4329--4336",
journal = "Journal of Proteome Research",
issn = "1535-3893",
publisher = "American Chemical Society",
number = "12",

}

TY - JOUR

T1 - Bridging the Chromosome-centric and Biology/Disease-driven Human Proteome Projects

T2 - Accessible and Automated Tools for Interpreting the Biological and Pathological Impact of Protein Sequence Variants Detected via Proteogenomics

AU - Sajulga, Ray

AU - Mehta, Subina

AU - Kumar, Praveen

AU - Johnson, James E

AU - Guerrero, Candace R

AU - Ryan, Michael C.

AU - Karchin, Rachel

AU - Jagtap, Pratik D

AU - Griffin, Timothy J

PY - 2018/12/7

Y1 - 2018/12/7

N2 - The Chromosome-centric Human Proteome Project (C-HPP) seeks to comprehensively characterize all protein products coded by the genome, including those expressed sequence variants confirmed via proteogenomics methods. The closely related Biology/Disease-driven Human Proteome Project (B/D-HPP) seeks to understand the biological and pathological associations of expressed protein products, especially those carrying sequence variants that may be drivers of disease. To achieve these objectives, informatics tools are required that interpret potential functional or disease implications of variant protein sequence detected via proteogenomics. Toward this end, we have developed an automated workflow within the Galaxy for Proteomics (Galaxy-P) platform, which leverages the Cancer-Related Analysis of Variants Toolkit (CRAVAT) and makes it interoperable with proteogenomic results. Protein sequence variants confirmed by proteogenomics are assessed for potential structure-function effects as well as associations with cancer using CRAVAT's rich suite of functionalities, including visualization of results directly within the Galaxy user interface. We demonstrate the effectiveness of this workflow on proteogenomic results generated from an MCF7 breast cancer cell line. Our free and open software should enable improved interpretation of the functional and pathological effects of protein sequence variants detected via proteogenomics, acting as a bridge between the C-HPP and B/D-HPP.

AB - The Chromosome-centric Human Proteome Project (C-HPP) seeks to comprehensively characterize all protein products coded by the genome, including those expressed sequence variants confirmed via proteogenomics methods. The closely related Biology/Disease-driven Human Proteome Project (B/D-HPP) seeks to understand the biological and pathological associations of expressed protein products, especially those carrying sequence variants that may be drivers of disease. To achieve these objectives, informatics tools are required that interpret potential functional or disease implications of variant protein sequence detected via proteogenomics. Toward this end, we have developed an automated workflow within the Galaxy for Proteomics (Galaxy-P) platform, which leverages the Cancer-Related Analysis of Variants Toolkit (CRAVAT) and makes it interoperable with proteogenomic results. Protein sequence variants confirmed by proteogenomics are assessed for potential structure-function effects as well as associations with cancer using CRAVAT's rich suite of functionalities, including visualization of results directly within the Galaxy user interface. We demonstrate the effectiveness of this workflow on proteogenomic results generated from an MCF7 breast cancer cell line. Our free and open software should enable improved interpretation of the functional and pathological effects of protein sequence variants detected via proteogenomics, acting as a bridge between the C-HPP and B/D-HPP.

KW - Biology/Disease-driven Human Proteome Project

KW - CRAVAT

KW - Chromosome-centric Human Proteome Project

KW - Galaxy-P

KW - bioinformatics

KW - cancer

KW - multiomics

KW - proteogenomics

UR - http://www.scopus.com/inward/record.url?scp=85053559319&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85053559319&partnerID=8YFLogxK

U2 - 10.1021/acs.jproteome.8b00404

DO - 10.1021/acs.jproteome.8b00404

M3 - Article

C2 - 30130115

AN - SCOPUS:85053559319

VL - 17

SP - 4329

EP - 4336

JO - Journal of Proteome Research

JF - Journal of Proteome Research

SN - 1535-3893

IS - 12

ER -