Generating GO slim using relational database management systems to support proteomics analysis

Innocent G Onsongo, Hongwei Xie, Timothy J Griffin, John V Carlis

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

The Gene Ontology Consortium built the Gene Ontology database (GO) to address the need for a common standard in naming genes and gene products. Using different names for the same concepts and different concepts with the same name makes it effectively impossible for humans and computers alike to analyze biological processes across different organisms. The consortium addresses this need by defining terms for categorizing genes and gene products. A convention in GO is that each gene or gene product is annotated to the most specific GO term in the GO database. It is, however, also useful for researchers to be able to group genes or gene products into broad biological categories that give a higher-level view of their function when analyzing results of an experiment. A GO Slim is a subset of the GO ontology that provides such a higher-level view of functions. Existing GO Slim generation tools have two important limitations: programming language dependence, and an inability to dynamically generate a GO Slim while analyzing. We have extended the relational database engine to dynamically generate a GO Slim overcoming this limitations. Using this extention, we have developed a tool (DynamicGOSlim) that dynamically generates a GO Slim and uses the generated GO Slim to categorize genes or gene products. This tool is being used in an ongoing proteomics project aimed at identifying possible oral cancer biomarkers in saliva.

Original languageEnglish (US)
Title of host publicationProceedings of the 21st IEEE International Symposium on Computer-Based Medical Systems, CBMS 2008
Pages215-217
Number of pages3
DOIs
StatePublished - Sep 22 2008
Event21st IEEE International Symposium on Computer-Based Medical Systems, CBMS 2008 - Jyvaskyla, Finland
Duration: Jun 17 2008Jun 19 2008

Publication series

NameProceedings - IEEE Symposium on Computer-Based Medical Systems
ISSN (Print)1063-7125

Other

Other21st IEEE International Symposium on Computer-Based Medical Systems, CBMS 2008
CountryFinland
CityJyvaskyla
Period6/17/086/19/08

Fingerprint

Database Management Systems
Gene Ontology
Proteomics
Ontology
Genes
Databases
Names
Programming Languages
Biological Phenomena
Mouth Neoplasms
Tumor Biomarkers
Saliva

Cite this

Onsongo, I. G., Xie, H., Griffin, T. J., & Carlis, J. V. (2008). Generating GO slim using relational database management systems to support proteomics analysis. In Proceedings of the 21st IEEE International Symposium on Computer-Based Medical Systems, CBMS 2008 (pp. 215-217). [4561989] (Proceedings - IEEE Symposium on Computer-Based Medical Systems). https://doi.org/10.1109/CBMS.2008.77

Generating GO slim using relational database management systems to support proteomics analysis. / Onsongo, Innocent G; Xie, Hongwei; Griffin, Timothy J; Carlis, John V.

Proceedings of the 21st IEEE International Symposium on Computer-Based Medical Systems, CBMS 2008. 2008. p. 215-217 4561989 (Proceedings - IEEE Symposium on Computer-Based Medical Systems).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Onsongo, IG, Xie, H, Griffin, TJ & Carlis, JV 2008, Generating GO slim using relational database management systems to support proteomics analysis. in Proceedings of the 21st IEEE International Symposium on Computer-Based Medical Systems, CBMS 2008., 4561989, Proceedings - IEEE Symposium on Computer-Based Medical Systems, pp. 215-217, 21st IEEE International Symposium on Computer-Based Medical Systems, CBMS 2008, Jyvaskyla, Finland, 6/17/08. https://doi.org/10.1109/CBMS.2008.77
Onsongo IG, Xie H, Griffin TJ, Carlis JV. Generating GO slim using relational database management systems to support proteomics analysis. In Proceedings of the 21st IEEE International Symposium on Computer-Based Medical Systems, CBMS 2008. 2008. p. 215-217. 4561989. (Proceedings - IEEE Symposium on Computer-Based Medical Systems). https://doi.org/10.1109/CBMS.2008.77
Onsongo, Innocent G ; Xie, Hongwei ; Griffin, Timothy J ; Carlis, John V. / Generating GO slim using relational database management systems to support proteomics analysis. Proceedings of the 21st IEEE International Symposium on Computer-Based Medical Systems, CBMS 2008. 2008. pp. 215-217 (Proceedings - IEEE Symposium on Computer-Based Medical Systems).
@inproceedings{ddf954ddbbeb426989d5741482638d81,
title = "Generating GO slim using relational database management systems to support proteomics analysis",
abstract = "The Gene Ontology Consortium built the Gene Ontology database (GO) to address the need for a common standard in naming genes and gene products. Using different names for the same concepts and different concepts with the same name makes it effectively impossible for humans and computers alike to analyze biological processes across different organisms. The consortium addresses this need by defining terms for categorizing genes and gene products. A convention in GO is that each gene or gene product is annotated to the most specific GO term in the GO database. It is, however, also useful for researchers to be able to group genes or gene products into broad biological categories that give a higher-level view of their function when analyzing results of an experiment. A GO Slim is a subset of the GO ontology that provides such a higher-level view of functions. Existing GO Slim generation tools have two important limitations: programming language dependence, and an inability to dynamically generate a GO Slim while analyzing. We have extended the relational database engine to dynamically generate a GO Slim overcoming this limitations. Using this extention, we have developed a tool (DynamicGOSlim) that dynamically generates a GO Slim and uses the generated GO Slim to categorize genes or gene products. This tool is being used in an ongoing proteomics project aimed at identifying possible oral cancer biomarkers in saliva.",
author = "Onsongo, {Innocent G} and Hongwei Xie and Griffin, {Timothy J} and Carlis, {John V}",
year = "2008",
month = "9",
day = "22",
doi = "10.1109/CBMS.2008.77",
language = "English (US)",
isbn = "9780769531656",
series = "Proceedings - IEEE Symposium on Computer-Based Medical Systems",
pages = "215--217",
booktitle = "Proceedings of the 21st IEEE International Symposium on Computer-Based Medical Systems, CBMS 2008",

}

TY - GEN

T1 - Generating GO slim using relational database management systems to support proteomics analysis

AU - Onsongo, Innocent G

AU - Xie, Hongwei

AU - Griffin, Timothy J

AU - Carlis, John V

PY - 2008/9/22

Y1 - 2008/9/22

N2 - The Gene Ontology Consortium built the Gene Ontology database (GO) to address the need for a common standard in naming genes and gene products. Using different names for the same concepts and different concepts with the same name makes it effectively impossible for humans and computers alike to analyze biological processes across different organisms. The consortium addresses this need by defining terms for categorizing genes and gene products. A convention in GO is that each gene or gene product is annotated to the most specific GO term in the GO database. It is, however, also useful for researchers to be able to group genes or gene products into broad biological categories that give a higher-level view of their function when analyzing results of an experiment. A GO Slim is a subset of the GO ontology that provides such a higher-level view of functions. Existing GO Slim generation tools have two important limitations: programming language dependence, and an inability to dynamically generate a GO Slim while analyzing. We have extended the relational database engine to dynamically generate a GO Slim overcoming this limitations. Using this extention, we have developed a tool (DynamicGOSlim) that dynamically generates a GO Slim and uses the generated GO Slim to categorize genes or gene products. This tool is being used in an ongoing proteomics project aimed at identifying possible oral cancer biomarkers in saliva.

AB - The Gene Ontology Consortium built the Gene Ontology database (GO) to address the need for a common standard in naming genes and gene products. Using different names for the same concepts and different concepts with the same name makes it effectively impossible for humans and computers alike to analyze biological processes across different organisms. The consortium addresses this need by defining terms for categorizing genes and gene products. A convention in GO is that each gene or gene product is annotated to the most specific GO term in the GO database. It is, however, also useful for researchers to be able to group genes or gene products into broad biological categories that give a higher-level view of their function when analyzing results of an experiment. A GO Slim is a subset of the GO ontology that provides such a higher-level view of functions. Existing GO Slim generation tools have two important limitations: programming language dependence, and an inability to dynamically generate a GO Slim while analyzing. We have extended the relational database engine to dynamically generate a GO Slim overcoming this limitations. Using this extention, we have developed a tool (DynamicGOSlim) that dynamically generates a GO Slim and uses the generated GO Slim to categorize genes or gene products. This tool is being used in an ongoing proteomics project aimed at identifying possible oral cancer biomarkers in saliva.

UR - http://www.scopus.com/inward/record.url?scp=51849099551&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=51849099551&partnerID=8YFLogxK

U2 - 10.1109/CBMS.2008.77

DO - 10.1109/CBMS.2008.77

M3 - Conference contribution

SN - 9780769531656

T3 - Proceedings - IEEE Symposium on Computer-Based Medical Systems

SP - 215

EP - 217

BT - Proceedings of the 21st IEEE International Symposium on Computer-Based Medical Systems, CBMS 2008

ER -