TY - JOUR
T1 - Text-mining applied to autoimmune disease research
T2 - The Sjögrens syndrome knowledge base
AU - Gorr, Sven Ulrik
AU - Wennblom, Trevor J.
AU - Horvath, Steve
AU - Wong, David T.W.
AU - Michie, Sara A.
N1 - Copyright:
Copyright 2013 Elsevier B.V., All rights reserved.
PY - 2012
Y1 - 2012
N2 - Background: Sjögrens syndrome is a tissue-specific autoimmune disease that affects exocrine tissues, especially salivary glands and lacrimal glands. Despite a large body of evidence gathered over the past 60 years, significant gaps still exist in our understanding of Sjögrens syndrome. The goal of this study was to develop a database that collects and organizes gene and protein expression data from the existing literature for comparative analysis with future gene expression and proteomic studies of Sjögrens syndrome. Description. To catalog the existing knowledge in the field, we used text mining to generate the Sjögrens Syndrome Knowledge Base (SSKB) of published gene/protein data, which were extracted from PubMed using text mining of over 7,700 abstracts and listing approximately 500 potential genes/proteins. The raw data were manually evaluated to remove duplicates and false-positives and assign gene names. The data base was manually curated to 477 entries, including 377 potential functional genes, which were used for enrichment and pathway analysis using gene ontology and KEGG pathway analysis. Conclusions: The Sjögrens syndrome knowledge base (http://sskb.umn.edu) can form the foundation for an informed search of existing knowledge in the field as new potential therapeutic targets are identified by conventional or high throughput experimental techniques.
AB - Background: Sjögrens syndrome is a tissue-specific autoimmune disease that affects exocrine tissues, especially salivary glands and lacrimal glands. Despite a large body of evidence gathered over the past 60 years, significant gaps still exist in our understanding of Sjögrens syndrome. The goal of this study was to develop a database that collects and organizes gene and protein expression data from the existing literature for comparative analysis with future gene expression and proteomic studies of Sjögrens syndrome. Description. To catalog the existing knowledge in the field, we used text mining to generate the Sjögrens Syndrome Knowledge Base (SSKB) of published gene/protein data, which were extracted from PubMed using text mining of over 7,700 abstracts and listing approximately 500 potential genes/proteins. The raw data were manually evaluated to remove duplicates and false-positives and assign gene names. The data base was manually curated to 477 entries, including 377 potential functional genes, which were used for enrichment and pathway analysis using gene ontology and KEGG pathway analysis. Conclusions: The Sjögrens syndrome knowledge base (http://sskb.umn.edu) can form the foundation for an informed search of existing knowledge in the field as new potential therapeutic targets are identified by conventional or high throughput experimental techniques.
UR - http://www.scopus.com/inward/record.url?scp=84863211101&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863211101&partnerID=8YFLogxK
U2 - 10.1186/1471-2474-13-119
DO - 10.1186/1471-2474-13-119
M3 - Article
C2 - 22759918
AN - SCOPUS:84863211101
VL - 13
JO - BMC Musculoskeletal Disorders
JF - BMC Musculoskeletal Disorders
SN - 1471-2474
M1 - 119
ER -