Abstract
Sequence data is being produced by genomic sequencing laboratories at ever-increasing rates, making it impossible for individual researchers to keep track of all the new data that might affect their research. Computer systems are needed so that researchers can access this data. The systems must support high-level interfaces that communicate in the language of the researchers, database systems that guarantee availability and consistency of the data, and powerful search systems that rapidly scan for similarities between sequences. We have developed a prototype system that includes a graphical user interface, an object-oriented database management system, and high-performance similarity search algorithms. The prototype has the potential to increase researchers' productivity by automating entry of annotated sequence fragments as they are produced by sequencing machines, storing the fragments in the database, and automatically producing and displaying similarity search results of new sequences against the large public sequence databases GenBank and PIR. This paper describes the prototype, discusses the benefits of object-oriented databases for complex and changing sequence data, and presents an object-oriented schema for genetic information. Graphical tools for annotating sequences, storing them in the database, automating similarity searches, and viewing similarity search results are presented. A new suffix treebased data structure that supports rapid similarity searches on sequence data is introduced. Finally, future plans for the system are discussed.
| Original language | English (US) |
|---|---|
| Title of host publication | Proceedings of the 1993 ACM/SIGAPP Symposium on Applied Computing |
| Subtitle of host publication | States of the Art and Practice, SAC 1993 |
| Editors | Ed Deaton, George Hedrick, K.M. George, Hal Berghel |
| Publisher | Association for Computing Machinery |
| Pages | 641-651 |
| Number of pages | 11 |
| ISBN (Electronic) | 0897915674 |
| DOIs | |
| State | Published - Mar 1 1993 |
| Event | 1993 ACM/SIGAPP Symposium on Applied Computing: States of the Art and Practice, SAC 1993 - Indianapolis, United States Duration: Feb 14 1993 → Feb 16 1993 |
Publication series
| Name | Proceedings of the ACM Symposium on Applied Computing |
|---|---|
| Volume | Part F129680 |
Other
| Other | 1993 ACM/SIGAPP Symposium on Applied Computing: States of the Art and Practice, SAC 1993 |
|---|---|
| Country/Territory | United States |
| City | Indianapolis |
| Period | 2/14/93 → 2/16/93 |
Bibliographical note
Publisher Copyright:© 1993 ACM.
Keywords
- Computational molecular biology
- Genome sequencing
- Graphical user interface
- Object-oriented database
- Suffix tree
Fingerprint
Dive into the research topics of 'An object-oriented genetics information system'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS