Papyrologists analyze, transcribe, and edit papyrus fragments to better understand the linguistics, culture, and literature of the ancient world. One of their common tasks is to match an unknown fragment to a known manuscript. This is especially challenging when a fragment is damaged (e.g., through deterioration) and preserves only limited information. In the last 100 years, only about 10% of the more than 500,000 fragments recovered from the Egyptian village of Oxyrhynchus have been edited. We do not know what new ancient texts might be found or what can be learned from them, but with current methods of identification, editing the remaining fragments will take more than 1,000 years. Matching an anonymous string of characters against a collection of known sequences is a ubiquitous task in computational biology. Genes and proteins are often represented as sequences of contiguous characters, each of which denotes a nucleotide or an amino acid. Relationships are inferred by finding multi-letter patterns shared between the anonymous sequence and a known sequence. This process is commonly referred to as genetic sequence alignment. In this paper, we introduce a novel methodology that uses modern genetic sequence alignment algorithms to identify Ancient Greek text fragments. This application will give papyrologists and other professionals in the humanities the ability to rapidly identify severely damaged texts. The approach leverages a new form of non-contextual, multi-line text identification for the Greek language that can greatly accelerate the tedious task of transcription and identification.
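The idea of matching a damaged fragment against known texts by shared multi-letter patterns can be sketched with a classic local alignment. The abstract does not say which alignment algorithm or scoring scheme the paper uses, so the following is only an illustration under stated assumptions: it uses Smith-Waterman local alignment, the scoring values (`match`, `mismatch`, `gap`) are arbitrary, the function names `local_alignment_score` and `identify` are hypothetical, and the two-line corpus (the opening lines of the Iliad and the Odyssey, written without spaces or diacritics) is toy data chosen for the example. The `.` in the fragment stands for an unreadable letter.

```python
def local_alignment_score(fragment, reference, match=2, mismatch=-1, gap=-2):
    """Best Smith-Waterman local alignment score between two strings.

    The scoring values are illustrative assumptions, not taken from the paper.
    """
    prev = [0] * (len(reference) + 1)  # previous row of the scoring matrix
    best = 0
    for a in fragment:
        curr = [0]  # column 0 is always 0 in a local alignment
        for j, b in enumerate(reference, start=1):
            diag = prev[j - 1] + (match if a == b else mismatch)
            # A local alignment may restart anywhere, so scores never drop below 0.
            score = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(score)
            best = max(best, score)
        prev = curr
    return best


def identify(fragment, corpus):
    """Return (title, score) for the corpus text that best matches the fragment."""
    scored = ((title, local_alignment_score(fragment, text))
              for title, text in corpus.items())
    return max(scored, key=lambda pair: pair[1])


# Toy corpus (assumption): first lines of two known texts, spaces and accents removed.
CORPUS = {
    "Iliad 1.1": "μηνιναειδεθεαπηληιαδεωαχιληος",
    "Odyssey 1.1": "ανδραμοιεννεπεμουσαπολυτροπονοςμαλαπολλα",
}

# A damaged fragment: "." marks one unreadable letter.
print(identify("αειδ.θεα", CORPUS))
```

Because the alignment is local, the fragment does not need to start at a word or line boundary, and a single illegible letter costs only one mismatch penalty rather than preventing the match. A practical system would search a far larger corpus and would likely rely on optimized alignment tools rather than this quadratic-time loop.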
|Original language||English (US)|
|Title of host publication||Proceedings - 2014 IEEE 10th International Conference on eScience, eScience 2014|
|Publisher||Institute of Electrical and Electronics Engineers Inc.|
|Number of pages||6|
|State||Published - Dec 2 2014|
|Event||10th IEEE International Conference on eScience, eScience 2014 - Guaruja, Brazil|
|Duration||Oct 20 2014 → Oct 24 2014|
Bibliographical note
Publisher Copyright: © 2014 IEEE.
- Ancient Greek
- genetic sequence alignment