Abstract
The process of mapping markers from radiation hybrid mapping (RHM) experiments is equivalent to the traveling salesman problem and, thereby, has combinatorial complexity. As an additional problem, experiments typically result in some unreliable markers that reduce the overall quality of the map. We propose a clustering approach for addressing both problems efficiently by eliminating unreliable markers without the need for mapping the complete set of markers. Traditional approaches for eliminating markers use resampling of the full data set, which has an even higher computational complexity than the original mapping problem. In contrast, the proposed approach uses a divide and conquer strategy to construct framework maps based on clusters that exclude unreliable markers. Clusters are ordered using parallel processing and are then combined to form the complete map. Using an RHMdata set of the human genome, we compare the framework maps from our proposed approaches with published physical maps and with the Carthagene tool. Overall, our approach has a very low computational complexity and produces solid framework maps with good chromosome coverage and high agreement with the physical map marker order.
Original language | English (US) |
---|---|
Title of host publication | Proc. of the 12th Int. Workshop on Data Mining in Bioinformatics, BIOKDD 2013 - Held in Conj. with the 19th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, SIGKDD 2013 |
Publisher | Association for Computing Machinery |
Pages | 34-41 |
Number of pages | 8 |
ISBN (Print) | 9781450323277 |
DOIs | |
State | Published - 2013 |
Event | 12th Int.Workshop on Data Mining in Bioinformatics, BIOKDD 2013 - Held in Conjunction with the 19th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, SIGKDD 2013 - Chicago, IL, United States Duration: Aug 11 2013 → Aug 14 2013 |
Publication series
Name | Proc. of the 12th Int. Workshop on Data Mining in Bioinformatics, BIOKDD 2013 - Held in Conjunction with the 19th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, SIGKDD 2013 |
---|
Other
Other | 12th Int.Workshop on Data Mining in Bioinformatics, BIOKDD 2013 - Held in Conjunction with the 19th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, SIGKDD 2013 |
---|---|
Country/Territory | United States |
City | Chicago, IL |
Period | 8/11/13 → 8/14/13 |
Bibliographical note
Copyright:Copyright 2014 Elsevier B.V., All rights reserved.
Keywords
- Bioinformatics
- Clustering
- Framework mapping
- Radiation hybrid mapping