Utility of different gene enrichment approaches toward identifying and sequencing the maize gene space

Nathan Michael Springer, Xiequn Xu, W. Brad Barbazuk

Research output: Contribution to journalArticlepeer-review

41 Scopus citations


Maize (Zea mays) possesses a large, highly repetitive genome, and subsequently a number of reduced-representation sequencing approaches have been used to try and enrich for gene space while eluding difficulties associated with repetitive DNA. This article documents the ability of publicly available maize expressed sequence tag and Genome Survey Sequences (GSSs; many of which were isolated through the use of reduced representation techniques) to recognize and provide coverage of 78 maize full-length cDNAs (FLCs). All 78 FLCs in the dataset were identified by at least three GSSs, indicating that the majority of maize genes have been identified by at least one currently available GSS. Both methyl-filtration and high-Cot enrichment methods provided a 7- to 8-fold increase in gene discovery rates as compared to random sequencing. The available maize GSSs aligned to 75% of the FLC nucleotides used to perform searches, while the expressed sequence tag sequences aligned to 73% of the nucleotides. Our data suggest that at least approximately 95% of maize genes have been tagged by at least one GSS. While the GSSs are very effective for gene identification, relatively few (18%) of the FLCs are completely represented by GSSs. Analysis of the overlap of coverage and bias due to position within a gene suggest that RescueMu, methyl-filtration, and high-Cot methods are at least partially nonredundant.

Original languageEnglish (US)
Pages (from-to)3023-3033
Number of pages11
JournalPlant physiology
Issue number2
StatePublished - Oct 2004


Dive into the research topics of 'Utility of different gene enrichment approaches toward identifying and sequencing the maize gene space'. Together they form a unique fingerprint.

Cite this