TY - JOUR
T1 - Insights into the maize pan-genome and pan-transcriptome
AU - Hirsch, Candice N.
AU - Foerster, Jillian M.
AU - Johnson, James M.
AU - Sekhon, Rajandeep S.
AU - Muttoni, German
AU - Vaillancourt, Brieanne
AU - Peñagaricano, Francisco
AU - Lindquist, Erika
AU - Pedraza, Mary Ann
AU - Barry, Kerrie
AU - de Leon, Natalia
AU - Kaeppler, Shawn M.
AU - Robin Buell, C.
PY - 2014/1
Y1 - 2014/1
N2 - Genomes at the species level are dynamic, with genes present in every individual (core) and genes in a subset of individuals (dispensable) that collectively constitute the pan-genome. Using transcriptome sequencing of seedling RNA from 503 maize (Zea mays) inbred lines to characterize the maize pan-genome, we identified 8681 representative transcript assemblies (RTAs) with 16.4% expressed in all lines and 82.7% expressed in subsets of the lines. Interestingly, with linkage disequilibrium mapping, 76.7% of the RTAs with at least one single nucleotide polymorphism (SNP) could be mapped to a single genetic position, distributed primarily throughout the nonpericentromeric portion of the genome. Stepwise iterative clustering of RTAs suggests, within the context of the genotypes used in this study, that the maize genome is restricted and further sampling of seedling RNA within this germplasm base will result in minimal discovery. Genome-wide association studies based on SNPs and transcript abundance in the pan-genome revealed loci associated with the timing of the juvenile-to-adult vegetative and vegetative-to-reproductive developmental transitions, two traits important for fitness and adaptation. This study revealed the dynamic nature of the maize pan-genome and demonstrated that a substantial portion of variation may lie outside the single reference genome for a species.
AB - Genomes at the species level are dynamic, with genes present in every individual (core) and genes in a subset of individuals (dispensable) that collectively constitute the pan-genome. Using transcriptome sequencing of seedling RNA from 503 maize (Zea mays) inbred lines to characterize the maize pan-genome, we identified 8681 representative transcript assemblies (RTAs) with 16.4% expressed in all lines and 82.7% expressed in subsets of the lines. Interestingly, with linkage disequilibrium mapping, 76.7% of the RTAs with at least one single nucleotide polymorphism (SNP) could be mapped to a single genetic position, distributed primarily throughout the nonpericentromeric portion of the genome. Stepwise iterative clustering of RTAs suggests, within the context of the genotypes used in this study, that the maize genome is restricted and further sampling of seedling RNA within this germplasm base will result in minimal discovery. Genome-wide association studies based on SNPs and transcript abundance in the pan-genome revealed loci associated with the timing of the juvenile-to-adult vegetative and vegetative-to-reproductive developmental transitions, two traits important for fitness and adaptation. This study revealed the dynamic nature of the maize pan-genome and demonstrated that a substantial portion of variation may lie outside the single reference genome for a species.
UR - http://www.scopus.com/inward/record.url?scp=84896808010&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84896808010&partnerID=8YFLogxK
U2 - 10.1105/tpc.113.119982
DO - 10.1105/tpc.113.119982
M3 - Article
C2 - 24488960
AN - SCOPUS:84896808010
SN - 1040-4651
VL - 26
SP - 121
EP - 135
JO - Plant Cell
JF - Plant Cell
IS - 1
ER -