Evaluating methods of updating training data in long-term genomewide selection

Jeffrey L. Neyhart, Tyler Tiede, Aaron J. Lorenz, Kevin P. Smith

Research output: Contribution to journalArticlepeer-review

32 Scopus citations


Genomewide selection is hailed for its ability to facilitate greater genetic gains per unit time. Over breeding cycles, the requisite linkage disequilibrium (LD) between quantitative trait loci and markers is expected to change as a result of recombination, selection, and drift, leading to a decay in prediction accuracy. Previous research has identified the need to update the training population using data that may capture new LD generated over breeding cycles; however, optimal methods of updating have not been explored. In a barley (Hordeum vulgare L.) breeding simulation experiment, we examined prediction accuracy and response to selection when updating the training population each cycle with the best predicted lines, the worst predicted lines, both the best and worst predicted lines, random lines, criterion-selected lines, or no lines. In the short term, we found that updating with the best predicted lines or the best and worst predicted lines resulted in high prediction accuracy and genetic gain, but in the long term, all methods (besides not updating) performed similarly. We also examined the impact of including all data in the training population or only the most recent data. Though patterns among update methods were similar, using a smaller but more recent training population provided a slight advantage in prediction accuracy and genetic gain. In an actual breeding program, a breeder might desire to gather phenotypic data on lines predicted to be the best, perhaps to evaluate possible cultivars. Therefore, our results suggest that an optimal method of updating the training population is also very practical.

Original languageEnglish (US)
Pages (from-to)1499-1510
Number of pages12
JournalG3: Genes, Genomes, Genetics
Issue number5
StatePublished - May 1 2017

Bibliographical note

Publisher Copyright:
© 2017 Neyhart et al.


  • Barley
  • GenPred
  • Genomic Selection
  • Optimization
  • Shared Data Resources
  • Simulation
  • Training population


Dive into the research topics of 'Evaluating methods of updating training data in long-term genomewide selection'. Together they form a unique fingerprint.

Cite this