Maize Genomes to Fields: 2014 and 2015 field season genotype, phenotype, environment, and inbred ear image datasets

Naser Alkhalifah, Darwin A. Campbell, Celeste M. Falcon, Jack M. Gardiner, Nathan D. Miller, Maria Cinta Romay, Ramona Walls, Renee Walton, Cheng Ting Yeh, Martin Bohn, Jessica Bubert, Edward S. Buckler, Ignacio Ciampitti, Sherry Flint-Garcia, Michael A. Gore, Christopher Graham, Candice Hirsch, James B. Holland, David Hooker, Shawn KaepplerJoseph Knoll, Nick Lauter, Elizabeth C. Lee, Aaron Lorenz, Jonathan P. Lynch, Stephen P. Moose, Seth C. Murray, Rebecca Nelson, Torbert Rocheford, Oscar Rodriguez, James C. Schnable, Brian Scully, Margaret Smith, Nathan Springer, Peter Thomison, Mitchell Tuinstra, Randall J. Wisser, Wenwei Xu, David Ertl, Patrick S. Schnable, Natalia De Leon, Edgar P. Spalding, Jode Edwards, Carolyn J. Lawrence-Dill

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Objectives: Crop improvement relies on analysis of phenotypic, genotypic, and environmental data. Given large, well-integrated, multi-year datasets, diverse queries can be made: Which lines perform best in hot, dry environments? Which alleles of specific genes are required for optimal performance in each environment? Such datasets also can be leveraged to predict cultivar performance, even in uncharacterized environments. The maize Genomes to Fields (G2F) Initiative is a multi-institutional organization of scientists working to generate and analyze such datasets from existing, publicly available inbred lines and hybrids. G2F's genotype by environment project has released 2014 and 2015 datasets to the public, with 2016 and 2017 collected and soon to be made available. Data description: Datasets include DNA sequences; traditional phenotype descriptions, as well as detailed ear, cob, and kernel phenotypes quantified by image analysis; weather station measurements; and soil characterizations by site. Data are released as comma separated value spreadsheets accompanied by extensive README text descriptions. For genotypic and phenotypic data, both raw data and a version with outliers removed are reported. For weather data, two versions are reported: a full dataset calibrated against nearby National Weather Service sites and a second calibrated set with outliers and apparent artifacts removed.

Original languageEnglish (US)
Article number452
JournalBMC Research Notes
Volume11
Issue number1
DOIs
StatePublished - Jul 9 2018

Fingerprint

Zea mays
Ear
Genes
Genotype
Genome
Data description
Phenotype
DNA sequences
Spreadsheets
Weather
Image analysis
Crops
Soils
Artifacts
Datasets
Soil
Alleles

Keywords

  • Breeding
  • Environment
  • Genome
  • Genotype
  • Hybrid
  • Inbred
  • Maize
  • Phenotype
  • Prediction
  • Soil

Cite this

Alkhalifah, N., Campbell, D. A., Falcon, C. M., Gardiner, J. M., Miller, N. D., Romay, M. C., ... Lawrence-Dill, C. J. (2018). Maize Genomes to Fields: 2014 and 2015 field season genotype, phenotype, environment, and inbred ear image datasets. BMC Research Notes, 11(1), [452]. https://doi.org/10.1186/s13104-018-3508-1

Maize Genomes to Fields : 2014 and 2015 field season genotype, phenotype, environment, and inbred ear image datasets. / Alkhalifah, Naser; Campbell, Darwin A.; Falcon, Celeste M.; Gardiner, Jack M.; Miller, Nathan D.; Romay, Maria Cinta; Walls, Ramona; Walton, Renee; Yeh, Cheng Ting; Bohn, Martin; Bubert, Jessica; Buckler, Edward S.; Ciampitti, Ignacio; Flint-Garcia, Sherry; Gore, Michael A.; Graham, Christopher; Hirsch, Candice; Holland, James B.; Hooker, David; Kaeppler, Shawn; Knoll, Joseph; Lauter, Nick; Lee, Elizabeth C.; Lorenz, Aaron; Lynch, Jonathan P.; Moose, Stephen P.; Murray, Seth C.; Nelson, Rebecca; Rocheford, Torbert; Rodriguez, Oscar; Schnable, James C.; Scully, Brian; Smith, Margaret; Springer, Nathan; Thomison, Peter; Tuinstra, Mitchell; Wisser, Randall J.; Xu, Wenwei; Ertl, David; Schnable, Patrick S.; De Leon, Natalia; Spalding, Edgar P.; Edwards, Jode; Lawrence-Dill, Carolyn J.

In: BMC Research Notes, Vol. 11, No. 1, 452, 09.07.2018.

Research output: Contribution to journalArticle

Alkhalifah, N, Campbell, DA, Falcon, CM, Gardiner, JM, Miller, ND, Romay, MC, Walls, R, Walton, R, Yeh, CT, Bohn, M, Bubert, J, Buckler, ES, Ciampitti, I, Flint-Garcia, S, Gore, MA, Graham, C, Hirsch, C, Holland, JB, Hooker, D, Kaeppler, S, Knoll, J, Lauter, N, Lee, EC, Lorenz, A, Lynch, JP, Moose, SP, Murray, SC, Nelson, R, Rocheford, T, Rodriguez, O, Schnable, JC, Scully, B, Smith, M, Springer, N, Thomison, P, Tuinstra, M, Wisser, RJ, Xu, W, Ertl, D, Schnable, PS, De Leon, N, Spalding, EP, Edwards, J & Lawrence-Dill, CJ 2018, 'Maize Genomes to Fields: 2014 and 2015 field season genotype, phenotype, environment, and inbred ear image datasets', BMC Research Notes, vol. 11, no. 1, 452. https://doi.org/10.1186/s13104-018-3508-1
Alkhalifah, Naser ; Campbell, Darwin A. ; Falcon, Celeste M. ; Gardiner, Jack M. ; Miller, Nathan D. ; Romay, Maria Cinta ; Walls, Ramona ; Walton, Renee ; Yeh, Cheng Ting ; Bohn, Martin ; Bubert, Jessica ; Buckler, Edward S. ; Ciampitti, Ignacio ; Flint-Garcia, Sherry ; Gore, Michael A. ; Graham, Christopher ; Hirsch, Candice ; Holland, James B. ; Hooker, David ; Kaeppler, Shawn ; Knoll, Joseph ; Lauter, Nick ; Lee, Elizabeth C. ; Lorenz, Aaron ; Lynch, Jonathan P. ; Moose, Stephen P. ; Murray, Seth C. ; Nelson, Rebecca ; Rocheford, Torbert ; Rodriguez, Oscar ; Schnable, James C. ; Scully, Brian ; Smith, Margaret ; Springer, Nathan ; Thomison, Peter ; Tuinstra, Mitchell ; Wisser, Randall J. ; Xu, Wenwei ; Ertl, David ; Schnable, Patrick S. ; De Leon, Natalia ; Spalding, Edgar P. ; Edwards, Jode ; Lawrence-Dill, Carolyn J. / Maize Genomes to Fields : 2014 and 2015 field season genotype, phenotype, environment, and inbred ear image datasets. In: BMC Research Notes. 2018 ; Vol. 11, No. 1.
@article{8d435219debf4b969e9e53d5c1b3037d,
title = "Maize Genomes to Fields: 2014 and 2015 field season genotype, phenotype, environment, and inbred ear image datasets",
abstract = "Objectives: Crop improvement relies on analysis of phenotypic, genotypic, and environmental data. Given large, well-integrated, multi-year datasets, diverse queries can be made: Which lines perform best in hot, dry environments? Which alleles of specific genes are required for optimal performance in each environment? Such datasets also can be leveraged to predict cultivar performance, even in uncharacterized environments. The maize Genomes to Fields (G2F) Initiative is a multi-institutional organization of scientists working to generate and analyze such datasets from existing, publicly available inbred lines and hybrids. G2F's genotype by environment project has released 2014 and 2015 datasets to the public, with 2016 and 2017 collected and soon to be made available. Data description: Datasets include DNA sequences; traditional phenotype descriptions, as well as detailed ear, cob, and kernel phenotypes quantified by image analysis; weather station measurements; and soil characterizations by site. Data are released as comma separated value spreadsheets accompanied by extensive README text descriptions. For genotypic and phenotypic data, both raw data and a version with outliers removed are reported. For weather data, two versions are reported: a full dataset calibrated against nearby National Weather Service sites and a second calibrated set with outliers and apparent artifacts removed.",
keywords = "Breeding, Environment, Genome, Genotype, Hybrid, Inbred, Maize, Phenotype, Prediction, Soil",
author = "Naser Alkhalifah and Campbell, {Darwin A.} and Falcon, {Celeste M.} and Gardiner, {Jack M.} and Miller, {Nathan D.} and Romay, {Maria Cinta} and Ramona Walls and Renee Walton and Yeh, {Cheng Ting} and Martin Bohn and Jessica Bubert and Buckler, {Edward S.} and Ignacio Ciampitti and Sherry Flint-Garcia and Gore, {Michael A.} and Christopher Graham and Candice Hirsch and Holland, {James B.} and David Hooker and Shawn Kaeppler and Joseph Knoll and Nick Lauter and Lee, {Elizabeth C.} and Aaron Lorenz and Lynch, {Jonathan P.} and Moose, {Stephen P.} and Murray, {Seth C.} and Rebecca Nelson and Torbert Rocheford and Oscar Rodriguez and Schnable, {James C.} and Brian Scully and Margaret Smith and Nathan Springer and Peter Thomison and Mitchell Tuinstra and Wisser, {Randall J.} and Wenwei Xu and David Ertl and Schnable, {Patrick S.} and {De Leon}, Natalia and Spalding, {Edgar P.} and Jode Edwards and Lawrence-Dill, {Carolyn J.}",
year = "2018",
month = "7",
day = "9",
doi = "10.1186/s13104-018-3508-1",
language = "English (US)",
volume = "11",
journal = "BMC Research Notes",
issn = "1756-0500",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - Maize Genomes to Fields

T2 - 2014 and 2015 field season genotype, phenotype, environment, and inbred ear image datasets

AU - Alkhalifah, Naser

AU - Campbell, Darwin A.

AU - Falcon, Celeste M.

AU - Gardiner, Jack M.

AU - Miller, Nathan D.

AU - Romay, Maria Cinta

AU - Walls, Ramona

AU - Walton, Renee

AU - Yeh, Cheng Ting

AU - Bohn, Martin

AU - Bubert, Jessica

AU - Buckler, Edward S.

AU - Ciampitti, Ignacio

AU - Flint-Garcia, Sherry

AU - Gore, Michael A.

AU - Graham, Christopher

AU - Hirsch, Candice

AU - Holland, James B.

AU - Hooker, David

AU - Kaeppler, Shawn

AU - Knoll, Joseph

AU - Lauter, Nick

AU - Lee, Elizabeth C.

AU - Lorenz, Aaron

AU - Lynch, Jonathan P.

AU - Moose, Stephen P.

AU - Murray, Seth C.

AU - Nelson, Rebecca

AU - Rocheford, Torbert

AU - Rodriguez, Oscar

AU - Schnable, James C.

AU - Scully, Brian

AU - Smith, Margaret

AU - Springer, Nathan

AU - Thomison, Peter

AU - Tuinstra, Mitchell

AU - Wisser, Randall J.

AU - Xu, Wenwei

AU - Ertl, David

AU - Schnable, Patrick S.

AU - De Leon, Natalia

AU - Spalding, Edgar P.

AU - Edwards, Jode

AU - Lawrence-Dill, Carolyn J.

PY - 2018/7/9

Y1 - 2018/7/9

N2 - Objectives: Crop improvement relies on analysis of phenotypic, genotypic, and environmental data. Given large, well-integrated, multi-year datasets, diverse queries can be made: Which lines perform best in hot, dry environments? Which alleles of specific genes are required for optimal performance in each environment? Such datasets also can be leveraged to predict cultivar performance, even in uncharacterized environments. The maize Genomes to Fields (G2F) Initiative is a multi-institutional organization of scientists working to generate and analyze such datasets from existing, publicly available inbred lines and hybrids. G2F's genotype by environment project has released 2014 and 2015 datasets to the public, with 2016 and 2017 collected and soon to be made available. Data description: Datasets include DNA sequences; traditional phenotype descriptions, as well as detailed ear, cob, and kernel phenotypes quantified by image analysis; weather station measurements; and soil characterizations by site. Data are released as comma separated value spreadsheets accompanied by extensive README text descriptions. For genotypic and phenotypic data, both raw data and a version with outliers removed are reported. For weather data, two versions are reported: a full dataset calibrated against nearby National Weather Service sites and a second calibrated set with outliers and apparent artifacts removed.

AB - Objectives: Crop improvement relies on analysis of phenotypic, genotypic, and environmental data. Given large, well-integrated, multi-year datasets, diverse queries can be made: Which lines perform best in hot, dry environments? Which alleles of specific genes are required for optimal performance in each environment? Such datasets also can be leveraged to predict cultivar performance, even in uncharacterized environments. The maize Genomes to Fields (G2F) Initiative is a multi-institutional organization of scientists working to generate and analyze such datasets from existing, publicly available inbred lines and hybrids. G2F's genotype by environment project has released 2014 and 2015 datasets to the public, with 2016 and 2017 collected and soon to be made available. Data description: Datasets include DNA sequences; traditional phenotype descriptions, as well as detailed ear, cob, and kernel phenotypes quantified by image analysis; weather station measurements; and soil characterizations by site. Data are released as comma separated value spreadsheets accompanied by extensive README text descriptions. For genotypic and phenotypic data, both raw data and a version with outliers removed are reported. For weather data, two versions are reported: a full dataset calibrated against nearby National Weather Service sites and a second calibrated set with outliers and apparent artifacts removed.

KW - Breeding

KW - Environment

KW - Genome

KW - Genotype

KW - Hybrid

KW - Inbred

KW - Maize

KW - Phenotype

KW - Prediction

KW - Soil

UR - http://www.scopus.com/inward/record.url?scp=85049778367&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85049778367&partnerID=8YFLogxK

U2 - 10.1186/s13104-018-3508-1

DO - 10.1186/s13104-018-3508-1

M3 - Article

C2 - 29986751

AN - SCOPUS:85049778367

VL - 11

JO - BMC Research Notes

JF - BMC Research Notes

SN - 1756-0500

IS - 1

M1 - 452

ER -