Abstract
Paleoscience data are extremely heterogeneous; hundreds of different types of measurements and reconstructions are routinely made by scientists on a variety of types of physical samples. This heterogeneity is one of the biggest barriers to finding paleoclimatic records, to building large-scale data products, and to the use of paleoscience data beyond the community of specialists. Here, we document the Paleoenvironmental Standard Terms (PaST) thesaurus, the first authoritative vocabulary of standardized variable names for paleoclimatic and paleoenvironmental data developed in a formal knowledge organization structure. This structure is designed to improve data set discovery, support automated processing of data, and provide connectivity to other vocabularies. PaST is now used operationally at the World Data Service for Paleoclimatology (WDS-Paleo), one of the largest repositories of paleoscience information. Terms from the PaST thesaurus standardize a broad array of paleoenvironmental and paleoclimatic measured and inferred variables, providing enough detail for accurate and precise data discovery and thereby promoting data reuse. We describe the main design decisions and features of the thesaurus, the governance structure for ongoing maintenance, and WDS-Paleo services that now employ PaST. These services include an advanced search by variable name, an interface for thesaurus navigation, and a machine-readable representation in the Simple Knowledge Organization System (SKOS) standard. This overview is designed for developers of thesauri, data contributors, and users of the WDS-Paleo, and serves as a building block for future efforts within the broader paleoscience community to improve how data are described for long-term findability, accessibility, interoperability, and reusability. © 2021. American Geophysical Union. All Rights Reserved.
Original language | English (US) |
---|---|
Article number | e2020PA004193 |
Number of pages | 17 |
Journal | Paleoceanography and Paleoclimatology |
Volume | 36 |
Issue number | 6 |
DOIs | |
State | Published - Jun 1 2021 |
Bibliographical note
Funding Information:Development of the thesaurus was supported by funding from the NSF Paleoclimate Program (AGS‐1545762) and NOAA's Climate Program Office (Cooperative Agreement #NA17OAR4320101). The authors thank John Williams and two anonymous reviewers for their helpful reviews, and thank the scientists who provided feedback on drafts of the thesaurus through virtual advisory panels: Christo Buizert, Emilie Dassie, Kristine DeLong, Sarah Feakins, T. J. Fudge, Ben Hardt, Sharon Hoffmann, Bronwen Konecky, Shaun Marcott, Nick McKay, Anders Noren, Alexandra Noronha, Ian Orland, David Porinchu, Julie Richey, Jeffery Stone, Jessica Tierney, Amy Wagner, Eugene Wahl, Branwen Williams, Erika Wise, and Corinne Wong.
Funding Information:
Development of the thesaurus was supported by funding from the NSF Paleoclimate Program (AGS-1545762) and NOAA's Climate Program Office (Cooperative Agreement #NA17OAR4320101). The authors thank John Williams and two anonymous reviewers for their helpful reviews, and thank the scientists who provided feedback on drafts of the thesaurus through virtual advisory panels: Christo Buizert, Emilie Dassie, Kristine DeLong, Sarah Feakins, T. J. Fudge, Ben Hardt, Sharon Hoffmann, Bronwen Konecky, Shaun Marcott, Nick McKay, Anders Noren, Alexandra Noronha, Ian Orland, David Porinchu, Julie Richey, Jeffery Stone, Jessica Tierney, Amy Wagner, Eugene Wahl, Branwen Williams, Erika Wise, and Corinne Wong.
Publisher Copyright:
© 2021. American Geophysical Union. All Rights Reserved.
Keywords
- controlled vocabulary
- data discovery
- paleoclimate
- paleoenvironment
- proxy measurement
- SKOS data model