An approach to estimating forest biomass while quantifying estimate uncertainty and correcting bias in machine learning maps

Ethan Emick, Chad Babcock, Grayson W. White, Andrew T. Hudak, Grant M. Domke, Andrew O. Finley

Research output: Contribution to journalArticlepeer-review

Abstract

Providing forest biomass estimates with desired accuracy and precision for small areas is a key challenge to incorporating forest carbon offsets into commodity trading programs. Enrolled forest carbon projects and verification entities typically rely on probabilistically sampled field data and design-based (DB) estimators to estimate carbon storage and characterize uncertainty. However, this methodology requires a large amount of field data to achieve sufficient precision and collection of these data can be prohibitively expensive. This has spurred interest in developing regional-scale maps of forest biomass that incorporate remote sensing data as an alternative to collecting expensive plot data. These maps are often generated using machine learning (ML) algorithms that combine remote sensing products and field measurements. While these maps can produce estimates across large geographic regions at fine spatial resolutions, the estimates are prone to bias and do not have associated uncertainty estimates. Here, we assess one such map developed by the National Aeronautics and Space Administration's Carbon Monitoring System. We consider model-assisted (MA) and geostatistical model-based (GMB) estimators to address map bias and uncertainty quantification. The MA and GMB estimators use a sample of field observations as the response, and the ML-produced map as an auxiliary variable to achieve statistically defensible predictions. We compare MA and GMB estimator performance to DB and direct (DR) estimators. This assessment considers both counties and a small areal extent experimental forest, all within Oregon USA. Results suggest the MA and GMB estimators perform similar to the DB estimator at the state level and in counties containing many field plots. But in counties with moderate to small field sample sizes, the GMB and MA estimators are more precise than the DB estimator. As within-county sample sizes get smaller, the GMB estimator tends to outperform MA. Results also show the DR estimator's state-level estimates are substantially larger than the DB, MA and GMB estimates, indicating that that the DR estimator may be biased. When assessing the GMB estimator for the experimental forest, we find the GMB estimator has sufficient precision for stand-level carbon accounting even when no field observations are available within the stand. Plot-level GMB uncertainty interval coverage probabilities were estimated and showed adequate coverage. This suggests that the GMB estimator is producing statistically rigorous uncertainty estimates.

Original languageEnglish (US)
Article number113678
JournalRemote Sensing of Environment
Volume295
DOIs
StatePublished - Sep 1 2023

Bibliographical note

Funding Information:
This work was supported, in part, by the USDA Forest Service Forest Inventory and Analysis (FIA) and Forest Health Monitoring (FHM) programs, and National Aeronautics and Space Administration's Carbon Monitoring System Program. The first author's work was also supported through the Minnesota Agricultural Research, Education and Extension Tech Transfer program (AGREETT). The third author received additional support from the National Science Foundation (NSF) NSF/EF 1253225 and NSF/DMS 1916395 . Data were provided by the H.J. Andrews Experimental Forest and Long Term Ecological Research (LTER) program, administered cooperatively by Oregon State University, the USDA Forest Service Pacific Northwest Research Station, and the Willamette National Forest. This material is based upon work supported by the National Science Foundation under the grant LTER8 DEB- 2025755 .

Funding Information:
This work was supported, in part, by the USDA Forest Service Forest Inventory and Analysis (FIA) and Forest Health Monitoring (FHM) programs, and National Aeronautics and Space Administration's Carbon Monitoring System Program. The first author's work was also supported through the Minnesota Agricultural Research, Education and Extension Tech Transfer program (AGREETT). The third author received additional support from the National Science Foundation (NSF) NSF/EF 1253225 and NSF/DMS 1916395. Data were provided by the H.J. Andrews Experimental Forest and Long Term Ecological Research (LTER) program, administered cooperatively by Oregon State University, the USDA Forest Service Pacific Northwest Research Station, and the Willamette National Forest. This material is based upon work supported by the National Science Foundation under the grant LTER8 DEB-2025755.

Publisher Copyright:
© 2023 Elsevier Inc.

Keywords

  • Bayesian hierarchical spatial modeling
  • Bias correction
  • Carbon monitoring
  • Design-based inference
  • Forest inventory
  • Machine learning
  • Model-based inference
  • Random forest
  • Remote sensing
  • Small area estimation

Fingerprint

Dive into the research topics of 'An approach to estimating forest biomass while quantifying estimate uncertainty and correcting bias in machine learning maps'. Together they form a unique fingerprint.

Cite this