Ensemble Machine Learning Approach Improves Predicted Spatial Variation of Surface Soil Organic Carbon Stocks in Data-Limited Northern Circumpolar Region
Various approaches of differing mathematical complexities are being applied for spatial prediction of soil properties. Regression kriging is a widely used hybrid approach of spatial variation that combines correlation between soil properties and environmental factors with spatial autocorrelation bet...
Published in: | Frontiers in Big Data |
---|---|
Main Authors: | , , , |
Other Authors: | |
Format: | Article in Journal/Newspaper |
Language: | unknown |
Published: |
Frontiers Media SA
2020
|
Subjects: | |
Online Access: | http://dx.doi.org/10.3389/fdata.2020.528441 https://www.frontiersin.org/articles/10.3389/fdata.2020.528441/full |
id |
crfrontiers:10.3389/fdata.2020.528441 |
---|---|
record_format |
openpolar |
spelling |
crfrontiers:10.3389/fdata.2020.528441 2024-09-09T20:03:19+00:00 Ensemble Machine Learning Approach Improves Predicted Spatial Variation of Surface Soil Organic Carbon Stocks in Data-Limited Northern Circumpolar Region Mishra, Umakant Gautam, Sagar Riley, William J. Hoffman, Forrest M. Argonne National Laboratory 2020 http://dx.doi.org/10.3389/fdata.2020.528441 https://www.frontiersin.org/articles/10.3389/fdata.2020.528441/full unknown Frontiers Media SA https://creativecommons.org/licenses/by/4.0/ Frontiers in Big Data volume 3 ISSN 2624-909X journal-article 2020 crfrontiers https://doi.org/10.3389/fdata.2020.528441 2024-08-13T04:04:55Z Various approaches of differing mathematical complexities are being applied for spatial prediction of soil properties. Regression kriging is a widely used hybrid approach of spatial variation that combines correlation between soil properties and environmental factors with spatial autocorrelation between soil observations. In this study, we compared four machine learning approaches (gradient boosting machine, multinarrative adaptive regression spline, random forest, and support vector machine) with regression kriging to predict the spatial variation of surface (0–30 cm) soil organic carbon (SOC) stocks at 250-m spatial resolution across the northern circumpolar permafrost region. We combined 2,374 soil profile observations (calibration datasets) with georeferenced datasets of environmental factors (climate, topography, land cover, bedrock geology, and soil types) to predict the spatial variation of surface SOC stocks. We evaluated the prediction accuracy at randomly selected sites (validation datasets) across the study area. We found that different techniques inferred different numbers of environmental factors and their relative importance for prediction of SOC stocks. Regression kriging produced lower prediction errors in comparison to multinarrative adaptive regression spline and support vector machine, and comparable prediction accuracy to gradient boosting machine and random forest. However, the ensemble median prediction of SOC stocks obtained from all four machine learning techniques showed highest prediction accuracy. Although the use of different approaches in spatial prediction of soil properties will depend on the availability of soil and environmental datasets and computational resources, we conclude that the ensemble median prediction obtained from multiple machine learning approaches provides greater spatial details and produces the highest prediction accuracy. Thus an ensemble prediction approach can be a better choice than any single prediction technique for predicting the spatial variation of SOC ... Article in Journal/Newspaper permafrost Frontiers (Publisher) Frontiers in Big Data 3 |
institution |
Open Polar |
collection |
Frontiers (Publisher) |
op_collection_id |
crfrontiers |
language |
unknown |
description |
Various approaches of differing mathematical complexities are being applied for spatial prediction of soil properties. Regression kriging is a widely used hybrid approach of spatial variation that combines correlation between soil properties and environmental factors with spatial autocorrelation between soil observations. In this study, we compared four machine learning approaches (gradient boosting machine, multinarrative adaptive regression spline, random forest, and support vector machine) with regression kriging to predict the spatial variation of surface (0–30 cm) soil organic carbon (SOC) stocks at 250-m spatial resolution across the northern circumpolar permafrost region. We combined 2,374 soil profile observations (calibration datasets) with georeferenced datasets of environmental factors (climate, topography, land cover, bedrock geology, and soil types) to predict the spatial variation of surface SOC stocks. We evaluated the prediction accuracy at randomly selected sites (validation datasets) across the study area. We found that different techniques inferred different numbers of environmental factors and their relative importance for prediction of SOC stocks. Regression kriging produced lower prediction errors in comparison to multinarrative adaptive regression spline and support vector machine, and comparable prediction accuracy to gradient boosting machine and random forest. However, the ensemble median prediction of SOC stocks obtained from all four machine learning techniques showed highest prediction accuracy. Although the use of different approaches in spatial prediction of soil properties will depend on the availability of soil and environmental datasets and computational resources, we conclude that the ensemble median prediction obtained from multiple machine learning approaches provides greater spatial details and produces the highest prediction accuracy. Thus an ensemble prediction approach can be a better choice than any single prediction technique for predicting the spatial variation of SOC ... |
author2 |
Argonne National Laboratory |
format |
Article in Journal/Newspaper |
author |
Mishra, Umakant Gautam, Sagar Riley, William J. Hoffman, Forrest M. |
spellingShingle |
Mishra, Umakant Gautam, Sagar Riley, William J. Hoffman, Forrest M. Ensemble Machine Learning Approach Improves Predicted Spatial Variation of Surface Soil Organic Carbon Stocks in Data-Limited Northern Circumpolar Region |
author_facet |
Mishra, Umakant Gautam, Sagar Riley, William J. Hoffman, Forrest M. |
author_sort |
Mishra, Umakant |
title |
Ensemble Machine Learning Approach Improves Predicted Spatial Variation of Surface Soil Organic Carbon Stocks in Data-Limited Northern Circumpolar Region |
title_short |
Ensemble Machine Learning Approach Improves Predicted Spatial Variation of Surface Soil Organic Carbon Stocks in Data-Limited Northern Circumpolar Region |
title_full |
Ensemble Machine Learning Approach Improves Predicted Spatial Variation of Surface Soil Organic Carbon Stocks in Data-Limited Northern Circumpolar Region |
title_fullStr |
Ensemble Machine Learning Approach Improves Predicted Spatial Variation of Surface Soil Organic Carbon Stocks in Data-Limited Northern Circumpolar Region |
title_full_unstemmed |
Ensemble Machine Learning Approach Improves Predicted Spatial Variation of Surface Soil Organic Carbon Stocks in Data-Limited Northern Circumpolar Region |
title_sort |
ensemble machine learning approach improves predicted spatial variation of surface soil organic carbon stocks in data-limited northern circumpolar region |
publisher |
Frontiers Media SA |
publishDate |
2020 |
url |
http://dx.doi.org/10.3389/fdata.2020.528441 https://www.frontiersin.org/articles/10.3389/fdata.2020.528441/full |
genre |
permafrost |
genre_facet |
permafrost |
op_source |
Frontiers in Big Data volume 3 ISSN 2624-909X |
op_rights |
https://creativecommons.org/licenses/by/4.0/ |
op_doi |
https://doi.org/10.3389/fdata.2020.528441 |
container_title |
Frontiers in Big Data |
container_volume |
3 |
_version_ |
1809935271294140416 |