Identifying appropriate spatial scales of predictors in species distribution models with the random forest algorithm

Summary Including predictors in species distribution models at inappropriate spatial scales can decrease the variance explained, add residual spatial autocorrelation ( RSA ) and lead to the wrong conclusions. Some studies have measured predictors within different buffer sizes (scales) around sample...

Full description

Bibliographic Details
Published in:Methods in Ecology and Evolution
Main Authors: Bradter, Ute, Kunin, William E., Altringham, John D., Thom, Tim J., Benton, Tim G.
Other Authors: Peres‐Neto, Pedro
Format: Article in Journal/Newspaper
Language:English
Published: Wiley 2012
Subjects:
Online Access:http://dx.doi.org/10.1111/j.2041-210x.2012.00253.x
https://api.wiley.com/onlinelibrary/tdm/v1/articles/10.1111%2Fj.2041-210x.2012.00253.x
https://besjournals.onlinelibrary.wiley.com/doi/pdf/10.1111/j.2041-210x.2012.00253.x
id crwiley:10.1111/j.2041-210x.2012.00253.x
record_format openpolar
spelling crwiley:10.1111/j.2041-210x.2012.00253.x 2024-09-15T18:26:53+00:00 Identifying appropriate spatial scales of predictors in species distribution models with the random forest algorithm Bradter, Ute Kunin, William E. Altringham, John D. Thom, Tim J. Benton, Tim G. Peres‐Neto, Pedro 2012 http://dx.doi.org/10.1111/j.2041-210x.2012.00253.x https://api.wiley.com/onlinelibrary/tdm/v1/articles/10.1111%2Fj.2041-210x.2012.00253.x https://besjournals.onlinelibrary.wiley.com/doi/pdf/10.1111/j.2041-210x.2012.00253.x en eng Wiley http://onlinelibrary.wiley.com/termsAndConditions#vor Methods in Ecology and Evolution volume 4, issue 2, page 167-174 ISSN 2041-210X 2041-210X journal-article 2012 crwiley https://doi.org/10.1111/j.2041-210x.2012.00253.x 2024-09-05T05:04:49Z Summary Including predictors in species distribution models at inappropriate spatial scales can decrease the variance explained, add residual spatial autocorrelation ( RSA ) and lead to the wrong conclusions. Some studies have measured predictors within different buffer sizes (scales) around sample locations, regressed each predictor against the response at each scale and selected the scale with the best model fit as the appropriate scale for this predictor. However, a predictor can influence a species at several scales or show several scales with good model fit due to a bias caused by RSA . This makes the evaluation of all scales with good model fit necessary. With potentially several scales per predictor and multiple predictors to evaluate, the number of predictors can be large relative to the number of data points, potentially impeding variable selection with traditional statistical techniques, such as logistic regression. We trialled a variable selection process using the random forest algorithm, which allows the simultaneous evaluation of several scales of multiple predictors. Using simulated responses, we compared the performance of models resulting from this approach with models using the known predictors at arbitrary and at the known spatial scales. We also apply the proposed approach to a real data set of curlew ( Numenius arquata ). AIC , AUC and Naglekerke's pseudo R 2 of the models resulting from the proposed variable selection were often very similar to the models with the known predictors at known spatial scales. Only two of nine models required the addition of spatial eigenvectors to account for RSA . Arbitrary scale models always required the addition of spatial eigenvectors. 75% (50–100%) of the known predictors were selected at scales similar to the known scale (within 3 km). In the curlew model, predictors at large, medium and small spatial scales were selected, suggesting that for appropriate landscape‐scale models multiple scales need to be evaluated. The proposed approach selected several ... Article in Journal/Newspaper Numenius arquata Wiley Online Library Methods in Ecology and Evolution 4 2 167 174
institution Open Polar
collection Wiley Online Library
op_collection_id crwiley
language English
description Summary Including predictors in species distribution models at inappropriate spatial scales can decrease the variance explained, add residual spatial autocorrelation ( RSA ) and lead to the wrong conclusions. Some studies have measured predictors within different buffer sizes (scales) around sample locations, regressed each predictor against the response at each scale and selected the scale with the best model fit as the appropriate scale for this predictor. However, a predictor can influence a species at several scales or show several scales with good model fit due to a bias caused by RSA . This makes the evaluation of all scales with good model fit necessary. With potentially several scales per predictor and multiple predictors to evaluate, the number of predictors can be large relative to the number of data points, potentially impeding variable selection with traditional statistical techniques, such as logistic regression. We trialled a variable selection process using the random forest algorithm, which allows the simultaneous evaluation of several scales of multiple predictors. Using simulated responses, we compared the performance of models resulting from this approach with models using the known predictors at arbitrary and at the known spatial scales. We also apply the proposed approach to a real data set of curlew ( Numenius arquata ). AIC , AUC and Naglekerke's pseudo R 2 of the models resulting from the proposed variable selection were often very similar to the models with the known predictors at known spatial scales. Only two of nine models required the addition of spatial eigenvectors to account for RSA . Arbitrary scale models always required the addition of spatial eigenvectors. 75% (50–100%) of the known predictors were selected at scales similar to the known scale (within 3 km). In the curlew model, predictors at large, medium and small spatial scales were selected, suggesting that for appropriate landscape‐scale models multiple scales need to be evaluated. The proposed approach selected several ...
author2 Peres‐Neto, Pedro
format Article in Journal/Newspaper
author Bradter, Ute
Kunin, William E.
Altringham, John D.
Thom, Tim J.
Benton, Tim G.
spellingShingle Bradter, Ute
Kunin, William E.
Altringham, John D.
Thom, Tim J.
Benton, Tim G.
Identifying appropriate spatial scales of predictors in species distribution models with the random forest algorithm
author_facet Bradter, Ute
Kunin, William E.
Altringham, John D.
Thom, Tim J.
Benton, Tim G.
author_sort Bradter, Ute
title Identifying appropriate spatial scales of predictors in species distribution models with the random forest algorithm
title_short Identifying appropriate spatial scales of predictors in species distribution models with the random forest algorithm
title_full Identifying appropriate spatial scales of predictors in species distribution models with the random forest algorithm
title_fullStr Identifying appropriate spatial scales of predictors in species distribution models with the random forest algorithm
title_full_unstemmed Identifying appropriate spatial scales of predictors in species distribution models with the random forest algorithm
title_sort identifying appropriate spatial scales of predictors in species distribution models with the random forest algorithm
publisher Wiley
publishDate 2012
url http://dx.doi.org/10.1111/j.2041-210x.2012.00253.x
https://api.wiley.com/onlinelibrary/tdm/v1/articles/10.1111%2Fj.2041-210x.2012.00253.x
https://besjournals.onlinelibrary.wiley.com/doi/pdf/10.1111/j.2041-210x.2012.00253.x
genre Numenius arquata
genre_facet Numenius arquata
op_source Methods in Ecology and Evolution
volume 4, issue 2, page 167-174
ISSN 2041-210X 2041-210X
op_rights http://onlinelibrary.wiley.com/termsAndConditions#vor
op_doi https://doi.org/10.1111/j.2041-210x.2012.00253.x
container_title Methods in Ecology and Evolution
container_volume 4
container_issue 2
container_start_page 167
op_container_end_page 174
_version_ 1810467512212520960