Confronting preferential sampling when analysing population distributions: diagnosis and model‐based triage

Bibliographic Details
Published in: Methods in Ecology and Evolution
Main Authors: Conn, Paul B., Thorson, James T., Johnson, Devin S.
Other Authors: Yoccoz, Nigel, National Oceanic and Atmospheric Administration
Format: Article in Journal/Newspaper
Language: English
Published: Wiley 2017
Online Access: http://dx.doi.org/10.1111/2041-210x.12803
https://api.wiley.com/onlinelibrary/tdm/v1/articles/10.1111%2F2041-210X.12803
https://onlinelibrary.wiley.com/doi/pdf/10.1111/2041-210X.12803
https://onlinelibrary.wiley.com/doi/full-xml/10.1111/2041-210X.12803
https://besjournals.onlinelibrary.wiley.com/doi/pdf/10.1111/2041-210X.12803
Description
Summary: Population surveys are often used to estimate the density, abundance, or distribution of natural populations. Recently, model‐based approaches to analyzing survey data have become popular because one can more readily accommodate departures from pre‐planned survey routes and construct more detailed maps than one can with design‐based procedures. Spatial models for population distributions (SMPDs) often make the implicit assumption that locations chosen for sampling and animal abundance at those locations are conditionally independent given modelled covariates. However, this assumption may be violated when survey effort is non‐randomized, leading to preferential sampling. We develop a hierarchical statistical modelling framework for detecting and alleviating the biasing effects of preferential sampling in spatial distribution models fitted to count data. The approach works by specifying a joint model for population density and the locations selected for sampling, and specifying a dependent correlation structure between the two processes. Using simulation, we show that moderate levels of preferential sampling can lead to large (e.g. 40%) bias in estimates of animal density and that our modelling approach can considerably reduce this bias. In contrast, preferential sampling did not appear to bias inferences about parameters informing species–habitat relationships (i.e. slope parameters). We apply our approach to aerial survey counts of bearded seals (Erignathus barbatus) in the eastern Bering Sea. As expected, models with a preferential sampling effect led to lower abundance than those without. However, several lines of reasoning (better predictive performance, higher biological realism) led us to prefer models without a preferential sampling effect for this dataset.
When population surveys break from traditional scientific survey design principles, ecologists should recognize the potentially biasing effects of preferential sampling when estimating population density or occurrence. Joint models, such as ...
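
The biasing mechanism described in the summary can be illustrated with a toy simulation. The sketch below is not the authors' hierarchical joint model: it has no spatial structure, covariates, or count observation process, and the function name `relative_bias` and all of its parameters are hypothetical choices made for illustration. It only shows that when the chance of sampling a cell rises with the latent density in that cell, the naive sample mean tends to overestimate average density.

```python
import math
import random


def relative_bias(n_cells=5000, n_samples=500, pref=1.0, sigma=0.5, seed=1):
    """Relative bias of the naive mean-density estimate in a toy landscape.

    pref controls preferential sampling: 0 gives uniform random sampling,
    positive values favour high-density cells. All names are illustrative.
    """
    rng = random.Random(seed)
    # Latent log-density per cell: independent normal draws (an assumed
    # simplification; real landscapes are spatially autocorrelated).
    log_density = [rng.gauss(0.0, sigma) for _ in range(n_cells)]
    density = [math.exp(z) for z in log_density]
    # Sampling weight of each cell depends on the same latent process.
    weights = [math.exp(pref * z) for z in log_density]
    # Draw surveyed cells (with replacement, for simplicity).
    sampled = rng.choices(range(n_cells), weights=weights, k=n_samples)
    naive_mean = sum(density[i] for i in sampled) / n_samples
    true_mean = sum(density) / n_cells
    return (naive_mean - true_mean) / true_mean


if __name__ == "__main__":
    print("random sampling:      ", round(relative_bias(pref=0.0), 3))
    print("preferential sampling:", round(relative_bias(pref=1.0), 3))
```

Setting `pref=0.0` should give a relative bias near zero, while `pref=1.0` should give a clearly positive one. The size of the bias in this sketch depends entirely on the assumed `sigma` and `pref` and is not comparable to the simulation results reported in the article.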