Overcoming the pitfalls of categorizing continuous variables in ecology, evolution, and behavior ...
Many variables in biological research - from body size to life history timing to environmental characteristics - are measured continuously (e.g., body mass in kilograms) but analyzed as categories (e.g., large versus small), which can lower statistical power and change interpretation. We conducted a...
Main Authors: | Beltran, Roxanne, Tarwater, Corey |
---|---|
Format: | Dataset |
Language: | English |
Published: | Dryad 2023 |
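Subjects: | FOS Biological sciences, Binning, Statistics, FOS Mathematics, Behavior, Breakpoint, Threshhold, Category |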
Online Access: | https://dx.doi.org/10.5061/dryad.5x69p8d9r https://datadryad.org/stash/dataset/doi:10.5061/dryad.5x69p8d9r |
id |
ftdatacite:10.5061/dryad.5x69p8d9r |
---|---|
record_format |
openpolar |
institution |
Open Polar |
collection |
DataCite |
op_collection_id |
ftdatacite |
language |
English |
topic |
FOS Biological sciences Binning Statistics FOS Mathematics Behavior Breakpoint Threshhold Category |
topic_facet |
FOS Biological sciences Binning Statistics FOS Mathematics Behavior Breakpoint Threshhold Category |
description |
Many variables in biological research - from body size to life history timing to environmental characteristics - are measured continuously (e.g., body mass in kilograms) but analyzed as categories (e.g., large versus small), which can lower statistical power and change interpretation. We conducted a mini-review of 72 recent publications in six popular ecology, evolution, and behavior journals to quantify the prevalence of categorization. We then summarized commonly categorized metrics and simulated a dataset to demonstrate the drawbacks of categorization using common variables and realistic examples. We show that categorizing continuous variables is common (31% of publications reviewed). We also underscore that predictor variables can and should be collected and analyzed continuously. Finally, we provide recommendations on how to keep variables continuous throughout the entire scientific process. Together, these pieces comprise an actionable guide to increasing statistical power and facilitating large ...
# Overcoming the pitfalls of categorizing continuous variables in ecology and evolutionary biology
https://doi.org/10.5061/dryad.5x69p8d9r
We simulated data to quantify the detrimental impact of categorizing continuous variables using various statistical breakpoints and sample sizes (details below). To give the example biological relevance, we created a dataset that illustrates the complexity of life history theory and climate change impacts, and contains a predictor variable that is frequently categorized (Table 2) - reproductive timing in one year and its effect on body size in the following year. A reasonable research question would be: How does timing of reproduction in year t influence body mass at the start of the breeding season in year t+1? For illustrative purposes, let’s say we collected data from individually banded penguins in Antarctica. Based on the mechanistic relationships between seasonally available sea ice and food availability, we hypothesize ... |
format |
Dataset |
author |
Beltran, Roxanne Tarwater, Corey |
author_facet |
Beltran, Roxanne Tarwater, Corey |
author_sort |
Beltran, Roxanne |
title |
Overcoming the pitfalls of categorizing continuous variables in ecology, evolution, and behavior ... |
title_sort |
overcoming the pitfalls of categorizing continuous variables in ecology, evolution, and behavior ... |
publisher |
Dryad |
publishDate |
2023 |
url |
https://dx.doi.org/10.5061/dryad.5x69p8d9r https://datadryad.org/stash/dataset/doi:10.5061/dryad.5x69p8d9r |
genre |
Antarc* Antarctica Sea ice |
genre_facet |
Antarc* Antarctica Sea ice |
op_relation |
https://dx.doi.org/10.5281/zenodo.12690514 |
op_rights |
Creative Commons Zero v1.0 Universal https://creativecommons.org/publicdomain/zero/1.0/legalcode cc0-1.0 |
op_doi |
https://doi.org/10.5061/dryad.5x69p8d9r https://doi.org/10.5281/zenodo.12690514 |
_version_ |
1810495998154244096 |
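
Illustrative sketch (not part of the deposited dataset or the authors' code): the description field above says data were simulated to quantify the cost of categorizing a continuous predictor across breakpoints and sample sizes. The following minimal Python example shows one way such a comparison could be set up. Everything in it is an assumption made for illustration: the function names, the standardized "timing" and "mass" variables, the linear effect with slope 0.4, the unit-variance noise, and the normal approximation to the critical value. It estimates how often a true effect is detected when the predictor is kept continuous versus split into two groups at a chosen quantile.

```python
import math
import random
import statistics


def pearson_r(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant variable carries no information
    return sxy / math.sqrt(sxx * syy)


def is_significant(x, y, z_crit=1.96):
    """Approximate two-sided test of zero slope at alpha of about 0.05.

    Uses t = r * sqrt((n - 2) / (1 - r^2)) with a normal critical value,
    which is only an approximation for small samples.
    """
    n = len(x)
    r = pearson_r(x, y)
    if abs(r) >= 1.0:
        return True
    t = r * math.sqrt((n - 2) / (1 - r * r))
    return abs(t) > z_crit


def estimate_power(n, slope, breakpoint_quantile, n_sims=500, seed=1):
    """Return (power_continuous, power_categorized) from repeated simulation.

    Hypothetical setup: standardized reproductive timing in year t predicts
    standardized body mass in year t+1 with the given slope plus noise.
    """
    rng = random.Random(seed)
    hits_cont = hits_cat = 0
    for _ in range(n_sims):
        timing = [rng.gauss(0.0, 1.0) for _ in range(n)]
        mass = [slope * t + rng.gauss(0.0, 1.0) for t in timing]
        # Dichotomize timing into "early" (0) and "late" (1) at the breakpoint.
        cut = sorted(timing)[int(breakpoint_quantile * (n - 1))]
        late = [1.0 if t > cut else 0.0 for t in timing]
        hits_cont += is_significant(timing, mass)
        hits_cat += is_significant(late, mass)
    return hits_cont / n_sims, hits_cat / n_sims


if __name__ == "__main__":
    for n in (20, 40, 80):
        for q in (0.5, 0.9):
            cont, cat = estimate_power(n, slope=0.4, breakpoint_quantile=q)
            print(f"n={n} breakpoint={q}: continuous={cont:.2f} categorized={cat:.2f}")
```

Regressing on the two-level "late" variable is equivalent to a pooled two-sample t-test, so the same test function serves both analyses. Under these assumptions the continuous analysis should detect the effect more often than the categorized one, and more so when the breakpoint is far from the median; the actual magnitudes depend entirely on the assumed slope, noise, and sample size.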