Data from: Horses in the Cloud: big data exploration and mining of fossil and extant Equus (Mammalia: Equidae)

Extant species of the genus Equus (e.g., horses, asses, and zebras) have a widespread distribution today on all continents except Antarctica. Extinct species of Equus represented by fossils were likewise widely distributed in the Pliocene and even more so during the Pleistocene. In order to understa...

Full description

Bibliographic Details
Main Authors: MacFadden, Bruce J., Guralnick, Robert P.
Format: Article in Journal/Newspaper
Language:unknown
Published: 2016
Subjects:
Online Access:http://hdl.handle.net/10255/dryad.126118
https://doi.org/10.5061/dryad.qc2fm
id ftdryad:oai:v1.datadryad.org:10255/dryad.126118
record_format openpolar
spelling ftdryad:oai:v1.datadryad.org:10255/dryad.126118 2023-05-15T13:31:24+02:00 Data from: Horses in the Cloud: big data exploration and mining of fossil and extant Equus (Mammalia: Equidae) MacFadden, Bruce J. Guralnick, Robert P. Global Pleistocene Pliocene Quaternary Recent 2016-09-14T16:30:09Z http://hdl.handle.net/10255/dryad.126118 https://doi.org/10.5061/dryad.qc2fm unknown doi:10.5061/dryad.qc2fm/1 doi:10.5061/dryad.qc2fm/2 doi:10.1017/pab.2016.42 doi:10.5061/dryad.qc2fm MacFadden BJ, Guralnick RP (2017) Horses in the Cloud: big data exploration and mining of fossil and extant Equus (Mammalia: Equidae). Paleobiology 43(01): 1-14. 0094-8373 http://hdl.handle.net/10255/dryad.126118 Big Data biodiversity iDigBio GBIF Paleobiology Database Article 2016 ftdryad https://doi.org/10.5061/dryad.qc2fm https://doi.org/10.5061/dryad.qc2fm/1 https://doi.org/10.5061/dryad.qc2fm/2 https://doi.org/10.1017/pab.2016.42 2020-01-01T15:40:24Z Extant species of the genus Equus (e.g., horses, asses, and zebras) have a widespread distribution today on all continents except Antarctica. Extinct species of Equus represented by fossils were likewise widely distributed in the Pliocene and even more so during the Pleistocene. In order to understand the efficacy of “big data” for (paleo)biogeographic analyses, location records (latitude, longitude) and fossil occurrences for the genus Equus were mined and further explored from six databases, including iDigBio, Paleobiology Database, VertNet, BISON, Neotoma, and GBIF. These were chosen from a priori knowledge of where relevant data might be aggregated. We also realized that these databases have different objectives and data sources and therefore would provide a useful comparative study of the widespread taxon Equus in space and time. The mining of Equus data from these six sources yielded a combined total of 123.8 K location records, including 116.2K fossil specimens. These include individual points that are unique, that is, only occurring in one of these databases, and those that are duplicated in multiple databases. Of the six databases, three (iDigBio, Paleobiology Database, and GBIF) were judged to be the most useful in the Equus use case. Most of the databases are biased toward North American records, thus limiting the reconstruction of the actual distribution of the genus Equus in space and time outside of this continent. Although Equus has a large number of digitally accessible records, fundamentally interesting questions pertaining to evolutionary dynamics and extinction geography are still a challenge for these kinds of biodiversity databases due primarily to the lack of sufficiently dense and precise temporal data. Article in Journal/Newspaper Antarc* Antarctica Dryad Digital Repository (Duke University)
institution Open Polar
collection Dryad Digital Repository (Duke University)
op_collection_id ftdryad
language unknown
topic Big Data
biodiversity
iDigBio
GBIF
Paleobiology Database
spellingShingle Big Data
biodiversity
iDigBio
GBIF
Paleobiology Database
MacFadden, Bruce J.
Guralnick, Robert P.
Data from: Horses in the Cloud: big data exploration and mining of fossil and extant Equus (Mammalia: Equidae)
topic_facet Big Data
biodiversity
iDigBio
GBIF
Paleobiology Database
description Extant species of the genus Equus (e.g., horses, asses, and zebras) have a widespread distribution today on all continents except Antarctica. Extinct species of Equus represented by fossils were likewise widely distributed in the Pliocene and even more so during the Pleistocene. In order to understand the efficacy of “big data” for (paleo)biogeographic analyses, location records (latitude, longitude) and fossil occurrences for the genus Equus were mined and further explored from six databases, including iDigBio, Paleobiology Database, VertNet, BISON, Neotoma, and GBIF. These were chosen from a priori knowledge of where relevant data might be aggregated. We also realized that these databases have different objectives and data sources and therefore would provide a useful comparative study of the widespread taxon Equus in space and time. The mining of Equus data from these six sources yielded a combined total of 123.8 K location records, including 116.2K fossil specimens. These include individual points that are unique, that is, only occurring in one of these databases, and those that are duplicated in multiple databases. Of the six databases, three (iDigBio, Paleobiology Database, and GBIF) were judged to be the most useful in the Equus use case. Most of the databases are biased toward North American records, thus limiting the reconstruction of the actual distribution of the genus Equus in space and time outside of this continent. Although Equus has a large number of digitally accessible records, fundamentally interesting questions pertaining to evolutionary dynamics and extinction geography are still a challenge for these kinds of biodiversity databases due primarily to the lack of sufficiently dense and precise temporal data.
format Article in Journal/Newspaper
author MacFadden, Bruce J.
Guralnick, Robert P.
author_facet MacFadden, Bruce J.
Guralnick, Robert P.
author_sort MacFadden, Bruce J.
title Data from: Horses in the Cloud: big data exploration and mining of fossil and extant Equus (Mammalia: Equidae)
title_short Data from: Horses in the Cloud: big data exploration and mining of fossil and extant Equus (Mammalia: Equidae)
title_full Data from: Horses in the Cloud: big data exploration and mining of fossil and extant Equus (Mammalia: Equidae)
title_fullStr Data from: Horses in the Cloud: big data exploration and mining of fossil and extant Equus (Mammalia: Equidae)
title_full_unstemmed Data from: Horses in the Cloud: big data exploration and mining of fossil and extant Equus (Mammalia: Equidae)
title_sort data from: horses in the cloud: big data exploration and mining of fossil and extant equus (mammalia: equidae)
publishDate 2016
url http://hdl.handle.net/10255/dryad.126118
https://doi.org/10.5061/dryad.qc2fm
op_coverage Global
Pleistocene
Pliocene
Quaternary
Recent
genre Antarc*
Antarctica
genre_facet Antarc*
Antarctica
op_relation doi:10.5061/dryad.qc2fm/1
doi:10.5061/dryad.qc2fm/2
doi:10.1017/pab.2016.42
doi:10.5061/dryad.qc2fm
MacFadden BJ, Guralnick RP (2017) Horses in the Cloud: big data exploration and mining of fossil and extant Equus (Mammalia: Equidae). Paleobiology 43(01): 1-14.
0094-8373
http://hdl.handle.net/10255/dryad.126118
op_doi https://doi.org/10.5061/dryad.qc2fm
https://doi.org/10.5061/dryad.qc2fm/1
https://doi.org/10.5061/dryad.qc2fm/2
https://doi.org/10.1017/pab.2016.42
_version_ 1766017874340610048