Infodemiological data of high-school drop-out related web searches in Canada correlating with real-world statistical data in the period 2004–2012

This data article describes high-school drop-out related web activity in Canada from 2004 to 2012, obtained by mining Google Trends (GT) with "high-school drop-out" as the keyword. The search volumes were processed, correlated and cross-correlated with statistical data obtained at national and...


Bibliographic Details
Published in: Data in Brief
Main Authors: Anna Siri, Hicham Khabbache, Ali Al-Jafar, Mariano Martini, Francesco Brigo, Nicola Luigi Bragazzi
Format: Article in Journal/Newspaper
Language: English
Published: Elsevier 2016
Subjects:
Online Access: https://doi.org/10.1016/j.dib.2016.09.032
https://doaj.org/article/5d81fdab321c4a05aa02226400c191db
id ftdoajarticles:oai:doaj.org/article:5d81fdab321c4a05aa02226400c191db
record_format openpolar
spelling ftdoajarticles:oai:doaj.org/article:5d81fdab321c4a05aa02226400c191db 2023-05-15T17:22:45+02:00 Infodemiological data of high-school drop-out related web searches in Canada correlating with real-world statistical data in the period 2004–2012 Anna Siri Hicham Khabbache Ali Al-Jafar Mariano Martini Francesco Brigo Nicola Luigi Bragazzi 2016-12-01T00:00:00Z https://doi.org/10.1016/j.dib.2016.09.032 https://doaj.org/article/5d81fdab321c4a05aa02226400c191db EN eng Elsevier http://www.sciencedirect.com/science/article/pii/S2352340916306047 https://doaj.org/toc/2352-3409 2352-3409 doi:10.1016/j.dib.2016.09.032 https://doaj.org/article/5d81fdab321c4a05aa02226400c191db Data in Brief, Vol 9, Iss C, Pp 679-684 (2016) High-school drop-out Google Trends Computer applications to medicine. Medical informatics R858-859.7 Science (General) Q1-390 article 2016 ftdoajarticles https://doi.org/10.1016/j.dib.2016.09.032 2022-12-31T12:48:05Z This data article describes high-school drop-out related web activity in Canada from 2004 to 2012, obtained by mining Google Trends (GT) with "high-school drop-out" as the keyword. The search volumes were processed, correlated and cross-correlated with statistical data obtained at the national and provincial levels and broken down by gender. Further, an autoregressive moving-average (ARMA) model was used to model the GT-generated data. Qualitatively, GT-generated relative search volumes (RSVs) reflect the decrease in the drop-out rate. Internet-related activity peaks in 2004 (56.35%, normalized value) and gradually declines to 40.59% (normalized value) in 2007. Afterwards, it remains substantially stable until 2012 (40.32%, normalized value). Quantitatively, the correlations between the Canadian high-school drop-out rate and GT-generated RSVs in the study period (2004–2012) were statistically significant using both the drop-out rate per academic year and the 3-year moving average. When the data were broken down by gender, the correlations were higher and statistically significant in males compared with females. GT-based data for drop-out were best modeled by an ARMA(1,0) model. In the cross-correlation analysis of Canadian regions, all regions were statistically significant at lag 0, except New Brunswick, Newfoundland and Labrador, and Prince Edward Island. A number of cross-correlations were also statistically significant at lag −1 (namely, Alberta, Manitoba, New Brunswick and Saskatchewan). Article in Journal/Newspaper Newfoundland Prince Edward Island Directory of Open Access Journals: DOAJ Articles Newfoundland Canada Data in Brief 9 679 684
institution Open Polar
collection Directory of Open Access Journals: DOAJ Articles
op_collection_id ftdoajarticles
language English
topic High-school drop-out
Google Trends
Computer applications to medicine. Medical informatics
R858-859.7
Science (General)
Q1-390
spellingShingle High-school drop-out
Google Trends
Computer applications to medicine. Medical informatics
R858-859.7
Science (General)
Q1-390
Anna Siri
Hicham Khabbache
Ali Al-Jafar
Mariano Martini
Francesco Brigo
Nicola Luigi Bragazzi
Infodemiological data of high-school drop-out related web searches in Canada correlating with real-world statistical data in the period 2004–2012
topic_facet High-school drop-out
Google Trends
Computer applications to medicine. Medical informatics
R858-859.7
Science (General)
Q1-390
description This data article describes high-school drop-out related web activity in Canada from 2004 to 2012, obtained by mining Google Trends (GT) with "high-school drop-out" as the keyword. The search volumes were processed, correlated and cross-correlated with statistical data obtained at the national and provincial levels and broken down by gender. Further, an autoregressive moving-average (ARMA) model was used to model the GT-generated data. Qualitatively, GT-generated relative search volumes (RSVs) reflect the decrease in the drop-out rate. Internet-related activity peaks in 2004 (56.35%, normalized value) and gradually declines to 40.59% (normalized value) in 2007. Afterwards, it remains substantially stable until 2012 (40.32%, normalized value). Quantitatively, the correlations between the Canadian high-school drop-out rate and GT-generated RSVs in the study period (2004–2012) were statistically significant using both the drop-out rate per academic year and the 3-year moving average. When the data were broken down by gender, the correlations were higher and statistically significant in males compared with females. GT-based data for drop-out were best modeled by an ARMA(1,0) model. In the cross-correlation analysis of Canadian regions, all regions were statistically significant at lag 0, except New Brunswick, Newfoundland and Labrador, and Prince Edward Island. A number of cross-correlations were also statistically significant at lag −1 (namely, Alberta, Manitoba, New Brunswick and Saskatchewan).
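The description names four operations: a 3-year moving average, correlation, lagged cross-correlation, and an ARMA(1,0) fit (equivalent to an AR(1) model). The sketch below illustrates each in plain Python. It is not the authors' code. The function names are hypothetical, the two series are toy placeholder numbers rather than values from the dataset, and the lag sign convention and the Yule-Walker-style estimate of the AR(1) coefficient are assumptions, since the record does not say which conventions or software the authors used.

```python
from math import sqrt


def moving_average(values, window=3):
    """Average over consecutive windows; returns len(values) - window + 1 points."""
    return [sum(values[i:i + window]) / window
            for i in range(len(values) - window + 1)]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def cross_correlation(x, y, lag):
    """Pearson correlation between x[t + lag] and y[t] on the overlapping part.

    Assumed convention: lag = -1 pairs x[t - 1] with y[t], i.e. x leads y
    by one step. This is a simplification of the usual sample CCF, which
    normalizes with full-series means and variances.
    """
    if lag >= 0:
        xs, ys = x[lag:], y[:len(y) - lag]
    else:
        xs, ys = x[:lag], y[-lag:]
    return pearson(xs, ys)


def fit_ar1(series):
    """Estimate (mean, phi) for x[t] - mean = phi * (x[t-1] - mean) + noise.

    phi is the lag-1 sample autocorrelation (a Yule-Walker-style estimate).
    """
    n = len(series)
    mean = sum(series) / n
    dev = [v - mean for v in series]
    num = sum(dev[t] * dev[t - 1] for t in range(1, n))
    den = sum(d * d for d in dev)
    return mean, num / den


if __name__ == "__main__":
    # Toy placeholder series for nine years; NOT values from the article.
    rsv_toy = [10.0, 8.0, 7.0, 5.0, 5.0, 4.0, 4.0, 4.0, 4.0]
    rate_toy = [9.0, 9.0, 8.0, 7.0, 6.0, 5.0, 5.0, 4.0, 4.0]

    print("lag 0 correlation:", pearson(rsv_toy, rate_toy))
    print("lag -1 cross-correlation:", cross_correlation(rsv_toy, rate_toy, -1))
    # Moving average shortens both series, so they are compared after trimming.
    print("3-year MA correlation:",
          pearson(moving_average(rsv_toy), moving_average(rate_toy)))
    print("AR(1) mean and phi:", fit_ar1(rsv_toy))
```

Significance testing is left out here. With only nine annual points, any p-value depends strongly on the test chosen, and the record does not name the test the authors applied.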
format Article in Journal/Newspaper
author Anna Siri
Hicham Khabbache
Ali Al-Jafar
Mariano Martini
Francesco Brigo
Nicola Luigi Bragazzi
author_facet Anna Siri
Hicham Khabbache
Ali Al-Jafar
Mariano Martini
Francesco Brigo
Nicola Luigi Bragazzi
author_sort Anna Siri
title Infodemiological data of high-school drop-out related web searches in Canada correlating with real-world statistical data in the period 2004–2012
title_short Infodemiological data of high-school drop-out related web searches in Canada correlating with real-world statistical data in the period 2004–2012
title_full Infodemiological data of high-school drop-out related web searches in Canada correlating with real-world statistical data in the period 2004–2012
title_fullStr Infodemiological data of high-school drop-out related web searches in Canada correlating with real-world statistical data in the period 2004–2012
title_full_unstemmed Infodemiological data of high-school drop-out related web searches in Canada correlating with real-world statistical data in the period 2004–2012
title_sort infodemiological data of high-school drop-out related web searches in canada correlating with real-world statistical data in the period 2004–2012
publisher Elsevier
publishDate 2016
url https://doi.org/10.1016/j.dib.2016.09.032
https://doaj.org/article/5d81fdab321c4a05aa02226400c191db
geographic Newfoundland
Canada
geographic_facet Newfoundland
Canada
genre Newfoundland
Prince Edward Island
genre_facet Newfoundland
Prince Edward Island
op_source Data in Brief, Vol 9, Iss C, Pp 679-684 (2016)
op_relation http://www.sciencedirect.com/science/article/pii/S2352340916306047
https://doaj.org/toc/2352-3409
2352-3409
doi:10.1016/j.dib.2016.09.032
https://doaj.org/article/5d81fdab321c4a05aa02226400c191db
op_doi https://doi.org/10.1016/j.dib.2016.09.032
container_title Data in Brief
container_volume 9
container_start_page 679
op_container_end_page 684
_version_ 1766109578629480448