Improving the Estimation of the Atmospheric Water Vapor Pressure Using Interpretable Long Short-Term Memory Networks: Dataset, Python code, and trained models

Atmospheric water vapor pressure is an essential meteorological control on land surface and hydrologic processes. It is not as frequently observed as other meteorologic conditions, but often inferred through the August–Roche–Magnus formula by simply assuming dew point and daily minimum temperatures...

Full description

Bibliographic Details
Main Authors: Bo Gao, Ethan Coon, Peter Thornton, Dan Lu
Format: Dataset
Language:unknown
Published: ESS-DIVE: Deep Insight for Earth Science Data 2023
Subjects:
Online Access:https://search.dataone.org/view/ess-dive-993a2817d456629-20231122T152659004
id dataone:ess-dive-993a2817d456629-20231122T152659004
record_format openpolar
institution Open Polar
collection ESS-DIVE: Deep Insight for Earth Science Data (via DataONE)
op_collection_id dataone:urn:node:ESS_DIVE
language unknown
topic Atmospheric humidity
Atmospheric water vapor pressure
Long Short-term Memory
Static attributes
Interpretable deep learning
spellingShingle Atmospheric humidity
Atmospheric water vapor pressure
Long Short-term Memory
Static attributes
Interpretable deep learning
Bo Gao
Ethan Coon
Peter Thornton
Dan Lu
Improving the Estimation of the Atmospheric Water Vapor Pressure Using Interpretable Long Short-Term Memory Networks: Dataset, Python code, and trained models
topic_facet Atmospheric humidity
Atmospheric water vapor pressure
Long Short-term Memory
Static attributes
Interpretable deep learning
description Atmospheric water vapor pressure is an essential meteorological control on land surface and hydrologic processes. It is not as frequently observed as other meteorologic conditions, but often inferred through the August–Roche–Magnus formula by simply assuming dew point and daily minimum temperatures are equivalent or by empirically correlating the two temperatures using an aridity correction. The performance of both methods varies considerably across different regions and during different time periods; obtaining consistently accurate estimates across space and time remains a great challenge. We applied an interpretable Long Short-Term Memory (iLSTM) network conditioned on static, location specific attributes to estimate daily vapor pressure for 83 FLUXNET sites in the United States and Canada. This data package includes all raw data of the 83 FLUXNET sites, input data for model training/validation/test, trained models and results, and python codes for the manuscript "Improving the Estimation of the Atmospheric Water Vapor Pressure Using an Interpretable Long Short-term Memory Network". Specifically, it consists of five parts. - First, "1_Daymet_data_83sites.zip" includes raw data downloaded from Daymet for the 83 sites used in the paper according to their longitude and latitude, in which vapor pressure is used. It also includes a pre-processed CSV data file combining all data from the 83 sites which is specifically used for the paper. - Second, "2_Fluxnet2015_data_83sites.zip" includes raw half hourly data of the 83 sites downloaded from FLUXNET2015 data portal, pre-processed daily data of the 83 sites, a CSV file including combined pre-processed daily data of the 83 sites, and a CSV file including the information (site ID, site name, latitude, longitude, data available period) of the 83 sites. - Third, "3_MODIS_LAI_data_83sites_raw.zip" includes raw leaf area index (LAI) data downloaded from the AppEEARs data portal. - Fourth, "4_Scripts.zip" includes all scripts related to model training and post-processing of a trained model, and a jupyter notebook showing an example for model post-processing. - Finally, "Trained_models_and_results.zip" includes three folders and three files with suffix ".npy", and each folder corresponds to one file with suffix ".npy" with the same title. Each of the three folders include all trained models associated with one iLSTM model configuration (35 models for each configuration, details are described in the paper). Each file with suffix ".npy" includes the post-processed results of the corresponding 35 models under one iLSTM model configuration.
format Dataset
author Bo Gao
Ethan Coon
Peter Thornton
Dan Lu
author_facet Bo Gao
Ethan Coon
Peter Thornton
Dan Lu
author_sort Bo Gao
title Improving the Estimation of the Atmospheric Water Vapor Pressure Using Interpretable Long Short-Term Memory Networks: Dataset, Python code, and trained models
title_short Improving the Estimation of the Atmospheric Water Vapor Pressure Using Interpretable Long Short-Term Memory Networks: Dataset, Python code, and trained models
title_full Improving the Estimation of the Atmospheric Water Vapor Pressure Using Interpretable Long Short-Term Memory Networks: Dataset, Python code, and trained models
title_fullStr Improving the Estimation of the Atmospheric Water Vapor Pressure Using Interpretable Long Short-Term Memory Networks: Dataset, Python code, and trained models
title_full_unstemmed Improving the Estimation of the Atmospheric Water Vapor Pressure Using Interpretable Long Short-Term Memory Networks: Dataset, Python code, and trained models
title_sort improving the estimation of the atmospheric water vapor pressure using interpretable long short-term memory networks: dataset, python code, and trained models
publisher ESS-DIVE: Deep Insight for Earth Science Data
publishDate 2023
url https://search.dataone.org/view/ess-dive-993a2817d456629-20231122T152659004
op_coverage CA-Gro, Ontario - Groundhog River, Boreal Mixedwood Forest, Groundhog River is situated in a typical boreal mixedwood forest in northeastern Ontario (https://doi.org/10.18140/FLX/1440034).
CA-NS1, UCI-1850 burn site, The UCI-1850 site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440036).
CA-NS2, UCI-1930 burn site, The UCI-1930 site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440037).
CA-NS3, UCI-1964 burn site, The UCI-1964 site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440038).
CA-NS4, UCI-1964 burn site wet, The UCI-1964 wet site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440039).
CA-NS5, UCI-1981 burn site, The UCI-1981 site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440040).
CA-NS6, UCI-1989 burn site, The UCI-1989 site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440041).
CA-NS7, UCI-1998 burn site, The UCI-1998 site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440042).
CA-Oas, Saskatchewan - Western Boreal, Mature Aspen, BOREAS 1994, 1996, BERMS climate and flux measurements began Dec. 1996 (https://doi.org/10.18140/FLX/1440043).
CA-Obs, Saskatchewan - Western Boreal, Mature Black Spruce, BOREAS 1994, 1996, BERMS climate measurements began Dec. 1996 and flux measurements in Apr. 1999 (https://doi.org/10.18140/FLX/1440044).
CA-Qfo, Quebec - Eastern Boreal, Mature Black Spruce, 90 - 100 yr old Black Spruce, Jack Pine, feather moss (https://doi.org/10.18140/FLX/1440045).
CA-SF1, Saskatchewan - Western Boreal, forest burned in 1977, Regenerated jack pine (Pinus banksiana) following fire in 1977 (https://doi.org/10.18140/FLX/1440046).
CA-SF2, Saskatchewan - Western Boreal, forest burned in 1989, The 1989 burn site (F89) was northeast of Prince Albert National Park, Saskatchewan (https://doi.org/10.18140/FLX/1440047).
CA-SF3, Saskatchewan - Western Boreal, forest burned in 1998, The 1998 burn site (F98) was in the east part of Prince Albert National Park, Saskatchewan (https://doi.org/10.18140/FLX/1440048).
CA-TP1, Ontario - Turkey Point 2002 Plantation White Pine, Plantation established in 2002 on a former sandy agricultural field, which was abandoned three years prior to planting (https://doi.org/10.18140/FLX/1440050).
CA-TP2, Ontario - Turkey Point 1989 Plantation White Pine, Plantation established in 1989 over sandy agriculture land (https://doi.org/10.18140/FLX/1440051).
CA-TP3, Ontario - Turkey Point 1974 Plantation White Pine, White pine plantation established in 1974 over sandy abandoned land (https://doi.org/10.18140/FLX/1440052).
CA-TP4, Ontario - Turkey Point 1939 Plantation White Pine, White pine plantation established in 1939 over sandy abandoned land (https://doi.org/10.18140/FLX/1440053).
CA-TPD, Ontario - Turkey Point Mature Deciduous, The forest is approximately 90 years old. Naturally regenerated on sandy terrain and abandoned agricultural land (https://doi.org/10.18140/FLX/1440112).
US-AR1, ARM USDA UNL OSU Woodward Switchgrass 1, The ARM USDA UNL OSU Woodward Switchgrass 1 tower is located on public land owned by the USDA-ARS Southern Plains Range Research Station in Woodward, Oklahoma (https://doi.org/10.18140/FLX/1440103).
US-AR2, ARM USDA UNL OSU Woodward Switchgrass 2, The ARM USDA UNL OSU Woodward Switchgrass 2 tower is located on public land owned by the USDA-ARS Southern Plains Range Research Station in Woodward, Oklahoma (https://doi.org/10.18140/FLX/1440104).
US-ARM, ARM Southern Great Plains site- Lamont, The ARM SGP Burn site is located in the native tallgrass prairies of the USDA Grazinglands Research Laboratory near El Reno, OK (https://doi.org/10.18140/FLX/1440064).
US-ARb, ARM Southern Great Plains burn site- Lamont, The ARM SGP Control site is located in the native tallgrass prairies of the USDA Grazinglands Research Laboratory near El Reno, OK (https://doi.org/10.18140/FLX/1440065).
US-ARc, ARM Southern Great Plains control site- Lamont, Central facility tower crop field (winter wheat, corn, soy, alfalfa) (https://doi.org/10.18140/FLX/1440066).
US-Atq, Atqasuk, This site is 100 km south of Barrow, Alaska, Variety of moist-wet coastal sedge tundra, and moist-tussock tundra surfaces in the more well-drained upland (https://doi.org/10.18140/FLX/1440067).
US-Blo, Blodgett Forest, The flux tower site at Blodgett Forest is on a 1200 ha parcel of land owned by Sierra Pacific Industries in the Sierra Nevada range near Georgetown, California (https://doi.org/10.18140/FLX/1440068).
US-CRT, Curtice Walter-Berger cropland, The conventional cropland site is rain fed and no irrigation is applied. It is located in a part of the historic Great Black Swamp (https://doi.org/10.18140/FLX/1440117).
US-Cop, Corral Pocket, The Corral Pocket site is located in a semi-arid grassland in southeastern Utah, just east of Canyonlands National park (https://doi.org/10.18140/FLX/1440100).
US-GBT, GLEES Brooklyn Tower, US-GBT has been incorporated into the US-GLE dataset, making this site obsolete. The Glacier Lakes Ecosystem Experiments Site (GLEES) site is located on land owned by the U.S. government and managed by US Forest Service as part of the Medicine Bow National Forest (https://doi.org/10.18140/FLX/1440118).
US-GLE, GLEES, The Glacier Lakes Ecosystem Experiments Site (GLEES) site is located on land owned by the U.S. government and managed by US Forest Service as part of the Medicine Bow National Forest (https://doi.org/10.18140/FLX/1440069).
US-Goo, Goodwin Creek, The Goodwin Creek site is located in the Bluff Hills, just east of the Mississippi River valley (https://doi.org/10.18140/FLX/1440070).
US-Ha1, Harvard Forest EMS Tower (HFR1), The Harvard Forest tower is on land owned by Harvard University. The site is designated as an LTER site (https://doi.org/10.18140/FLX/1440071).
US-IB2, Fermi National Accelerator Laboratory- Batavia (Prairie site), Two eddy correlation systems are installed at Fermi National Accelerator Laboratory: one on a restored prairie (established October 2004) and one on a corn/soybean rotation agricultural field (established in July 2005) (https://doi.org/10.18140/FLX/1440072).
US-Ivo, Ivotuk, This site is 300 km south of Barrow and is located at the foothill of the Brooks Range and is classified as tussock sedge, dwarf-shrub, moss tundra (https://doi.org/10.18140/FLX/1440073).
US-KS1, Kennedy Space Center (slash pine), The Kennedy Space Center Slash Pine Flatwoods site is located in the Merritt Island National Wildlife Refuge at the Kennedy Space Center (KSC) on the east coast of central Florida (https://doi.org/10.18140/FLX/1440074).
US-KS2, Kennedy Space Center (scrub oak), The Kennedy Space Center Scrub Oak site is located within the Merritt Island National Wildlife Refuge at the Kennedy Space Center (KSC) on the east coast of central Florida (https://doi.org/10.18140/FLX/1440075).
US-LWW, Little Washita Watershed, Grasslands, rangeland, Temperate, continental. Severe drought in 1998 (https://doi.org/10.18140/FLX/1440077).
US-Lin, Lindcove Orange Orchard, The experimental site was a citrus orchard about three km west of the UC Lindcove Research and Experiment Station (https://doi.org/10.18140/FLX/1440107).
US-Los, Lost Creek, Shrub wetland site, chosen to be representative of the wetlands within the WLEF tall tower flux footprint (https://doi.org/10.18140/FLX/1440076).
US-MMS, Morgan Monroe State Forest, Owned by the Indiana Department of Natural Resources (IDNR), including data from 1999 to 2014 (https://doi.org/10.18140/FLX/1440083).
US-Me1, Metolius - Eyerly burn, An intermediate aged ponderosa pine forest that was severely burned in the 2002 Eyerly wildfire (https://doi.org/10.18140/FLX/1440078).
US-Me2, Metolius mature ponderosa pine (https://doi.org/10.18140/FLX/1440079).
US-Me3, Metolius-second young aged pine, This site is located on a Forest Service mensuration plot (various seed sources) that was planted in 1987 (https://doi.org/10.18140/FLX/1440080).
US-Me4, Metolius-old aged ponderosa pine, The site is located on land designated as a Research Natural Area (RNA) (https://doi.org/10.18140/FLX/1440081).
US-Me5, Metolius-first young aged pine, Previously old-growth ponderosa pine, clearcut in 1978 and allowed to regenerate naturally (https://doi.org/10.18140/FLX/1440082).
US-Me6, Metolius Young Pine Burn, The study site is located east of the Cascade mountains, near Sisters, Central Oregon and is part of the Metolius cluster sites (https://doi.org/10.18140/FLX/1440099).
US-Myb, Mayberry Wetland, The Mayberry Wetland site is a 300-acre restored wetland on Sherman Island, north of Mayberry Slough, that is on the property of Mayberry Farms (https://doi.org/10.18140/FLX/1440105).
US-NR1, Niwot Ridge Forest (LTER NWT1), The Niwot Ridge AmeriFlux site is located in a subalpine forest ecosystem just below the Continental Divide near Nederland, CO. The site is located at 3050 m elevation, within 600m of the NOAA C1 long-term monitoring station, approximately 8 km east of the Continental Divide (https://doi.org/10.18140/FLX/1440087).
US-Ne1, Mead - irrigated continuous maize site, The study site is one of three fields (all located within 1.6 km of each other) at the University of Nebraska Agricultural Research and Development Center near Mead, Nebraska (https://doi.org/10.18140/FLX/1440084).
US-Ne2, Mead - irrigated maize-soybean rotation site, The study site is one of three fields (all located within 1.6 km of each other) at the University of Nebraska Agricultural Research and Development Center near Mead, Nebraska (https://doi.org/10.18140/FLX/1440085).
US-Ne3, Mead - rainfed maize-soybean rotation site, The study site is one of three fields (all located within 1.6 km of each other) at the University of Nebraska Agricultural Research and Development Center near Mead, Nebraska (https://doi.org/10.18140/FLX/1440086).
US-ORv, Olentangy River Wetland Research Park, The ORWRP site is a 21-ha large-scale, long-term wetland campus facility owned by Ohio State University (https://doi.org/10.18140/FLX/1440102).
US-Oho, Oak Openings, The Ohio Oak Openings site is located within the Oak Openings Preserve Metropark of northwest Ohio (https://doi.org/10.18140/FLX/1440088).
US-PFa, Park Falls/WLEF, The flux footprint encompasses a highly heterogeneous landscape of upland forests and wetlands (forested and nonforested) (https://doi.org/10.18140/FLX/1440089).
US-Prr, Poker Flat Research Range Black Spruce Forest, This site is located in a blackspruce forest within the property of the Poker Flat Research Range, University of Alaska, Fairbanks (https://doi.org/10.18140/FLX/1440113).
US-SRC, Santa Rita Creosote, Part of the Santa Rita Experimental Range since 1901; Site vegetation has been dominated by Creosote bush since at least 1934 (https://doi.org/10.18140/FLX/1440098).
US-SRG, Santa Rita Grassland, Semidesert C4 grassland, lies in Pasture 1 on the Santa Rita Experimental Range (https://doi.org/10.18140/FLX/1440114).
US-SRM, Santa Rita Mesquite, Semidesert grassland encroached by mesquite (Prosopis velutina) trees (https://doi.org/10.18140/FLX/1440090).
US-Sta, Saratoga, Sagebrush steppe ecosystem (https://doi.org/10.18140/FLX/1440115).
US-Syv, Sylvania Wilderness Area, Old growth forest consisting primarily of sugar maple and eastern hemlock (https://doi.org/10.18140/FLX/1440091).
US-Ton, Tonzi Ranch, Located in the lower foothills of the Sierra Nevada Mountains (https://doi.org/10.18140/FLX/1440092).
US-Tw1, Twitchell Wetland West Pond, The Twitchell Wetland site is a 7.4-acre restored wetland on Twitchell Island (https://doi.org/10.18140/FLX/1440108).
US-Tw2, Twitchell Corn, The Twitchell Corn site is a corn field on peat soil (https://doi.org/10.18140/FLX/1440109).
US-Tw3, Twitchell Alfalfa, The Twitchell Alfalfa site is an alfalfa field owned by the state of California and leased to third parties for farming (https://doi.org/10.18140/FLX/1440110).
US-Tw4, Twitchell East End Wetland, The Twitchell East End Wetland is a newly constructed restored wetland on Twitchell Island, CA (https://doi.org/10.18140/FLX/1440111).
US-Twt, Twitchell Island, The Twitchell Island site is a rice paddy that is owned by the state and managed by the California Department of Water Resources (https://doi.org/10.18140/FLX/1440106).
US-UMB, Univ. of Mich. Biological Station, The UMBS site is located within a protected forest owned by the University of Michigan (https://doi.org/10.18140/FLX/1440093).
US-UMd, UMBS Disturbance, The UMBS Disturbance site is an artificial disturbance site that has recently been created as part of the Forest Accelerate Succession ExperimenT (FASET) (https://doi.org/10.18140/FLX/1440101).
US-Var, Vaira Ranch- Ione, Located in the lower foothills of the Sierra Nevada Mountains on privately owned land (https://doi.org/10.18140/FLX/1440094).
US-WCr, Willow Creek, Upland decduous broadleaf forest. Mainly sugar maple, also basswood (https://doi.org/10.18140/FLX/1440095).
US-WPT, Winous Point North Marsh, The marsh site has been owned by the Winous Point Shooting Club since 1856 and has been managed by wildlife biologists since 1946 (https://doi.org/10.18140/FLX/1440116).
US-Whs, Walnut Gulch Lucky Hills Shrub, A semiarid Chihuahuan Desert shrubland located in the USDA-ARS Walnut Gulch Experimental Watershed (WGEW) surrounding the town of Tombstone, AZ (https://doi.org/10.18140/FLX/1440097).
US-Wi0, Young red pine (YRP), The Wisconsin Young Red Pine site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440055).
US-Wi1, Intermediate hardwood (IHW), The Wisconsin Intermediate Hardwoods site is located in the Washburn Ranger District of the Chequamegon National Forest (https://doi.org/10.18140/FLX/1440054).
US-Wi2, Intermediate red pine (IRP), The Wisconsin Intermediate Red Pine site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440056).
US-Wi3, Mature hardwood (MHW), The Wisconsin Mature Hardwood site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440057).
US-Wi4, Mature red pine (MRP), The Wisconsin Mature Red Pine site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440058).
US-Wi5, Mixed young jack pine (MYJP), The Wisconsin Mixed Young Jack Pine site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440059).
US-Wi6, Pine barrens #1 (PB1), The Wisconsin Pine Barrens site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440060).
US-Wi7, Red pine clearcut (RPCC), The Wisconsin Clearcut Red Pine site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440061).
US-Wi8, Young hardwood clearcut (YHW), The Wisconsin Clearcut Young Hardwood site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440062).
US-Wi9, Young Jack pine (YJP), The Wisconsin Young Jack Pine site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440063).
US-Wkg, Walnut Gulch Kendall Grasslands, This site is located in a small, intensively-studied, experimental watershed within USDA-ARS's Walnut Gulch Experimental Watershed (https://doi.org/10.18140/FLX/1440096).
ENVELOPE(-82.1556,-82.1556,48.2167,48.2167)
BEGINDATE: 1991-01-01T00:00:00Z ENDDATE: 2014-12-31T00:00:00Z
long_lat ENVELOPE(-59.828,-59.828,-63.497,-63.497)
ENVELOPE(-3.933,-3.933,-71.300,-71.300)
ENVELOPE(-62.083,-62.083,-64.650,-64.650)
ENVELOPE(-61.483,-61.483,-63.997,-63.997)
ENVELOPE(-46.050,-46.050,-60.600,-60.600)
ENVELOPE(-62.950,-62.950,-64.900,-64.900)
ENVELOPE(-145.500,-145.500,-77.283,-77.283)
ENVELOPE(-62.833,-62.833,-65.100,-65.100)
ENVELOPE(-86.133,-86.133,-77.617,-77.617)
ENVELOPE(-63.533,-63.533,-64.817,-64.817)
ENVELOPE(-61.567,-61.567,-64.367,-64.367)
ENVELOPE(-100.000,-100.000,-73.050,-73.050)
ENVELOPE(-117.003,-117.003,56.000,56.000)
ENVELOPE(-102.734,-102.734,56.217,56.217)
ENVELOPE(-82.1556,-82.1556,48.2167,48.2167)
geographic Fairbanks
Canada
Pacific
Kendall
Boreas
Brooklyn
Gulch
Monroe
Corral
Woodward
Goodwin
Washburn
Tombstone
The Bluff
Sherman Island
Reno
Merritt Island
geographic_facet Fairbanks
Canada
Pacific
Kendall
Boreas
Brooklyn
Gulch
Monroe
Corral
Woodward
Goodwin
Washburn
Tombstone
The Bluff
Sherman Island
Reno
Merritt Island
genre Barrow
Brooks Range
glacier
glacier*
Sherman Island
Tundra
Alaska
genre_facet Barrow
Brooks Range
glacier
glacier*
Sherman Island
Tundra
Alaska
_version_ 1800870546930401280
spelling dataone:ess-dive-993a2817d456629-20231122T152659004 2024-06-03T18:46:45+00:00 Improving the Estimation of the Atmospheric Water Vapor Pressure Using Interpretable Long Short-Term Memory Networks: Dataset, Python code, and trained models Bo Gao Ethan Coon Peter Thornton Dan Lu CA-Gro, Ontario - Groundhog River, Boreal Mixedwood Forest, Groundhog River is situated in a typical boreal mixedwood forest in northeastern Ontario (https://doi.org/10.18140/FLX/1440034). CA-NS1, UCI-1850 burn site, The UCI-1850 site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440036). CA-NS2, UCI-1930 burn site, The UCI-1930 site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440037). CA-NS3, UCI-1964 burn site, The UCI-1964 site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440038). CA-NS4, UCI-1964 burn site wet, The UCI-1964 wet site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440039). CA-NS5, UCI-1981 burn site, The UCI-1981 site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440040). CA-NS6, UCI-1989 burn site, The UCI-1989 site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440041). CA-NS7, UCI-1998 burn site, The UCI-1998 site is located in a continental boreal forest, dominated by black spruce trees, within the BOREAS northern study area in central Manitoba (https://doi.org/10.18140/FLX/1440042). CA-Oas, Saskatchewan - Western Boreal, Mature Aspen, BOREAS 1994, 1996, BERMS climate and flux measurements began Dec. 1996 (https://doi.org/10.18140/FLX/1440043). CA-Obs, Saskatchewan - Western Boreal, Mature Black Spruce, BOREAS 1994, 1996, BERMS climate measurements began Dec. 1996 and flux measurements in Apr. 1999 (https://doi.org/10.18140/FLX/1440044). CA-Qfo, Quebec - Eastern Boreal, Mature Black Spruce, 90 - 100 yr old Black Spruce, Jack Pine, feather moss (https://doi.org/10.18140/FLX/1440045). CA-SF1, Saskatchewan - Western Boreal, forest burned in 1977, Regenerated jack pine (Pinus banksiana) following fire in 1977 (https://doi.org/10.18140/FLX/1440046). CA-SF2, Saskatchewan - Western Boreal, forest burned in 1989, The 1989 burn site (F89) was northeast of Prince Albert National Park, Saskatchewan (https://doi.org/10.18140/FLX/1440047). CA-SF3, Saskatchewan - Western Boreal, forest burned in 1998, The 1998 burn site (F98) was in the east part of Prince Albert National Park, Saskatchewan (https://doi.org/10.18140/FLX/1440048). CA-TP1, Ontario - Turkey Point 2002 Plantation White Pine, Plantation established in 2002 on a former sandy agricultural field, which was abandoned three years prior to planting (https://doi.org/10.18140/FLX/1440050). CA-TP2, Ontario - Turkey Point 1989 Plantation White Pine, Plantation established in 1989 over sandy agriculture land (https://doi.org/10.18140/FLX/1440051). CA-TP3, Ontario - Turkey Point 1974 Plantation White Pine, White pine plantation established in 1974 over sandy abandoned land (https://doi.org/10.18140/FLX/1440052). CA-TP4, Ontario - Turkey Point 1939 Plantation White Pine, White pine plantation established in 1939 over sandy abandoned land (https://doi.org/10.18140/FLX/1440053). CA-TPD, Ontario - Turkey Point Mature Deciduous, The forest is approximately 90 years old. Naturally regenerated on sandy terrain and abandoned agricultural land (https://doi.org/10.18140/FLX/1440112). US-AR1, ARM USDA UNL OSU Woodward Switchgrass 1, The ARM USDA UNL OSU Woodward Switchgrass 1 tower is located on public land owned by the USDA-ARS Southern Plains Range Research Station in Woodward, Oklahoma (https://doi.org/10.18140/FLX/1440103). US-AR2, ARM USDA UNL OSU Woodward Switchgrass 2, The ARM USDA UNL OSU Woodward Switchgrass 2 tower is located on public land owned by the USDA-ARS Southern Plains Range Research Station in Woodward, Oklahoma (https://doi.org/10.18140/FLX/1440104). US-ARM, ARM Southern Great Plains site- Lamont, The ARM SGP Burn site is located in the native tallgrass prairies of the USDA Grazinglands Research Laboratory near El Reno, OK (https://doi.org/10.18140/FLX/1440064). US-ARb, ARM Southern Great Plains burn site- Lamont, The ARM SGP Control site is located in the native tallgrass prairies of the USDA Grazinglands Research Laboratory near El Reno, OK (https://doi.org/10.18140/FLX/1440065). US-ARc, ARM Southern Great Plains control site- Lamont, Central facility tower crop field (winter wheat, corn, soy, alfalfa) (https://doi.org/10.18140/FLX/1440066). US-Atq, Atqasuk, This site is 100 km south of Barrow, Alaska, Variety of moist-wet coastal sedge tundra, and moist-tussock tundra surfaces in the more well-drained upland (https://doi.org/10.18140/FLX/1440067). US-Blo, Blodgett Forest, The flux tower site at Blodgett Forest is on a 1200 ha parcel of land owned by Sierra Pacific Industries in the Sierra Nevada range near Georgetown, California (https://doi.org/10.18140/FLX/1440068). US-CRT, Curtice Walter-Berger cropland, The conventional cropland site is rain fed and no irrigation is applied. It is located in a part of the historic Great Black Swamp (https://doi.org/10.18140/FLX/1440117). US-Cop, Corral Pocket, The Corral Pocket site is located in a semi-arid grassland in southeastern Utah, just east of Canyonlands National park (https://doi.org/10.18140/FLX/1440100). US-GBT, GLEES Brooklyn Tower, US-GBT has been incorporated into the US-GLE dataset, making this site obsolete. The Glacier Lakes Ecosystem Experiments Site (GLEES) site is located on land owned by the U.S. government and managed by US Forest Service as part of the Medicine Bow National Forest (https://doi.org/10.18140/FLX/1440118). US-GLE, GLEES, The Glacier Lakes Ecosystem Experiments Site (GLEES) site is located on land owned by the U.S. government and managed by US Forest Service as part of the Medicine Bow National Forest (https://doi.org/10.18140/FLX/1440069). US-Goo, Goodwin Creek, The Goodwin Creek site is located in the Bluff Hills, just east of the Mississippi River valley (https://doi.org/10.18140/FLX/1440070). US-Ha1, Harvard Forest EMS Tower (HFR1), The Harvard Forest tower is on land owned by Harvard University. The site is designated as an LTER site (https://doi.org/10.18140/FLX/1440071). US-IB2, Fermi National Accelerator Laboratory- Batavia (Prairie site), Two eddy correlation systems are installed at Fermi National Accelerator Laboratory: one on a restored prairie (established October 2004) and one on a corn/soybean rotation agricultural field (established in July 2005) (https://doi.org/10.18140/FLX/1440072). US-Ivo, Ivotuk, This site is 300 km south of Barrow and is located at the foothill of the Brooks Range and is classified as tussock sedge, dwarf-shrub, moss tundra (https://doi.org/10.18140/FLX/1440073). US-KS1, Kennedy Space Center (slash pine), The Kennedy Space Center Slash Pine Flatwoods site is located in the Merritt Island National Wildlife Refuge at the Kennedy Space Center (KSC) on the east coast of central Florida (https://doi.org/10.18140/FLX/1440074). US-KS2, Kennedy Space Center (scrub oak), The Kennedy Space Center Scrub Oak site is located within the Merritt Island National Wildlife Refuge at the Kennedy Space Center (KSC) on the east coast of central Florida (https://doi.org/10.18140/FLX/1440075). US-LWW, Little Washita Watershed, Grasslands, rangeland, Temperate, continental. Severe drought in 1998 (https://doi.org/10.18140/FLX/1440077). US-Lin, Lindcove Orange Orchard, The experimental site was a citrus orchard about three km west of the UC Lindcove Research and Experiment Station (https://doi.org/10.18140/FLX/1440107). US-Los, Lost Creek, Shrub wetland site, chosen to be representative of the wetlands within the WLEF tall tower flux footprint (https://doi.org/10.18140/FLX/1440076). US-MMS, Morgan Monroe State Forest, Owned by the Indiana Department of Natural Resources (IDNR), including data from 1999 to 2014 (https://doi.org/10.18140/FLX/1440083). US-Me1, Metolius - Eyerly burn, An intermediate aged ponderosa pine forest that was severely burned in the 2002 Eyerly wildfire (https://doi.org/10.18140/FLX/1440078). US-Me2, Metolius mature ponderosa pine (https://doi.org/10.18140/FLX/1440079). US-Me3, Metolius-second young aged pine, This site is located on a Forest Service mensuration plot (various seed sources) that was planted in 1987 (https://doi.org/10.18140/FLX/1440080). US-Me4, Metolius-old aged ponderosa pine, The site is located on land designated as a Research Natural Area (RNA) (https://doi.org/10.18140/FLX/1440081). US-Me5, Metolius-first young aged pine, Previously old-growth ponderosa pine, clearcut in 1978 and allowed to regenerate naturally (https://doi.org/10.18140/FLX/1440082). US-Me6, Metolius Young Pine Burn, The study site is located east of the Cascade mountains, near Sisters, Central Oregon and is part of the Metolius cluster sites (https://doi.org/10.18140/FLX/1440099). US-Myb, Mayberry Wetland, The Mayberry Wetland site is a 300-acre restored wetland on Sherman Island, north of Mayberry Slough, that is on the property of Mayberry Farms (https://doi.org/10.18140/FLX/1440105). US-NR1, Niwot Ridge Forest (LTER NWT1), The Niwot Ridge AmeriFlux site is located in a subalpine forest ecosystem just below the Continental Divide near Nederland, CO. The site is located at 3050 m elevation, within 600m of the NOAA C1 long-term monitoring station, approximately 8 km east of the Continental Divide (https://doi.org/10.18140/FLX/1440087). US-Ne1, Mead - irrigated continuous maize site, The study site is one of three fields (all located within 1.6 km of each other) at the University of Nebraska Agricultural Research and Development Center near Mead, Nebraska (https://doi.org/10.18140/FLX/1440084). US-Ne2, Mead - irrigated maize-soybean rotation site, The study site is one of three fields (all located within 1.6 km of each other) at the University of Nebraska Agricultural Research and Development Center near Mead, Nebraska (https://doi.org/10.18140/FLX/1440085). US-Ne3, Mead - rainfed maize-soybean rotation site, The study site is one of three fields (all located within 1.6 km of each other) at the University of Nebraska Agricultural Research and Development Center near Mead, Nebraska (https://doi.org/10.18140/FLX/1440086). US-ORv, Olentangy River Wetland Research Park, The ORWRP site is a 21-ha large-scale, long-term wetland campus facility owned by Ohio State University (https://doi.org/10.18140/FLX/1440102). US-Oho, Oak Openings, The Ohio Oak Openings site is located within the Oak Openings Preserve Metropark of northwest Ohio (https://doi.org/10.18140/FLX/1440088). US-PFa, Park Falls/WLEF, The flux footprint encompasses a highly heterogeneous landscape of upland forests and wetlands (forested and nonforested) (https://doi.org/10.18140/FLX/1440089). US-Prr, Poker Flat Research Range Black Spruce Forest, This site is located in a blackspruce forest within the property of the Poker Flat Research Range, University of Alaska, Fairbanks (https://doi.org/10.18140/FLX/1440113). US-SRC, Santa Rita Creosote, Part of the Santa Rita Experimental Range since 1901; Site vegetation has been dominated by Creosote bush since at least 1934 (https://doi.org/10.18140/FLX/1440098). US-SRG, Santa Rita Grassland, Semidesert C4 grassland, lies in Pasture 1 on the Santa Rita Experimental Range (https://doi.org/10.18140/FLX/1440114). US-SRM, Santa Rita Mesquite, Semidesert grassland encroached by mesquite (Prosopis velutina) trees (https://doi.org/10.18140/FLX/1440090). US-Sta, Saratoga, Sagebrush steppe ecosystem (https://doi.org/10.18140/FLX/1440115). US-Syv, Sylvania Wilderness Area, Old growth forest consisting primarily of sugar maple and eastern hemlock (https://doi.org/10.18140/FLX/1440091). US-Ton, Tonzi Ranch, Located in the lower foothills of the Sierra Nevada Mountains (https://doi.org/10.18140/FLX/1440092). US-Tw1, Twitchell Wetland West Pond, The Twitchell Wetland site is a 7.4-acre restored wetland on Twitchell Island (https://doi.org/10.18140/FLX/1440108). US-Tw2, Twitchell Corn, The Twitchell Corn site is a corn field on peat soil (https://doi.org/10.18140/FLX/1440109). US-Tw3, Twitchell Alfalfa, The Twitchell Alfalfa site is an alfalfa field owned by the state of California and leased to third parties for farming (https://doi.org/10.18140/FLX/1440110). US-Tw4, Twitchell East End Wetland, The Twitchell East End Wetland is a newly constructed restored wetland on Twitchell Island, CA (https://doi.org/10.18140/FLX/1440111). US-Twt, Twitchell Island, The Twitchell Island site is a rice paddy that is owned by the state and managed by the California Department of Water Resources (https://doi.org/10.18140/FLX/1440106). US-UMB, Univ. of Mich. Biological Station, The UMBS site is located within a protected forest owned by the University of Michigan (https://doi.org/10.18140/FLX/1440093). US-UMd, UMBS Disturbance, The UMBS Disturbance site is an artificial disturbance site that has recently been created as part of the Forest Accelerate Succession ExperimenT (FASET) (https://doi.org/10.18140/FLX/1440101). US-Var, Vaira Ranch- Ione, Located in the lower foothills of the Sierra Nevada Mountains on privately owned land (https://doi.org/10.18140/FLX/1440094). US-WCr, Willow Creek, Upland decduous broadleaf forest. Mainly sugar maple, also basswood (https://doi.org/10.18140/FLX/1440095). US-WPT, Winous Point North Marsh, The marsh site has been owned by the Winous Point Shooting Club since 1856 and has been managed by wildlife biologists since 1946 (https://doi.org/10.18140/FLX/1440116). US-Whs, Walnut Gulch Lucky Hills Shrub, A semiarid Chihuahuan Desert shrubland located in the USDA-ARS Walnut Gulch Experimental Watershed (WGEW) surrounding the town of Tombstone, AZ (https://doi.org/10.18140/FLX/1440097). US-Wi0, Young red pine (YRP), The Wisconsin Young Red Pine site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440055). US-Wi1, Intermediate hardwood (IHW), The Wisconsin Intermediate Hardwoods site is located in the Washburn Ranger District of the Chequamegon National Forest (https://doi.org/10.18140/FLX/1440054). US-Wi2, Intermediate red pine (IRP), The Wisconsin Intermediate Red Pine site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440056). US-Wi3, Mature hardwood (MHW), The Wisconsin Mature Hardwood site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440057). US-Wi4, Mature red pine (MRP), The Wisconsin Mature Red Pine site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440058). US-Wi5, Mixed young jack pine (MYJP), The Wisconsin Mixed Young Jack Pine site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440059). US-Wi6, Pine barrens #1 (PB1), The Wisconsin Pine Barrens site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440060). US-Wi7, Red pine clearcut (RPCC), The Wisconsin Clearcut Red Pine site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440061). US-Wi8, Young hardwood clearcut (YHW), The Wisconsin Clearcut Young Hardwood site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440062). US-Wi9, Young Jack pine (YJP), The Wisconsin Young Jack Pine site is located in the Washburn Ranger District of the northeastern section of Chequamegon National Forest (https://doi.org/10.18140/FLX/1440063). US-Wkg, Walnut Gulch Kendall Grasslands, This site is located in a small, intensively-studied, experimental watershed within USDA-ARS's Walnut Gulch Experimental Watershed (https://doi.org/10.18140/FLX/1440096). ENVELOPE(-82.1556,-82.1556,48.2167,48.2167) BEGINDATE: 1991-01-01T00:00:00Z ENDDATE: 2014-12-31T00:00:00Z 2023-01-01T00:00:00Z https://search.dataone.org/view/ess-dive-993a2817d456629-20231122T152659004 unknown ESS-DIVE: Deep Insight for Earth Science Data Atmospheric humidity Atmospheric water vapor pressure Long Short-term Memory Static attributes Interpretable deep learning Dataset 2023 dataone:urn:node:ESS_DIVE 2024-06-03T18:20:13Z Atmospheric water vapor pressure is an essential meteorological control on land surface and hydrologic processes. It is not as frequently observed as other meteorologic conditions, but often inferred through the August–Roche–Magnus formula by simply assuming dew point and daily minimum temperatures are equivalent or by empirically correlating the two temperatures using an aridity correction. The performance of both methods varies considerably across different regions and during different time periods; obtaining consistently accurate estimates across space and time remains a great challenge. We applied an interpretable Long Short-Term Memory (iLSTM) network conditioned on static, location specific attributes to estimate daily vapor pressure for 83 FLUXNET sites in the United States and Canada. This data package includes all raw data of the 83 FLUXNET sites, input data for model training/validation/test, trained models and results, and python codes for the manuscript "Improving the Estimation of the Atmospheric Water Vapor Pressure Using an Interpretable Long Short-term Memory Network". Specifically, it consists of five parts. - First, "1_Daymet_data_83sites.zip" includes raw data downloaded from Daymet for the 83 sites used in the paper according to their longitude and latitude, in which vapor pressure is used. It also includes a pre-processed CSV data file combining all data from the 83 sites which is specifically used for the paper. - Second, "2_Fluxnet2015_data_83sites.zip" includes raw half hourly data of the 83 sites downloaded from FLUXNET2015 data portal, pre-processed daily data of the 83 sites, a CSV file including combined pre-processed daily data of the 83 sites, and a CSV file including the information (site ID, site name, latitude, longitude, data available period) of the 83 sites. - Third, "3_MODIS_LAI_data_83sites_raw.zip" includes raw leaf area index (LAI) data downloaded from the AppEEARs data portal. - Fourth, "4_Scripts.zip" includes all scripts related to model training and post-processing of a trained model, and a jupyter notebook showing an example for model post-processing. - Finally, "Trained_models_and_results.zip" includes three folders and three files with suffix ".npy", and each folder corresponds to one file with suffix ".npy" with the same title. Each of the three folders include all trained models associated with one iLSTM model configuration (35 models for each configuration, details are described in the paper). Each file with suffix ".npy" includes the post-processed results of the corresponding 35 models under one iLSTM model configuration. Dataset Barrow Brooks Range glacier glacier* Sherman Island Tundra Alaska ESS-DIVE: Deep Insight for Earth Science Data (via DataONE) Fairbanks Canada Pacific Kendall ENVELOPE(-59.828,-59.828,-63.497,-63.497) Boreas ENVELOPE(-3.933,-3.933,-71.300,-71.300) Brooklyn ENVELOPE(-62.083,-62.083,-64.650,-64.650) Gulch ENVELOPE(-61.483,-61.483,-63.997,-63.997) Monroe ENVELOPE(-46.050,-46.050,-60.600,-60.600) Corral ENVELOPE(-62.950,-62.950,-64.900,-64.900) Woodward ENVELOPE(-145.500,-145.500,-77.283,-77.283) Goodwin ENVELOPE(-62.833,-62.833,-65.100,-65.100) Washburn ENVELOPE(-86.133,-86.133,-77.617,-77.617) Tombstone ENVELOPE(-63.533,-63.533,-64.817,-64.817) The Bluff ENVELOPE(-61.567,-61.567,-64.367,-64.367) Sherman Island ENVELOPE(-100.000,-100.000,-73.050,-73.050) Reno ENVELOPE(-117.003,-117.003,56.000,56.000) Merritt Island ENVELOPE(-102.734,-102.734,56.217,56.217) ENVELOPE(-82.1556,-82.1556,48.2167,48.2167)