Data for simultaneous inference of sea ice state and surface emissivity model using machine learning and data assimilation

Overview This dataset supports the draft manuscript "Simultaneous inference of sea ice state and surface emissivity model using machine learning and data assimilation" which describes a way to infer the daily maps of the sea ice concentration and empirical properties of the sea ice (relati...

Full description

Bibliographic Details
Main Author: Geer, Alan
Format: Other/Unknown Material
Language:unknown
Published: Zenodo 2024
Subjects:
Online Access:https://doi.org/10.5281/zenodo.10453713
Description
Summary:Overview This dataset supports the draft manuscript "Simultaneous inference of sea ice state and surface emissivity model using machine learning and data assimilation" which describes a way to infer the daily maps of the sea ice concentration and empirical properties of the sea ice (relating to its snow cover and its physical properties, such as air inclusions) along with the creation of a new empirical model for the sea ice surface emissivity. This is done using knowledge of the atmosphere state, skin temperature and ocean water emissivity from the European Centre for Medium-range Weather Forecasts (ECMWF) weather forecasting model and the observed radiances at microwave frequencies from the Advanced Microwave Scanning Radiometer 2 (AMSR2). The inverse modelling and state estimation is achieved by combining empirical machine learning elements in a Bayesian-inspired network along with a number of physical components. The work also introduces the idea of an "empirical state", in this case describing the aspects of the sea ice physical state which affect the observations, and which is defined by the inputs to the new empirical model component (in machine learning terms, it is defined by the latent input state of a neural network). This dataset includes the data used in training the model and inferring the sea ice parameters, as well as the outputs from that training process. The software used to perform the training is in Python and uses the Keras and Tensorflow software. See the draft manuscript for full details of this data. The code used in the draft manuscript is archived at https://doi.org/10.5281/zenodo.10013542 The data used in the draft manuscript is archived at https://doi.org/10.5281/zenodo.10033377 Training data Observation space training and ancillary data Training is done at the location of AMSR2 superobservations (superobs) over ocean with less than 1% land contamination and polewards of 45 degrees latitude, between 1st July 2020 and 30th June 2021. There are 64,184,021 superobs used. A superob is ...