Ocean Heat Content Structure Revealed by Un-Supervised Classification of Hydrographic Profiles

International audience In the data mining community, unsupervised classification (or clustering) technics are used to reveal and explore the hidden structure of a dataset. Among them, mixture models and in particular Gaussian Mixture Models (GMM), are a very popular tool. With GMM, the statistical d...

Full description

Bibliographic Details
Main Authors: Maze, Guillaume, Mercier, Herlé, Fablet, Ronan, Lenca, Philippe, Lopez Radcenco, Manuel, Tandeo, Pierre, Le Goff, Clement, Feucher, Charlène
Other Authors: Laboratoire de physique des océans (LPO), Institut de Recherche pour le Développement (IRD)-Institut Français de Recherche pour l'Exploitation de la Mer (IFREMER)-Université de Brest (UBO)-Centre National de la Recherche Scientifique (CNRS), Lab-STICC_TB_CID_TOMS, Laboratoire des sciences et techniques de l'information, de la communication et de la connaissance (Lab-STICC), Université européenne de Bretagne - European University of Brittany (UEB)-École Nationale d'Ingénieurs de Brest (ENIB)-Université de Bretagne Sud (UBS)-Université de Brest (UBO)-Télécom Bretagne-Institut Brestois du Numérique et des Mathématiques (IBNM), Université de Brest (UBO)-École Nationale Supérieure de Techniques Avancées Bretagne (ENSTA Bretagne)-Institut Mines-Télécom Paris (IMT)-Centre National de la Recherche Scientifique (CNRS)-Université européenne de Bretagne - European University of Brittany (UEB)-École Nationale d'Ingénieurs de Brest (ENIB)-Université de Bretagne Sud (UBS)-Université de Brest (UBO)-Télécom Bretagne-Institut Brestois du Numérique et des Mathématiques (IBNM), Université de Brest (UBO)-École Nationale Supérieure de Techniques Avancées Bretagne (ENSTA Bretagne)-Institut Mines-Télécom Paris (IMT)-Centre National de la Recherche Scientifique (CNRS), Département Signal et Communications (SC), Université européenne de Bretagne - European University of Brittany (UEB)-Télécom Bretagne-Institut Mines-Télécom Paris (IMT), Lab-STICC_TB_CID_DECIDE, Département Logique des Usages, Sciences sociales et Sciences de l'Information (LUSSI)
Format: Conference Object
Language:French
Published: HAL CCSD 2016
Subjects:
Online Access:https://hal.science/hal-01345101
id ftunivbrest:oai:HAL:hal-01345101v1
record_format openpolar
institution Open Polar
collection Université de Bretagne Occidentale: HAL
op_collection_id ftunivbrest
language French
topic Ocean heat
Unsupervised classification
Hydrographic Profiles
[INFO.INFO-CY]Computer Science [cs]/Computers and Society [cs.CY]
[SDU.OCEAN]Sciences of the Universe [physics]/Ocean
Atmosphere
[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
[STAT.ML]Statistics [stat]/Machine Learning [stat.ML]
spellingShingle Ocean heat
Unsupervised classification
Hydrographic Profiles
[INFO.INFO-CY]Computer Science [cs]/Computers and Society [cs.CY]
[SDU.OCEAN]Sciences of the Universe [physics]/Ocean
Atmosphere
[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
[STAT.ML]Statistics [stat]/Machine Learning [stat.ML]
Maze, Guillaume
Mercier, Herlé
Fablet, Ronan
Lenca, Philippe
Lopez Radcenco, Manuel
Tandeo, Pierre
Le Goff, Clement
Feucher, Charlène
Ocean Heat Content Structure Revealed by Un-Supervised Classification of Hydrographic Profiles
topic_facet Ocean heat
Unsupervised classification
Hydrographic Profiles
[INFO.INFO-CY]Computer Science [cs]/Computers and Society [cs.CY]
[SDU.OCEAN]Sciences of the Universe [physics]/Ocean
Atmosphere
[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
[STAT.ML]Statistics [stat]/Machine Learning [stat.ML]
description International audience In the data mining community, unsupervised classification (or clustering) technics are used to reveal and explore the hidden structure of a dataset. Among them, mixture models and in particular Gaussian Mixture Models (GMM), are a very popular tool. With GMM, the statistical distribution of the dataset is decomposed into a weighted sum of Gaussian modes that maximizes the likelihood of the dataset. Multivariate Gaussian distributions characterize a mode in the D-dimensional space of the dataset. This technic is routinely used in atmospheric science to describe weather regimes but is yet to be explored in physical oceanography. Here, we are interested in determining the structure of the North Atlantic ocean temperature field. Argo profiles in the North atlantic were compressed along their vertical axis in order to reduce the dimensionality of the problem. We then fitted GMMs to this reduced profile dataset according to a Maximum Likelihood criterion using the Expectation-Maximization algorithm. A seven-mode GMM was the most relevant to describe the dataset. Each mode or cluster is characterized by the parameters of a Gaussian distribution (a mean and a covariance matrix). GMM also provides for each profile of the dataset the probability it belongs to a specific cluster. We will show that these informations can be used to describe physically coherent heat reservoirs and their variability. Indeed, we found that clusters capture the large scale climatological structure of the temperature field. Each of the cluster correspond to physically coherent regions, namely the equatorial, tropical, subtropical, intergyre and subpolar regions and are associated with reference profiles. A hierarchical clustering was applied to characterize the regional variability of the dataset. We will present those clusters and possible use both for scientific analysis of the heat content variability in the North Atlantic and technical validation of the Argo array.
author2 Laboratoire de physique des océans (LPO)
Institut de Recherche pour le Développement (IRD)-Institut Français de Recherche pour l'Exploitation de la Mer (IFREMER)-Université de Brest (UBO)-Centre National de la Recherche Scientifique (CNRS)
Lab-STICC_TB_CID_TOMS
Laboratoire des sciences et techniques de l'information, de la communication et de la connaissance (Lab-STICC)
Université européenne de Bretagne - European University of Brittany (UEB)-École Nationale d'Ingénieurs de Brest (ENIB)-Université de Bretagne Sud (UBS)-Université de Brest (UBO)-Télécom Bretagne-Institut Brestois du Numérique et des Mathématiques (IBNM)
Université de Brest (UBO)-École Nationale Supérieure de Techniques Avancées Bretagne (ENSTA Bretagne)-Institut Mines-Télécom Paris (IMT)-Centre National de la Recherche Scientifique (CNRS)-Université européenne de Bretagne - European University of Brittany (UEB)-École Nationale d'Ingénieurs de Brest (ENIB)-Université de Bretagne Sud (UBS)-Université de Brest (UBO)-Télécom Bretagne-Institut Brestois du Numérique et des Mathématiques (IBNM)
Université de Brest (UBO)-École Nationale Supérieure de Techniques Avancées Bretagne (ENSTA Bretagne)-Institut Mines-Télécom Paris (IMT)-Centre National de la Recherche Scientifique (CNRS)
Département Signal et Communications (SC)
Université européenne de Bretagne - European University of Brittany (UEB)-Télécom Bretagne-Institut Mines-Télécom Paris (IMT)
Lab-STICC_TB_CID_DECIDE
Département Logique des Usages, Sciences sociales et Sciences de l'Information (LUSSI)
format Conference Object
author Maze, Guillaume
Mercier, Herlé
Fablet, Ronan
Lenca, Philippe
Lopez Radcenco, Manuel
Tandeo, Pierre
Le Goff, Clement
Feucher, Charlène
author_facet Maze, Guillaume
Mercier, Herlé
Fablet, Ronan
Lenca, Philippe
Lopez Radcenco, Manuel
Tandeo, Pierre
Le Goff, Clement
Feucher, Charlène
author_sort Maze, Guillaume
title Ocean Heat Content Structure Revealed by Un-Supervised Classification of Hydrographic Profiles
title_short Ocean Heat Content Structure Revealed by Un-Supervised Classification of Hydrographic Profiles
title_full Ocean Heat Content Structure Revealed by Un-Supervised Classification of Hydrographic Profiles
title_fullStr Ocean Heat Content Structure Revealed by Un-Supervised Classification of Hydrographic Profiles
title_full_unstemmed Ocean Heat Content Structure Revealed by Un-Supervised Classification of Hydrographic Profiles
title_sort ocean heat content structure revealed by un-supervised classification of hydrographic profiles
publisher HAL CCSD
publishDate 2016
url https://hal.science/hal-01345101
op_coverage New Orleans, United States
long_lat ENVELOPE(-60.667,-60.667,-63.950,-63.950)
geographic Orleans
geographic_facet Orleans
genre North Atlantic
genre_facet North Atlantic
op_source Actes OSD 2016 : Ocean Sciences Meeting
OSD 2016 : Ocean Sciences Meeting
https://hal.science/hal-01345101
OSD 2016 : Ocean Sciences Meeting , Feb 2016, New Orleans, États-Unis
op_relation hal-01345101
https://hal.science/hal-01345101
_version_ 1785569862918078464
spelling ftunivbrest:oai:HAL:hal-01345101v1 2023-12-17T10:46:24+01:00 Ocean Heat Content Structure Revealed by Un-Supervised Classification of Hydrographic Profiles Maze, Guillaume Mercier, Herlé Fablet, Ronan Lenca, Philippe Lopez Radcenco, Manuel Tandeo, Pierre Le Goff, Clement Feucher, Charlène Laboratoire de physique des océans (LPO) Institut de Recherche pour le Développement (IRD)-Institut Français de Recherche pour l'Exploitation de la Mer (IFREMER)-Université de Brest (UBO)-Centre National de la Recherche Scientifique (CNRS) Lab-STICC_TB_CID_TOMS Laboratoire des sciences et techniques de l'information, de la communication et de la connaissance (Lab-STICC) Université européenne de Bretagne - European University of Brittany (UEB)-École Nationale d'Ingénieurs de Brest (ENIB)-Université de Bretagne Sud (UBS)-Université de Brest (UBO)-Télécom Bretagne-Institut Brestois du Numérique et des Mathématiques (IBNM) Université de Brest (UBO)-École Nationale Supérieure de Techniques Avancées Bretagne (ENSTA Bretagne)-Institut Mines-Télécom Paris (IMT)-Centre National de la Recherche Scientifique (CNRS)-Université européenne de Bretagne - European University of Brittany (UEB)-École Nationale d'Ingénieurs de Brest (ENIB)-Université de Bretagne Sud (UBS)-Université de Brest (UBO)-Télécom Bretagne-Institut Brestois du Numérique et des Mathématiques (IBNM) Université de Brest (UBO)-École Nationale Supérieure de Techniques Avancées Bretagne (ENSTA Bretagne)-Institut Mines-Télécom Paris (IMT)-Centre National de la Recherche Scientifique (CNRS) Département Signal et Communications (SC) Université européenne de Bretagne - European University of Brittany (UEB)-Télécom Bretagne-Institut Mines-Télécom Paris (IMT) Lab-STICC_TB_CID_DECIDE Département Logique des Usages, Sciences sociales et Sciences de l'Information (LUSSI) New Orleans, United States 2016-02-21 https://hal.science/hal-01345101 fr fre HAL CCSD hal-01345101 https://hal.science/hal-01345101 Actes OSD 2016 : Ocean Sciences Meeting OSD 2016 : Ocean Sciences Meeting https://hal.science/hal-01345101 OSD 2016 : Ocean Sciences Meeting , Feb 2016, New Orleans, États-Unis Ocean heat Unsupervised classification Hydrographic Profiles [INFO.INFO-CY]Computer Science [cs]/Computers and Society [cs.CY] [SDU.OCEAN]Sciences of the Universe [physics]/Ocean Atmosphere [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing [STAT.ML]Statistics [stat]/Machine Learning [stat.ML] info:eu-repo/semantics/conferenceObject Conference papers 2016 ftunivbrest 2023-11-21T23:43:27Z International audience In the data mining community, unsupervised classification (or clustering) technics are used to reveal and explore the hidden structure of a dataset. Among them, mixture models and in particular Gaussian Mixture Models (GMM), are a very popular tool. With GMM, the statistical distribution of the dataset is decomposed into a weighted sum of Gaussian modes that maximizes the likelihood of the dataset. Multivariate Gaussian distributions characterize a mode in the D-dimensional space of the dataset. This technic is routinely used in atmospheric science to describe weather regimes but is yet to be explored in physical oceanography. Here, we are interested in determining the structure of the North Atlantic ocean temperature field. Argo profiles in the North atlantic were compressed along their vertical axis in order to reduce the dimensionality of the problem. We then fitted GMMs to this reduced profile dataset according to a Maximum Likelihood criterion using the Expectation-Maximization algorithm. A seven-mode GMM was the most relevant to describe the dataset. Each mode or cluster is characterized by the parameters of a Gaussian distribution (a mean and a covariance matrix). GMM also provides for each profile of the dataset the probability it belongs to a specific cluster. We will show that these informations can be used to describe physically coherent heat reservoirs and their variability. Indeed, we found that clusters capture the large scale climatological structure of the temperature field. Each of the cluster correspond to physically coherent regions, namely the equatorial, tropical, subtropical, intergyre and subpolar regions and are associated with reference profiles. A hierarchical clustering was applied to characterize the regional variability of the dataset. We will present those clusters and possible use both for scientific analysis of the heat content variability in the North Atlantic and technical validation of the Argo array. Conference Object North Atlantic Université de Bretagne Occidentale: HAL Orleans ENVELOPE(-60.667,-60.667,-63.950,-63.950)