Evaluation of feature ranking and regression methods for oceanic chlorophyll-a estimation

Accepted manuscript version. Published version available at https://doi.org/10.1109/JSTARS.2018.2810704. This paper evaluates two alternative regression techniques for oceanic chlorophyll-a (Chl-a) content estimation. One of the investigated methodologies is the recently introduced Gaussian process...

Full description

Bibliographic Details
Published in:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Main Authors: Blix, Katalin, Eltoft, Torbjørn
Format: Article in Journal/Newspaper
Language:English
Published: Institute of Electrical and Electronics Engineers 2018
Subjects:
Online Access:https://hdl.handle.net/10037/15028
https://doi.org/10.1109/JSTARS.2018.2810704
Description
Summary:Accepted manuscript version. Published version available at https://doi.org/10.1109/JSTARS.2018.2810704. This paper evaluates two alternative regression techniques for oceanic chlorophyll-a (Chl-a) content estimation. One of the investigated methodologies is the recently introduced Gaussian process regression (GPR) model. We explore two feature ranking methods derived for the GPR model, namely sensitivity analysis (SA) and automatic relevance determination (ARD). We also investigate a second regression method, the partial least squares regression (PLSR) for oceanic Chl-a content estimation. Feature relevance in the PLSR model can be accessed through the variable importance in projection (VIP) feature ranking algorithm. This paper thus analyzes three feature ranking models, SA, ARD, and VIP, which are all derived from different fundamental principles, and uses the ranked features as inputs to the GPR and PLSR to assess regression strengths. We compare the regression performances using some common performance measures, and show how the feature ranking methods can be used to find the lowest number of features to estimate oceanic Chl-a content by using the GPR and PLSR models, while still producing comparable performance to the state-of-the-art algorithms. We evaluate the models on a global MEdium Resolution Imaging Spectrometer matchup dataset. Our results show that the GPR model has the best regression performance for most of the input feature sets we used, and our conclusion is this model can favorably be used for Chl-a content retrieval, already with two features, ranked by either the SA or ARD methods.