An investigation of the robustness of distance measure-based supervised labelling of segmented remote sensing images

Bibliographic Details
Main Author:	Kiærbech, Åshild
Format:	Master Thesis
Language:	English
Published:	UiT Norges arktiske universitet 2019
Subjects:	VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Simulering visualisering signalbehandling bildeanalyse: 429; VDP::Mathematics and natural science: 400::Information and communication science: 420::Simulation visualization signal processing image processing: 429; VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Analyse: 411; VDP::Mathematics and natural science: 400::Mathematics: 410::Analysis: 411; VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Statistikk: 412; VDP::Mathematics and natural science: 400::Mathematics: 410::Statistics: 412; Automatic labelling; Classification; Image segmentation; Expectation Maximization; Gaussian Mixture Model; Maximum Likelihood; Sea ice; Sentinel-1; SAR; Remote sensing; Satellite images; FYS-3941
Online Access:	https://hdl.handle.net/10037/15761

Description
Summary:	Unsupervised clustering methods on remote sensing images have shown good results. However, this type of machine learning needs additional labelling to become an end-to-end classification in the same manner as traditional supervised classification, and the automation of that labelling needs further exploration. We investigate the robustness of a supervised automatic labelling scheme by comparing a segmentation with additional automatic labelling against a supervised classification method. Using synthetic aperture radar (SAR) satellite images of sea ice from Sentinel-1, an automatic Expectation Maximization method with a Gaussian mixture model is used for the segmentation, taking into consideration the incidence angle variation within a SAR image. The additional labelling is a likelihood majority vote related to the Mahalanobis distance measure. The Bayesian Maximum Likelihood (ML) classifier is used as the fully supervised reference method. The comparison experiments use various amounts of training data and different percentages of mislabelling in the training data set. The classification results are compared both visually and by classification accuracy. As training data size decreases, the accuracy of the ML method tends to decay faster than that of the segment-then-label approach, particularly when sample sizes per class are less than a hundred. As more contamination is introduced, the decay is not distinct, probably due to the large within-class variations in the training set. Based on the results, the ML method generally achieves a higher overall classification accuracy, but there are weak tendencies for the segment-then-label method to be more robust to decreasing training data size and to more mislabelling.
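
The labelling step in the summary can be illustrated with a minimal sketch. This is one possible reading of "majority vote related to the Mahalanobis distance", not the thesis's actual implementation: each pixel votes for the class whose training mean is nearest in Mahalanobis distance, and each segment takes the most common vote. The function names, the two-feature layout, the class names "ice" and "water", and all numeric values are hypothetical and chosen only for illustration. The segmentation itself (EM with a Gaussian mixture model) is assumed to have been done already and is represented by the `segment_ids` list.

```python
from collections import Counter


def class_stats(samples):
    """Return the mean and 2x2 sample covariance of (x, y) feature pairs."""
    n = len(samples)
    mx = sum(s[0] for s in samples) / n
    my = sum(s[1] for s in samples) / n
    sxx = sum((s[0] - mx) ** 2 for s in samples) / (n - 1)
    syy = sum((s[1] - my) ** 2 for s in samples) / (n - 1)
    sxy = sum((s[0] - mx) * (s[1] - my) for s in samples) / (n - 1)
    return (mx, my), (sxx, sxy, syy)


def mahalanobis_sq(p, mean, cov):
    """Squared Mahalanobis distance of point p to a 2-D Gaussian."""
    sxx, sxy, syy = cov
    det = sxx * syy - sxy * sxy
    dx, dy = p[0] - mean[0], p[1] - mean[1]
    # Closed-form inverse of the symmetric 2x2 covariance matrix.
    return (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det


def label_segments(pixels, segment_ids, training):
    """Assign each segment the class that wins a per-pixel majority vote."""
    stats = {label: class_stats(s) for label, s in training.items()}
    votes = {}
    for p, seg in zip(pixels, segment_ids):
        # Each pixel votes for the class at the smallest distance.
        nearest = min(stats, key=lambda c: mahalanobis_sq(p, *stats[c]))
        votes.setdefault(seg, Counter())[nearest] += 1
    return {seg: c.most_common(1)[0][0] for seg, c in votes.items()}


# Hypothetical two-feature training samples per class (illustrative values).
training = {
    "ice": [(-10, -20), (-11, -21.5), (-9, -19), (-10.5, -19.5)],
    "water": [(-20, -28), (-21, -27), (-19, -29.5), (-20.5, -28.5)],
}
# Five pixels already grouped into two segments by some prior segmentation.
pixels = [(-10, -20), (-9.5, -19.5), (-20, -28), (-20, -28), (-21, -27.5)]
segment_ids = [0, 0, 0, 1, 1]

labels = label_segments(pixels, segment_ids, training)
print(labels)
```

In this toy input, segment 0 contains one pixel that lies closer to the "water" class, but the vote over the whole segment still decides its label. A full Gaussian likelihood would also include the log-determinant of each class covariance; the sketch omits it and ranks classes by distance alone.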
