An investigation of the robustness of distance measure-based supervised labelling of segmented remote sensing images
Unsupervised clustering methods on remote sensing images have shown good results. However, this type of machine learning needs additional labelling to be an end-to-end classification in the same manner as traditional supervised classification. The automation of the labelling needs further exploratio...
Main Author: | |
---|---|
Format: | Master Thesis |
Language: | English |
Published: |
UiT Norges arktiske universitet
2019
|
Subjects: | |
Online Access: | https://hdl.handle.net/10037/15761 |
_version_ | 1829299107648241664 |
---|---|
author | Kiærbech, Åshild |
author_facet | Kiærbech, Åshild |
author_sort | Kiærbech, Åshild |
collection | University of Tromsø: Munin Open Research Archive |
description | Unsupervised clustering methods on remote sensing images have shown good results. However, this type of machine learning needs additional labelling to be an end-to-end classification in the same manner as traditional supervised classification. The automation of the labelling needs further exploration. We want to investigate the robustness of a supervised automatic labelling scheme by comparing a segmentation with additional automatic labelling against a supervised classification method. Using synthetic aperture radar (SAR) satellite images of sea ice from Sentinel-1, an automatic Expectation Maximization method with a Gaussian mixture model is used for the segmentation, taking into consideration the incidence angle variation within a SAR image. The additional labelling is a likelihood majority vote related to the Mahalanobis distance measure. The Bayesian Maximum Likelihood (ML) is used as the fully supervised reference method. The experiments of comparison are done using various amounts of training data and different percentages of mislabelling in the training data set. The classification results are compared both visually and using classification accuracy. As training data size increases, the accuracy of the ML method tends to decay faster than for the segment-then-label approach, particularly when sample sizes per class are less than a hundred. As more contamination is introduced, the decay is not distinct, probably due to the large within-class variations in the training set. Based on the results, the ML method generally gets a higher overall classification accuracy, but there are weak tendencies for the segment-then-label method to be more robust to decreasing training data size and more mislabelling. |
format | Master Thesis |
genre | Sea ice |
genre_facet | Sea ice |
id | ftunivtroemsoe:oai:munin.uit.no:10037/15761 |
institution | Open Polar |
language | English |
op_collection_id | ftunivtroemsoe |
op_relation | https://hdl.handle.net/10037/15761 |
op_rights | Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) openAccess Copyright 2019 The Author(s) https://creativecommons.org/licenses/by-nc-sa/4.0 |
publishDate | 2019 |
publisher | UiT Norges arktiske universitet |
record_format | openpolar |
spelling | ftunivtroemsoe:oai:munin.uit.no:10037/15761 2025-04-13T14:26:50+00:00 An investigation of the robustness of distance measure-based supervised labelling of segmented remote sensing images Kiærbech, Åshild 2019-06-03 https://hdl.handle.net/10037/15761 eng eng UiT Norges arktiske universitet UiT The Arctic University of Norway https://hdl.handle.net/10037/15761 Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) openAccess Copyright 2019 The Author(s) https://creativecommons.org/licenses/by-nc-sa/4.0 VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Simulering visualisering signalbehandling bildeanalyse: 429 VDP::Mathematics and natural science: 400::Information and communication science: 420::Simulation visualization signal processing image processing: 429 VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Analyse: 411 VDP::Mathematics and natural science: 400::Mathematics: 410::Analysis: 411 VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Statistikk: 412 VDP::Mathematics and natural science: 400::Mathematics: 410::Statistics: 412 Automatic labelling Classification Image segmentation Expectation Maximization Gaussian Mixture Model Maximum Likelihood Sea ice Sentinel-1 SAR Remote sensing Satellite images FYS-3941 Master thesis Mastergradsoppgave 2019 ftunivtroemsoe 2025-03-14T05:17:55Z Unsupervised clustering methods on remote sensing images have shown good results. However, this type of machine learning needs additional labelling to be an end-to-end classification in the same manner as traditional supervised classification. The automation of the labelling needs further exploration. We want to investigate the robustness of a supervised automatic labelling scheme by comparing a segmentation with additional automatic labelling against a supervised classification method. Using synthetic aperture radar (SAR) satellite images of sea ice from Sentinel-1, an automatic Expectation Maximization method with a Gaussian mixture model is used for the segmentation, taking into consideration the incidence angle variation within a SAR image. The additional labelling is a likelihood majority vote related to the Mahalanobis distance measure. The Bayesian Maximum Likelihood (ML) is used as the fully supervised reference method. The experiments of comparison are done using various amounts of training data and different percentages of mislabelling in the training data set. The classification results are compared both visually and using classification accuracy. As training data size increases, the accuracy of the ML method tends to decay faster than for the segment-then-label approach, particularly when sample sizes per class are less than a hundred. As more contamination is introduced, the decay is not distinct, probably due to the large within-class variations in the training set. Based on the results, the ML method generally gets a higher overall classification accuracy, but there are weak tendencies for the segment-then-label method to be more robust to decreasing training data size and more mislabelling. Master Thesis Sea ice University of Tromsø: Munin Open Research Archive |
spellingShingle | VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Simulering visualisering signalbehandling bildeanalyse: 429 VDP::Mathematics and natural science: 400::Information and communication science: 420::Simulation visualization signal processing image processing: 429 VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Analyse: 411 VDP::Mathematics and natural science: 400::Mathematics: 410::Analysis: 411 VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Statistikk: 412 VDP::Mathematics and natural science: 400::Mathematics: 410::Statistics: 412 Automatic labelling Classification Image segmentation Expectation Maximization Gaussian Mixture Model Maximum Likelihood Sea ice Sentinel-1 SAR Remote sensing Satellite images FYS-3941 Kiærbech, Åshild An investigation of the robustness of distance measure-based supervised labelling of segmented remote sensing images |
title | An investigation of the robustness of distance measure-based supervised labelling of segmented remote sensing images |
title_full | An investigation of the robustness of distance measure-based supervised labelling of segmented remote sensing images |
title_fullStr | An investigation of the robustness of distance measure-based supervised labelling of segmented remote sensing images |
title_full_unstemmed | An investigation of the robustness of distance measure-based supervised labelling of segmented remote sensing images |
title_short | An investigation of the robustness of distance measure-based supervised labelling of segmented remote sensing images |
title_sort | investigation of the robustness of distance measure-based supervised labelling of segmented remote sensing images |
topic | VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Simulering visualisering signalbehandling bildeanalyse: 429 VDP::Mathematics and natural science: 400::Information and communication science: 420::Simulation visualization signal processing image processing: 429 VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Analyse: 411 VDP::Mathematics and natural science: 400::Mathematics: 410::Analysis: 411 VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Statistikk: 412 VDP::Mathematics and natural science: 400::Mathematics: 410::Statistics: 412 Automatic labelling Classification Image segmentation Expectation Maximization Gaussian Mixture Model Maximum Likelihood Sea ice Sentinel-1 SAR Remote sensing Satellite images FYS-3941 |
topic_facet | VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Simulering visualisering signalbehandling bildeanalyse: 429 VDP::Mathematics and natural science: 400::Information and communication science: 420::Simulation visualization signal processing image processing: 429 VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Analyse: 411 VDP::Mathematics and natural science: 400::Mathematics: 410::Analysis: 411 VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Statistikk: 412 VDP::Mathematics and natural science: 400::Mathematics: 410::Statistics: 412 Automatic labelling Classification Image segmentation Expectation Maximization Gaussian Mixture Model Maximum Likelihood Sea ice Sentinel-1 SAR Remote sensing Satellite images FYS-3941 |
url | https://hdl.handle.net/10037/15761 |