Description
Summary:Unsupervised clustering methods on remote sensing images have shown good results. However, this type of machine learning needs additional labelling to be an end-to-end classification in the same manner as traditional supervised classification. The automation of the labelling needs further exploration. We want to investigate the robustness of a supervised automatic labelling scheme by comparing a segmentation with additional automatic labelling against a supervised classification method. Using synthetic aperture radar (SAR) satellite images of sea ice from Sentinel-1, an automatic Expectation Maximization method with a Gaussian mixture model is used for the segmentation, taking into consideration the incidence angle variation within a SAR image. The additional labelling is a likelihood majority vote related to the Mahalanobis distance measure. The Bayesian Maximum Likelihood (ML) is used as the fully supervised reference method. The experiments of comparison are done using various amounts of training data and different percentages of mislabelling in the training data set. The classification results are compared both visually and using classification accuracy. As training data size increases, the accuracy of the ML method tends to decay faster than for the segment-then-label approach, particularly when sample sizes per class are less than a hundred. As more contamination is introduced, the decay is not distinct, probably due to the large within-class variations in the training set. Based on the results, the ML method generally gets a higher overall classification accuracy, but there are weak tendencies for the segment-then-label method to be more robust to decreasing training data size and more mislabelling.