A Comparative Study of Knowledge Transfer Methods for Misaligned Urban Building Labels

Misalignment between Earth observation (EO) images and building labels impacts the training of accurate convolutional neural networks (CNNs) for semantic segmentation of building footprints. Recently, three Teacher-Student knowledge transfer methods have been introduced to address this issue: supervised d...

Bibliographic Details
Main Authors: Neupane, Bipul, Aryal, Jagannath, Rajabifard, Abbas
Format: Text
Language: unknown
Published: 2023
Subjects:
DML
Online Access: http://arxiv.org/abs/2311.03867
id ftarxivpreprints:oai:arXiv.org:2311.03867
record_format openpolar
spelling ftarxivpreprints:oai:arXiv.org:2311.03867 2023-12-10T09:48:11+01:00 A Comparative Study of Knowledge Transfer Methods for Misaligned Urban Building Labels Neupane, Bipul Aryal, Jagannath Rajabifard, Abbas 2023-11-07 http://arxiv.org/abs/2311.03867 unknown http://arxiv.org/abs/2311.03867 Computer Science - Computer Vision and Pattern Recognition text 2023 ftarxivpreprints 2023-11-12T02:07:52Z Misalignment between Earth observation (EO) images and building labels impacts the training of accurate convolutional neural networks (CNNs) for semantic segmentation of building footprints. Recently, three Teacher-Student knowledge transfer methods have been introduced to address this issue: supervised domain adaptation (SDA), knowledge distillation (KD), and deep mutual learning (DML). However, these methods have rarely been studied for different urban buildings (low-rise, mid-rise, high-rise, and skyscrapers), where misalignment increases with building height and spatial resolution. In this study, we present a workflow for the systematic comparative study of the three methods. The workflow first identifies the best (with the highest evaluation scores) hyperparameters, lightweight CNNs for the Student (among 43 CNNs from Computer Vision), and encoder-decoder networks (EDNs) for both Teachers and Students. Secondly, three building footprint datasets are developed to train and evaluate the identified Teachers and Students in the three transfer methods. The results show that U-Net with VGG19 (U-VGG19) is the best Teacher, and U-EfficientNetv2B3 and U-EfficientNet-lite0 are among the best Students. With these Teacher-Student pairs, SDA could yield F1 scores of up to 0.943, 0.868, 0.912, and 0.697 for low-rise, mid-rise, high-rise, and skyscraper buildings, respectively. KD and DML provide model compression of up to 82%, with only a marginal loss in performance. 
This new comparison concludes that SDA is the most effective method to address the misalignment problem, while KD and DML can efficiently compress network size without significant loss in performance. The 158 experiments and datasets developed in this study will be valuable for minimising the impact of misaligned labels. Comment: This work has been submitted to Elsevier for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. Text DML ArXiv.org (Cornell University Library)
institution Open Polar
collection ArXiv.org (Cornell University Library)
op_collection_id ftarxivpreprints
language unknown
topic Computer Science - Computer Vision and Pattern Recognition
description Misalignment between Earth observation (EO) images and building labels impacts the training of accurate convolutional neural networks (CNNs) for semantic segmentation of building footprints. Recently, three Teacher-Student knowledge transfer methods have been introduced to address this issue: supervised domain adaptation (SDA), knowledge distillation (KD), and deep mutual learning (DML). However, these methods have rarely been studied for different urban buildings (low-rise, mid-rise, high-rise, and skyscrapers), where misalignment increases with building height and spatial resolution. In this study, we present a workflow for the systematic comparative study of the three methods. The workflow first identifies the best (with the highest evaluation scores) hyperparameters, lightweight CNNs for the Student (among 43 CNNs from Computer Vision), and encoder-decoder networks (EDNs) for both Teachers and Students. Secondly, three building footprint datasets are developed to train and evaluate the identified Teachers and Students in the three transfer methods. The results show that U-Net with VGG19 (U-VGG19) is the best Teacher, and U-EfficientNetv2B3 and U-EfficientNet-lite0 are among the best Students. With these Teacher-Student pairs, SDA could yield F1 scores of up to 0.943, 0.868, 0.912, and 0.697 for low-rise, mid-rise, high-rise, and skyscraper buildings, respectively. KD and DML provide model compression of up to 82%, with only a marginal loss in performance. This new comparison concludes that SDA is the most effective method to address the misalignment problem, while KD and DML can efficiently compress network size without significant loss in performance. The 158 experiments and datasets developed in this study will be valuable for minimising the impact of misaligned labels. Comment: This work has been submitted to Elsevier for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible.
format Text
author Neupane, Bipul
Aryal, Jagannath
Rajabifard, Abbas
author_sort Neupane, Bipul
title A Comparative Study of Knowledge Transfer Methods for Misaligned Urban Building Labels
title_sort comparative study of knowledge transfer methods for misaligned urban building labels
publishDate 2023
url http://arxiv.org/abs/2311.03867
genre DML
genre_facet DML
op_relation http://arxiv.org/abs/2311.03867
_version_ 1784892093368369152