Improving cross-dataset generalization in image classification with contrastive representation learning

Regular monitoring of marine wildlife is essential for rapid detection of changes in the marine ecosystem allowing for adaptive strategies. However, the manual analysis of large volumes of underwater images taken by cameras is highly time-consuming. Deep learning techniques have been adopted in mari...

Full description

Bibliographic Details
Main Author: Saffar, Najmeh
Other Authors: Ashraf, Ahmed, Khoshdarregi, Matt (Mechanical Engineering), Yahampath, Pradeepa (Electrical and Computer Engineering), Khan, Sheroz (KITE Research Institute)
Format: Master Thesis
Language:English
Published: 2022
Subjects:
Online Access:http://hdl.handle.net/1993/36792
id ftunivmanitoba:oai:mspace.lib.umanitoba.ca:1993/36792
record_format openpolar
spelling ftunivmanitoba:oai:mspace.lib.umanitoba.ca:1993/36792 2023-06-18T03:40:00+02:00 Improving cross-dataset generalization in image classification with contrastive representation learning Saffar, Najmeh Ashraf, Ahmed Khoshdarregi, Matt (Mechanical Engineering) Yahampath, Pradeepa (Electrical and Computer Engineering) Khan, Sheroz (KITE Research Institute) 2022-08-25T19:58:42Z application/pdf http://hdl.handle.net/1993/36792 eng eng http://hdl.handle.net/1993/36792 open access Image classification Object detection Deep learning Convolutional neural network master thesis 2022 ftunivmanitoba 2023-06-04T17:41:18Z Regular monitoring of marine wildlife is essential for rapid detection of changes in the marine ecosystem allowing for adaptive strategies. However, the manual analysis of large volumes of underwater images taken by cameras is highly time-consuming. Deep learning techniques have been adopted in marine wildlife for the automatic classification of underwater photos to accelerate image analysis. However, water quality varies at different locations, depths, and acquisition times during data collection. This, along with differences in other acquisition parameters, leads to datasets with idiosyncratic footprints and, therefore, limited generalization of the trained deep learning model to other sets of images different from the training set. As a result, more work is required toward improving the cross-dataset generalization of deep learning models. In our research, we started by assessing dataset biases' impact on cross-dataset generalization in the classification of beluga whale images from empty underwater image frames. We used three underwater image datasets with varying image acquisition profiles: a dataset of good water quality photos, moderately bad water quality photos, and a dataset of images with both the horizon and water in the same frame. Then, we investigated two frameworks to improve cross-dataset generalization. One attempts to unlearn dataset-specific information for explicitly handling the dataset bias problem. The other uses a contrastive loss for learning a representation by contrasting the images with beluga whales against the images with empty frames regardless of their dataset membership. We conducted an exhaustive evaluation of proposed deep learning architectures and compared performance using cross-dataset approaches with traditional architectures. The supervised contrastive approach outperforms the other architectures. To the best of our knowledge, this was the first use of contrastive settings to implicitly address the dataset bias problem. Mitacs Accelerate Program University of Manitoba ... Master Thesis Beluga Beluga whale Beluga* MSpace at the University of Manitoba
institution Open Polar
collection MSpace at the University of Manitoba
op_collection_id ftunivmanitoba
language English
topic Image classification
Object detection
Deep learning
Convolutional neural network
spellingShingle Image classification
Object detection
Deep learning
Convolutional neural network
Saffar, Najmeh
Improving cross-dataset generalization in image classification with contrastive representation learning
topic_facet Image classification
Object detection
Deep learning
Convolutional neural network
description Regular monitoring of marine wildlife is essential for rapid detection of changes in the marine ecosystem allowing for adaptive strategies. However, the manual analysis of large volumes of underwater images taken by cameras is highly time-consuming. Deep learning techniques have been adopted in marine wildlife for the automatic classification of underwater photos to accelerate image analysis. However, water quality varies at different locations, depths, and acquisition times during data collection. This, along with differences in other acquisition parameters, leads to datasets with idiosyncratic footprints and, therefore, limited generalization of the trained deep learning model to other sets of images different from the training set. As a result, more work is required toward improving the cross-dataset generalization of deep learning models. In our research, we started by assessing dataset biases' impact on cross-dataset generalization in the classification of beluga whale images from empty underwater image frames. We used three underwater image datasets with varying image acquisition profiles: a dataset of good water quality photos, moderately bad water quality photos, and a dataset of images with both the horizon and water in the same frame. Then, we investigated two frameworks to improve cross-dataset generalization. One attempts to unlearn dataset-specific information for explicitly handling the dataset bias problem. The other uses a contrastive loss for learning a representation by contrasting the images with beluga whales against the images with empty frames regardless of their dataset membership. We conducted an exhaustive evaluation of proposed deep learning architectures and compared performance using cross-dataset approaches with traditional architectures. The supervised contrastive approach outperforms the other architectures. To the best of our knowledge, this was the first use of contrastive settings to implicitly address the dataset bias problem. Mitacs Accelerate Program University of Manitoba ...
author2 Ashraf, Ahmed
Khoshdarregi, Matt (Mechanical Engineering)
Yahampath, Pradeepa (Electrical and Computer Engineering)
Khan, Sheroz (KITE Research Institute)
format Master Thesis
author Saffar, Najmeh
author_facet Saffar, Najmeh
author_sort Saffar, Najmeh
title Improving cross-dataset generalization in image classification with contrastive representation learning
title_short Improving cross-dataset generalization in image classification with contrastive representation learning
title_full Improving cross-dataset generalization in image classification with contrastive representation learning
title_fullStr Improving cross-dataset generalization in image classification with contrastive representation learning
title_full_unstemmed Improving cross-dataset generalization in image classification with contrastive representation learning
title_sort improving cross-dataset generalization in image classification with contrastive representation learning
publishDate 2022
url http://hdl.handle.net/1993/36792
genre Beluga
Beluga whale
Beluga*
genre_facet Beluga
Beluga whale
Beluga*
op_relation http://hdl.handle.net/1993/36792
op_rights open access
_version_ 1769004803223977984