A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver

Univariate and multivariate feature selection methods can be used for biomarker discovery in analysis of toxicant exposure. Among the univariate methods, differential expression analysis (DEA) is often applied for its simplicity and interpretability. A characteristic of methods for DEA is that they...

Full description

Bibliographic Details
Main Authors: Zhang, Xiaokang, Jonassen, Inge
Format: Text
Language:unknown
Published: arXiv 2019
Subjects:
Online Access:https://dx.doi.org/10.48550/arxiv.1905.08048
https://arxiv.org/abs/1905.08048
id ftdatacite:10.48550/arxiv.1905.08048
record_format openpolar
spelling ftdatacite:10.48550/arxiv.1905.08048 2023-05-15T15:27:41+02:00 A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver Zhang, Xiaokang Jonassen, Inge 2019 https://dx.doi.org/10.48550/arxiv.1905.08048 https://arxiv.org/abs/1905.08048 unknown arXiv https://dx.doi.org/10.1007/978-3-030-35664-4_11 arXiv.org perpetual, non-exclusive license http://arxiv.org/licenses/nonexclusive-distrib/1.0/ Machine Learning cs.LG Quantitative Methods q-bio.QM Machine Learning stat.ML FOS Computer and information sciences FOS Biological sciences article-journal Article ScholarlyArticle Text 2019 ftdatacite https://doi.org/10.48550/arxiv.1905.08048 https://doi.org/10.1007/978-3-030-35664-4_11 2022-04-01T08:38:23Z Univariate and multivariate feature selection methods can be used for biomarker discovery in analysis of toxicant exposure. Among the univariate methods, differential expression analysis (DEA) is often applied for its simplicity and interpretability. A characteristic of methods for DEA is that they treat genes individually, disregarding the correlation that exists between them. On the other hand, some multivariate feature selection methods are proposed for biomarker discovery. Provided with various biomarker discovery methods, how to choose the most suitable method for a specific dataset becomes a problem. In this paper, we present a framework for comparison of potential biomarker discovery methods: three methods that stem from different theories are compared by how stable they are and how well they can improve the classification accuracy. The three methods we have considered are: Significance Analysis of Microarrays (SAM) which identifies the differentially expressed genes; minimum Redundancy Maximum Relevance (mRMR) based on information theory; and Characteristic Direction (GeoDE) inspired by a graphical perspective. Tested on the gene expression data from two experiments exposing the cod fish to two different toxicants (MeHg and PCB 153), different methods stand out in different cases, so a decision upon the most suitable method should be made based on the dataset under study and the research interest. : 11 pages, 4 figures, 2019 NAIS Symposium Text atlantic cod Gadus morhua DataCite Metadata Store (German National Library of Science and Technology)
institution Open Polar
collection DataCite Metadata Store (German National Library of Science and Technology)
op_collection_id ftdatacite
language unknown
topic Machine Learning cs.LG
Quantitative Methods q-bio.QM
Machine Learning stat.ML
FOS Computer and information sciences
FOS Biological sciences
spellingShingle Machine Learning cs.LG
Quantitative Methods q-bio.QM
Machine Learning stat.ML
FOS Computer and information sciences
FOS Biological sciences
Zhang, Xiaokang
Jonassen, Inge
A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver
topic_facet Machine Learning cs.LG
Quantitative Methods q-bio.QM
Machine Learning stat.ML
FOS Computer and information sciences
FOS Biological sciences
description Univariate and multivariate feature selection methods can be used for biomarker discovery in analysis of toxicant exposure. Among the univariate methods, differential expression analysis (DEA) is often applied for its simplicity and interpretability. A characteristic of methods for DEA is that they treat genes individually, disregarding the correlation that exists between them. On the other hand, some multivariate feature selection methods are proposed for biomarker discovery. Provided with various biomarker discovery methods, how to choose the most suitable method for a specific dataset becomes a problem. In this paper, we present a framework for comparison of potential biomarker discovery methods: three methods that stem from different theories are compared by how stable they are and how well they can improve the classification accuracy. The three methods we have considered are: Significance Analysis of Microarrays (SAM) which identifies the differentially expressed genes; minimum Redundancy Maximum Relevance (mRMR) based on information theory; and Characteristic Direction (GeoDE) inspired by a graphical perspective. Tested on the gene expression data from two experiments exposing the cod fish to two different toxicants (MeHg and PCB 153), different methods stand out in different cases, so a decision upon the most suitable method should be made based on the dataset under study and the research interest. : 11 pages, 4 figures, 2019 NAIS Symposium
format Text
author Zhang, Xiaokang
Jonassen, Inge
author_facet Zhang, Xiaokang
Jonassen, Inge
author_sort Zhang, Xiaokang
title A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver
title_short A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver
title_full A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver
title_fullStr A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver
title_full_unstemmed A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver
title_sort comparative analysis of feature selection methods for biomarker discovery in study of toxicant-treated atlantic cod (gadus morhua) liver
publisher arXiv
publishDate 2019
url https://dx.doi.org/10.48550/arxiv.1905.08048
https://arxiv.org/abs/1905.08048
genre atlantic cod
Gadus morhua
genre_facet atlantic cod
Gadus morhua
op_relation https://dx.doi.org/10.1007/978-3-030-35664-4_11
op_rights arXiv.org perpetual, non-exclusive license
http://arxiv.org/licenses/nonexclusive-distrib/1.0/
op_doi https://doi.org/10.48550/arxiv.1905.08048
https://doi.org/10.1007/978-3-030-35664-4_11
_version_ 1766358107300036608