A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver
Univariate and multivariate feature selection methods can be used for biomarker discovery in analysis of toxicant exposure. Among the univariate methods, differential expression analysis (DEA) is often applied for its simplicity and interpretability. A characteristic of methods for DEA is that they...
Main Authors: | , |
---|---|
Format: | Text |
Language: | unknown |
Published: |
2019
|
Subjects: | |
Online Access: | http://arxiv.org/abs/1905.08048 https://doi.org/10.1007/978-3-030-35664-4_11 |
id |
ftarxivpreprints:oai:arXiv.org:1905.08048 |
---|---|
record_format |
openpolar |
spelling |
ftarxivpreprints:oai:arXiv.org:1905.08048 2023-09-05T13:18:00+02:00 A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver Zhang, Xiaokang Jonassen, Inge 2019-05-20 http://arxiv.org/abs/1905.08048 https://doi.org/10.1007/978-3-030-35664-4_11 unknown http://arxiv.org/abs/1905.08048 doi:10.1007/978-3-030-35664-4_11 Computer Science - Machine Learning Quantitative Biology - Quantitative Methods Statistics - Machine Learning text 2019 ftarxivpreprints https://doi.org/10.1007/978-3-030-35664-4_11 2023-08-16T15:20:23Z Univariate and multivariate feature selection methods can be used for biomarker discovery in analysis of toxicant exposure. Among the univariate methods, differential expression analysis (DEA) is often applied for its simplicity and interpretability. A characteristic of methods for DEA is that they treat genes individually, disregarding the correlation that exists between them. On the other hand, some multivariate feature selection methods are proposed for biomarker discovery. Provided with various biomarker discovery methods, how to choose the most suitable method for a specific dataset becomes a problem. In this paper, we present a framework for comparison of potential biomarker discovery methods: three methods that stem from different theories are compared by how stable they are and how well they can improve the classification accuracy. The three methods we have considered are: Significance Analysis of Microarrays (SAM) which identifies the differentially expressed genes; minimum Redundancy Maximum Relevance (mRMR) based on information theory; and Characteristic Direction (GeoDE) inspired by a graphical perspective. Tested on the gene expression data from two experiments exposing the cod fish to two different toxicants (MeHg and PCB 153), different methods stand out in different cases, so a decision upon the most suitable method should be made based on the dataset under study and the research interest. Comment: 11 pages, 4 figures, 2019 NAIS Symposium Text atlantic cod Gadus morhua ArXiv.org (Cornell University Library) 114 123 |
institution |
Open Polar |
collection |
ArXiv.org (Cornell University Library) |
op_collection_id |
ftarxivpreprints |
language |
unknown |
topic |
Computer Science - Machine Learning Quantitative Biology - Quantitative Methods Statistics - Machine Learning |
spellingShingle |
Computer Science - Machine Learning Quantitative Biology - Quantitative Methods Statistics - Machine Learning Zhang, Xiaokang Jonassen, Inge A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver |
topic_facet |
Computer Science - Machine Learning Quantitative Biology - Quantitative Methods Statistics - Machine Learning |
description |
Univariate and multivariate feature selection methods can be used for biomarker discovery in analysis of toxicant exposure. Among the univariate methods, differential expression analysis (DEA) is often applied for its simplicity and interpretability. A characteristic of methods for DEA is that they treat genes individually, disregarding the correlation that exists between them. On the other hand, some multivariate feature selection methods are proposed for biomarker discovery. Provided with various biomarker discovery methods, how to choose the most suitable method for a specific dataset becomes a problem. In this paper, we present a framework for comparison of potential biomarker discovery methods: three methods that stem from different theories are compared by how stable they are and how well they can improve the classification accuracy. The three methods we have considered are: Significance Analysis of Microarrays (SAM) which identifies the differentially expressed genes; minimum Redundancy Maximum Relevance (mRMR) based on information theory; and Characteristic Direction (GeoDE) inspired by a graphical perspective. Tested on the gene expression data from two experiments exposing the cod fish to two different toxicants (MeHg and PCB 153), different methods stand out in different cases, so a decision upon the most suitable method should be made based on the dataset under study and the research interest. Comment: 11 pages, 4 figures, 2019 NAIS Symposium |
format |
Text |
author |
Zhang, Xiaokang Jonassen, Inge |
author_facet |
Zhang, Xiaokang Jonassen, Inge |
author_sort |
Zhang, Xiaokang |
title |
A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver |
title_short |
A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver |
title_full |
A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver |
title_fullStr |
A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver |
title_full_unstemmed |
A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver |
title_sort |
comparative analysis of feature selection methods for biomarker discovery in study of toxicant-treated atlantic cod (gadus morhua) liver |
publishDate |
2019 |
url |
http://arxiv.org/abs/1905.08048 https://doi.org/10.1007/978-3-030-35664-4_11 |
genre |
atlantic cod Gadus morhua |
genre_facet |
atlantic cod Gadus morhua |
op_relation |
http://arxiv.org/abs/1905.08048 doi:10.1007/978-3-030-35664-4_11 |
op_doi |
https://doi.org/10.1007/978-3-030-35664-4_11 |
container_start_page |
114 |
op_container_end_page |
123 |
_version_ |
1776199068793438208 |