Classification of colorectal cancer based on gene sequencing data with

Purpose: This study aims to classify open-access colorectal cancer gene data and identify essential genes with the XGBoost method, a machine learning method. Materials and Methods: The open-access colorectal cancer gene dataset was used in the study. The dataset included gene sequencing results of 1...

Full description

Bibliographic Details
Main Authors: Akbulut, S, Kucukakcali, Z, Colak, C
Language:unknown
Published: 2022
Subjects:
Online Access:http://hdl.handle.net/11616/86027
id ftinonuuniv:oai:abakus.inonu.edu.tr:11616/86027
record_format openpolar
spelling ftinonuuniv:oai:abakus.inonu.edu.tr:11616/86027 2023-05-15T18:11:21+02:00 Classification of colorectal cancer based on gene sequencing data with Akbulut, S Kucukakcali, Z Colak, C 2022 http://hdl.handle.net/11616/86027 unknown http://hdl.handle.net/11616/86027 CUKUROVA MEDICAL JOURNAL 2022 ftinonuuniv 2023-01-05T18:01:41Z Purpose: This study aims to classify open-access colorectal cancer gene data and identify essential genes with the XGBoost method, a machine learning method. Materials and Methods: The open-access colorectal cancer gene dataset was used in the study. The dataset included gene sequencing results of 10 mucosae from healthy controls and the colonic mucosa of 12 patients with colorectal cancer. XGboost, one of the machine learning methods, was used to classify the disease. Accuracy, balanced accuracy, sensitivity, selectivity, positive predictive value, and negative predictive value performance metrics were evaluated for model performance. Results: According to the variable selection method, 17 genes were selected, and modeling was performed with these input variables. Accuracy, balanced accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score obtained from modeling results were 95.5%, 95.8%, 91.7%, 1%, 1%, and 90.9%, and 95.7%, respectively. According to the variable impotance acquired from the XGboost technique results, the CYR61, NR4A, FOSB, and NR4A2 genes can be employed as biomarkers for colorectal cancer. Conclusion: As a consequence of this research, genes that may be linked to colorectal cancer and genetic biomarkers for the illness were identified. In the future, the detected genes' reliability can be verified, therapeutic procedures can be established based on these genes, and their usefulness in clinical practice may be documented. C1 [Akbulut, Sami] Inonu Univ, Fac Med, Dept Surg, Malatya, Turkey. [Akbulut, Sami; Kucukakcali, Zeynep; Colak, Cemil] Inonu Univ, Fac Med, Dept Biostat & Med Informat, Malatya, Turkey. [Akbulut, Sami] Inonu Univ, Fac Med, Dept Publ Hlth, Malatya, Turkey. C3 Inonu University; Inonu University; Inonu University Other/Unknown Material sami Unknown
institution Open Polar
collection Unknown
op_collection_id ftinonuuniv
language unknown
description Purpose: This study aims to classify open-access colorectal cancer gene data and identify essential genes with the XGBoost method, a machine learning method. Materials and Methods: The open-access colorectal cancer gene dataset was used in the study. The dataset included gene sequencing results of 10 mucosae from healthy controls and the colonic mucosa of 12 patients with colorectal cancer. XGboost, one of the machine learning methods, was used to classify the disease. Accuracy, balanced accuracy, sensitivity, selectivity, positive predictive value, and negative predictive value performance metrics were evaluated for model performance. Results: According to the variable selection method, 17 genes were selected, and modeling was performed with these input variables. Accuracy, balanced accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score obtained from modeling results were 95.5%, 95.8%, 91.7%, 1%, 1%, and 90.9%, and 95.7%, respectively. According to the variable impotance acquired from the XGboost technique results, the CYR61, NR4A, FOSB, and NR4A2 genes can be employed as biomarkers for colorectal cancer. Conclusion: As a consequence of this research, genes that may be linked to colorectal cancer and genetic biomarkers for the illness were identified. In the future, the detected genes' reliability can be verified, therapeutic procedures can be established based on these genes, and their usefulness in clinical practice may be documented. C1 [Akbulut, Sami] Inonu Univ, Fac Med, Dept Surg, Malatya, Turkey. [Akbulut, Sami; Kucukakcali, Zeynep; Colak, Cemil] Inonu Univ, Fac Med, Dept Biostat & Med Informat, Malatya, Turkey. [Akbulut, Sami] Inonu Univ, Fac Med, Dept Publ Hlth, Malatya, Turkey. C3 Inonu University; Inonu University; Inonu University
author Akbulut, S
Kucukakcali, Z
Colak, C
spellingShingle Akbulut, S
Kucukakcali, Z
Colak, C
Classification of colorectal cancer based on gene sequencing data with
author_facet Akbulut, S
Kucukakcali, Z
Colak, C
author_sort Akbulut, S
title Classification of colorectal cancer based on gene sequencing data with
title_short Classification of colorectal cancer based on gene sequencing data with
title_full Classification of colorectal cancer based on gene sequencing data with
title_fullStr Classification of colorectal cancer based on gene sequencing data with
title_full_unstemmed Classification of colorectal cancer based on gene sequencing data with
title_sort classification of colorectal cancer based on gene sequencing data with
publishDate 2022
url http://hdl.handle.net/11616/86027
genre sami
genre_facet sami
op_source CUKUROVA MEDICAL JOURNAL
op_relation http://hdl.handle.net/11616/86027
_version_ 1766184029235707904