LogSum + L 2 penalized logistic regression model for biomarker selection and cancer classification

Abstract Biomarker selection and cancer classification play an important role in knowledge discovery using genomic data. Successful identification of gene biomarkers and biological pathways can significantly improve the accuracy of diagnosis and help machine learning models have better performance o...

Full description

Bibliographic Details
Main Authors: Xiao-Ying Liu, Sheng-Bing Wu, Wen-Quan Zeng, Zhan-Jiang Yuan, Hong-Bo Xu
Format: Article
Language:English
Published: Nature Publishing Group 2020-12-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-020-79028-0
Description
Summary:Abstract Biomarker selection and cancer classification play an important role in knowledge discovery using genomic data. Successful identification of gene biomarkers and biological pathways can significantly improve the accuracy of diagnosis and help machine learning models have better performance on classification of different types of cancer. In this paper, we proposed a LogSum + L 2 penalized logistic regression model, and furthermore used a coordinate decent algorithm to solve it. The results of simulations and real experiments indicate that the proposed method is highly competitive among several state-of-the-art methods. Our proposed model achieves the excellent performance in group feature selection and classification problems.
ISSN:2045-2322