Fisher linear discriminant analysis for classification and prediction of genomic susceptibility to stomach and colorectal cancers based on six STR loci in a northern Chinese Han population
Objective Gastrointestinal cancer is the leading cause of cancer-related death worldwide. The aim of this study was to verify whether the genotype of six short tandem repeat (STR) loci including AR, Bat-25, D5S346, ER1, ER2, and FGA is associated with the risk of gastric cancer (GC) and colorectal c...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
PeerJ Inc.
2019-05-01
|
Series: | PeerJ |
Subjects: | |
Online Access: | https://peerj.com/articles/7004.pdf |
id |
doaj-2ad467f46e4241f880170218079aac8b |
---|---|
record_format |
Article |
spelling |
doaj-2ad467f46e4241f880170218079aac8b2020-11-25T02:01:07ZengPeerJ Inc.PeerJ2167-83592019-05-017e700410.7717/peerj.7004Fisher linear discriminant analysis for classification and prediction of genomic susceptibility to stomach and colorectal cancers based on six STR loci in a northern Chinese Han populationShuhong Hao0Ming Ren1Dong Li2Yujie Sui3Qingyu Wang4Gaoyang Chen5Zhaoyan Li6Qiwei Yang7Department of Hematology and Oncology, The Second Hospital of Jilin University, Changchun, Jilin Province, ChinaDepartment of Orthopedics, The Second Hospital of Jilin University, Changchun, Jilin Province, ChinaDepartment of Obstetrics and Gynecology, The Second Hospital of Jilin University, Changchun, Jilin Province, ChinaMedical Research Center, The Second Hospital of Jilin University, Changchun, Jilin Province, ChinaDepartment of Orthopedics, The Second Hospital of Jilin University, Changchun, Jilin Province, ChinaDepartment of Orthopedics, The Second Hospital of Jilin University, Changchun, Jilin Province, ChinaDepartment of Orthopedics, The Second Hospital of Jilin University, Changchun, Jilin Province, ChinaMedical Research Center, The Second Hospital of Jilin University, Changchun, Jilin Province, ChinaObjective Gastrointestinal cancer is the leading cause of cancer-related death worldwide. The aim of this study was to verify whether the genotype of six short tandem repeat (STR) loci including AR, Bat-25, D5S346, ER1, ER2, and FGA is associated with the risk of gastric cancer (GC) and colorectal cancer (CRC) and to develop a model that allows early diagnosis and prediction of inherited genomic susceptibility to GC and CRC. Methods Alleles of six STR loci were determined using the peripheral blood of six colon cancer patients, five rectal cancer patients, eight GC patients, and 30 healthy controls. Fisher linear discriminant analysis (FDA) was used to establish the discriminant formula to distinguish GC and CRC patients from healthy controls. Leave-one-out cross validation and receiver operating characteristic (ROC) curves were used to validate the accuracy of the formula. The relationship between the STR status and immunohistochemical (IHC) and tumor markers was analyzed using multiple correspondence analysis. Results D5S346 was confirmed as a GC- and CRC-related STR locus. For the first time, we established a discriminant formula on the basis of the six STR loci, which was used to estimate the risk coefficient of suffering from GC and CRC. The model was statistically significant (Wilks’ lambda = 0.471, χ2 = 30.488, df = 13, and p = 0.004). The results of leave-one-out cross validation showed that the sensitivity of the formula was 73.7% and the specificity was 76.7%. The area under the ROC curve (AUC) was 0.926, with a sensitivity of 73.7% and a specificity of 93.3%. The STR status was shown to have a certain relationship with the expression of some IHC markers and the level of some tumor markers. Conclusions The results of this study complement clinical diagnostic criteria and present markers for early prediction of GC and CRC. This approach will aid in improving risk awareness of susceptible individuals and contribute to reducing the incidence of GC and CRC by prevention and early detection.https://peerj.com/articles/7004.pdfMolecular diagnosisGenomic susceptibility predictionSTRFisher linear discriminant analysisGastrointestinal cancer |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Shuhong Hao Ming Ren Dong Li Yujie Sui Qingyu Wang Gaoyang Chen Zhaoyan Li Qiwei Yang |
spellingShingle |
Shuhong Hao Ming Ren Dong Li Yujie Sui Qingyu Wang Gaoyang Chen Zhaoyan Li Qiwei Yang Fisher linear discriminant analysis for classification and prediction of genomic susceptibility to stomach and colorectal cancers based on six STR loci in a northern Chinese Han population PeerJ Molecular diagnosis Genomic susceptibility prediction STR Fisher linear discriminant analysis Gastrointestinal cancer |
author_facet |
Shuhong Hao Ming Ren Dong Li Yujie Sui Qingyu Wang Gaoyang Chen Zhaoyan Li Qiwei Yang |
author_sort |
Shuhong Hao |
title |
Fisher linear discriminant analysis for classification and prediction of genomic susceptibility to stomach and colorectal cancers based on six STR loci in a northern Chinese Han population |
title_short |
Fisher linear discriminant analysis for classification and prediction of genomic susceptibility to stomach and colorectal cancers based on six STR loci in a northern Chinese Han population |
title_full |
Fisher linear discriminant analysis for classification and prediction of genomic susceptibility to stomach and colorectal cancers based on six STR loci in a northern Chinese Han population |
title_fullStr |
Fisher linear discriminant analysis for classification and prediction of genomic susceptibility to stomach and colorectal cancers based on six STR loci in a northern Chinese Han population |
title_full_unstemmed |
Fisher linear discriminant analysis for classification and prediction of genomic susceptibility to stomach and colorectal cancers based on six STR loci in a northern Chinese Han population |
title_sort |
fisher linear discriminant analysis for classification and prediction of genomic susceptibility to stomach and colorectal cancers based on six str loci in a northern chinese han population |
publisher |
PeerJ Inc. |
series |
PeerJ |
issn |
2167-8359 |
publishDate |
2019-05-01 |
description |
Objective Gastrointestinal cancer is the leading cause of cancer-related death worldwide. The aim of this study was to verify whether the genotype of six short tandem repeat (STR) loci including AR, Bat-25, D5S346, ER1, ER2, and FGA is associated with the risk of gastric cancer (GC) and colorectal cancer (CRC) and to develop a model that allows early diagnosis and prediction of inherited genomic susceptibility to GC and CRC. Methods Alleles of six STR loci were determined using the peripheral blood of six colon cancer patients, five rectal cancer patients, eight GC patients, and 30 healthy controls. Fisher linear discriminant analysis (FDA) was used to establish the discriminant formula to distinguish GC and CRC patients from healthy controls. Leave-one-out cross validation and receiver operating characteristic (ROC) curves were used to validate the accuracy of the formula. The relationship between the STR status and immunohistochemical (IHC) and tumor markers was analyzed using multiple correspondence analysis. Results D5S346 was confirmed as a GC- and CRC-related STR locus. For the first time, we established a discriminant formula on the basis of the six STR loci, which was used to estimate the risk coefficient of suffering from GC and CRC. The model was statistically significant (Wilks’ lambda = 0.471, χ2 = 30.488, df = 13, and p = 0.004). The results of leave-one-out cross validation showed that the sensitivity of the formula was 73.7% and the specificity was 76.7%. The area under the ROC curve (AUC) was 0.926, with a sensitivity of 73.7% and a specificity of 93.3%. The STR status was shown to have a certain relationship with the expression of some IHC markers and the level of some tumor markers. Conclusions The results of this study complement clinical diagnostic criteria and present markers for early prediction of GC and CRC. This approach will aid in improving risk awareness of susceptible individuals and contribute to reducing the incidence of GC and CRC by prevention and early detection. |
topic |
Molecular diagnosis Genomic susceptibility prediction STR Fisher linear discriminant analysis Gastrointestinal cancer |
url |
https://peerj.com/articles/7004.pdf |
work_keys_str_mv |
AT shuhonghao fisherlineardiscriminantanalysisforclassificationandpredictionofgenomicsusceptibilitytostomachandcolorectalcancersbasedonsixstrlociinanorthernchinesehanpopulation AT mingren fisherlineardiscriminantanalysisforclassificationandpredictionofgenomicsusceptibilitytostomachandcolorectalcancersbasedonsixstrlociinanorthernchinesehanpopulation AT dongli fisherlineardiscriminantanalysisforclassificationandpredictionofgenomicsusceptibilitytostomachandcolorectalcancersbasedonsixstrlociinanorthernchinesehanpopulation AT yujiesui fisherlineardiscriminantanalysisforclassificationandpredictionofgenomicsusceptibilitytostomachandcolorectalcancersbasedonsixstrlociinanorthernchinesehanpopulation AT qingyuwang fisherlineardiscriminantanalysisforclassificationandpredictionofgenomicsusceptibilitytostomachandcolorectalcancersbasedonsixstrlociinanorthernchinesehanpopulation AT gaoyangchen fisherlineardiscriminantanalysisforclassificationandpredictionofgenomicsusceptibilitytostomachandcolorectalcancersbasedonsixstrlociinanorthernchinesehanpopulation AT zhaoyanli fisherlineardiscriminantanalysisforclassificationandpredictionofgenomicsusceptibilitytostomachandcolorectalcancersbasedonsixstrlociinanorthernchinesehanpopulation AT qiweiyang fisherlineardiscriminantanalysisforclassificationandpredictionofgenomicsusceptibilitytostomachandcolorectalcancersbasedonsixstrlociinanorthernchinesehanpopulation |
_version_ |
1724958659613556736 |