A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors

Dental caries is a multifactorial disease that can be caused by interactions between genetic and environmental risk factors. Despite the availability of caries risk assessment tools, caries risk prediction models incorporating new factors, such as human genetic markers, have not yet been reported. T...

Full description

Bibliographic Details
Main Authors: Liangyue Pang, Ketian Wang, Ye Tao, Qinghui Zhi, Jianming Zhang, Huancai Lin
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-03-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fgene.2021.636867/full
id doaj-00c1d039587b441e8c07bd239152df43
record_format Article
spelling doaj-00c1d039587b441e8c07bd239152df432021-03-11T04:47:59ZengFrontiers Media S.A.Frontiers in Genetics1664-80212021-03-011210.3389/fgene.2021.636867636867A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic FactorsLiangyue Pang0Ketian Wang1Ye Tao2Qinghui Zhi3Jianming Zhang4Huancai Lin5Guangdong Provincial Key Laboratory of Stomatology, Department of Preventive Dentistry, Guanghua School of Stomatology, Hospital of Stomatology, Sun Yat-sen University, Guangzhou, ChinaGuangdong Provincial Key Laboratory of Stomatology, Department of Preventive Dentistry, Guanghua School of Stomatology, Hospital of Stomatology, Sun Yat-sen University, Guangzhou, ChinaGuangdong Provincial Key Laboratory of Stomatology, Department of Preventive Dentistry, Guanghua School of Stomatology, Hospital of Stomatology, Sun Yat-sen University, Guangzhou, ChinaGuangdong Provincial Key Laboratory of Stomatology, Department of Preventive Dentistry, Guanghua School of Stomatology, Hospital of Stomatology, Sun Yat-sen University, Guangzhou, ChinaFoshan Stomatology Hospital, School of Stomatology and Medicine, Foshan University, Foshan, ChinaGuangdong Provincial Key Laboratory of Stomatology, Department of Preventive Dentistry, Guanghua School of Stomatology, Hospital of Stomatology, Sun Yat-sen University, Guangzhou, ChinaDental caries is a multifactorial disease that can be caused by interactions between genetic and environmental risk factors. Despite the availability of caries risk assessment tools, caries risk prediction models incorporating new factors, such as human genetic markers, have not yet been reported. The aim of this study was to construct a new model for caries risk prediction in teenagers, based on environmental and genetic factors, using a machine learning algorithm. We performed a prospective longitudinal study of 1,055 teenagers (710 teenagers for cohort 1 and 345 teenagers for cohort 2) aged 13 years, of whom 953 (633 teenagers for cohort 1 and 320 teenagers for cohort 2) were followed for 21 months. All participants completed an oral health questionnaire, an oral examination, biological (salivary and cariostate) tests, and single nucleotide polymorphism sequencing analysis. We constructed a caries risk prediction model based on these data using a random forest with an AUC of 0.78 in cohort 1 (training cohort). We further verified the discrimination and calibration abilities of this caries risk prediction model using cohort 2. The AUC of the caries risk prediction model in cohort 2 (testing cohort) was 0.73, indicating high discrimination ability. Risk stratification revealed that our caries risk prediction model could accurately identify individuals at high and very high caries risk but underestimated risks for individuals at low and very low caries risk. Thus, our caries risk prediction model has the potential for use as a powerful community-level tool to identify individuals at high caries risk.https://www.frontiersin.org/articles/10.3389/fgene.2021.636867/fullcariesrisk prediction modelpreventive dentistrybiomarkersbiomedical informatics
collection DOAJ
language English
format Article
sources DOAJ
author Liangyue Pang
Ketian Wang
Ye Tao
Qinghui Zhi
Jianming Zhang
Huancai Lin
spellingShingle Liangyue Pang
Ketian Wang
Ye Tao
Qinghui Zhi
Jianming Zhang
Huancai Lin
A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors
Frontiers in Genetics
caries
risk prediction model
preventive dentistry
biomarkers
biomedical informatics
author_facet Liangyue Pang
Ketian Wang
Ye Tao
Qinghui Zhi
Jianming Zhang
Huancai Lin
author_sort Liangyue Pang
title A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors
title_short A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors
title_full A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors
title_fullStr A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors
title_full_unstemmed A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors
title_sort new model for caries risk prediction in teenagers using a machine learning algorithm based on environmental and genetic factors
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2021-03-01
description Dental caries is a multifactorial disease that can be caused by interactions between genetic and environmental risk factors. Despite the availability of caries risk assessment tools, caries risk prediction models incorporating new factors, such as human genetic markers, have not yet been reported. The aim of this study was to construct a new model for caries risk prediction in teenagers, based on environmental and genetic factors, using a machine learning algorithm. We performed a prospective longitudinal study of 1,055 teenagers (710 teenagers for cohort 1 and 345 teenagers for cohort 2) aged 13 years, of whom 953 (633 teenagers for cohort 1 and 320 teenagers for cohort 2) were followed for 21 months. All participants completed an oral health questionnaire, an oral examination, biological (salivary and cariostate) tests, and single nucleotide polymorphism sequencing analysis. We constructed a caries risk prediction model based on these data using a random forest with an AUC of 0.78 in cohort 1 (training cohort). We further verified the discrimination and calibration abilities of this caries risk prediction model using cohort 2. The AUC of the caries risk prediction model in cohort 2 (testing cohort) was 0.73, indicating high discrimination ability. Risk stratification revealed that our caries risk prediction model could accurately identify individuals at high and very high caries risk but underestimated risks for individuals at low and very low caries risk. Thus, our caries risk prediction model has the potential for use as a powerful community-level tool to identify individuals at high caries risk.
topic caries
risk prediction model
preventive dentistry
biomarkers
biomedical informatics
url https://www.frontiersin.org/articles/10.3389/fgene.2021.636867/full
work_keys_str_mv AT liangyuepang anewmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT ketianwang anewmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT yetao anewmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT qinghuizhi anewmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT jianmingzhang anewmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT huancailin anewmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT liangyuepang newmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT ketianwang newmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT yetao newmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT qinghuizhi newmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT jianmingzhang newmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT huancailin newmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
_version_ 1724225979313815552