Abstract
Objectives
The decayed-missing-filled (DMFT) index is a representative oral health indicator. Prediction of DMFT index is an important basis for the development of public oral health care projects and strategies for caries prevention. In this study, we used data from the 2015 Korean children's oral health survey to predict DMFT index and caries risk groups using statistical techniques and four different machine-learning algorithms.
Methods
DMFT prediction models were constructed using multiple linear regression and four different machine-learning algorithms: decision tree regressor, decision tree classifier (DTC), random forest regressor, and random forest classifier (RFC). Thereafter, their accuracies were compared.
Results
For the DMFT predictive model, the prediction accuracy of multiple linear regression and RFC were 15.24% and 43.27%, respectively. The accuracy of DTC prediction was 2.84 times that of multiple linear regression. The important feature of the machine-learning model, which predicts DMFT index and the caries risk group, was the number of teeth with sealants.
Conclusions
Using data from the 2015 Korean children's oral health survey, which is considered big data in the field of oral health survey in Korea, this study confirmed that machine-learning models are more useful than statistical models for predicting DMFT index and caries risk in 12-year-old children. Therefore, it is expected that the machine-learning model can be used to predict the DMFT score.
References
1. Petersen PE, Bourgeois D, Ogawa H, Estupinan-Day S, Ndiaye C. The global burden of oral diseases and risks to oral health. Bull World Health Organ. 2005; 83:661–669.
2. Fejerskov O, Nyvad B, Kidd EAM. Dental caries: what is it. In: Fejer-skov O, Nyvad B, Kidd EAM. Dental caries. 3th ed. West sussex: John Wiley & Sons, Ltd;2015. p. 7–10.
3. Ministry of Health and Welfare. 2015 Korean Children’s Oral Health Survey. Sejong: Ministry of Health & Welfare;2015.
4. Seul MS. Current status and future developments of machine learning artificial intelligence in law: focusing the cusp of machine learning in U.S. and discourses over legal profession and law school education. The Justice. 2016; 156:269–302.
5. National library of Korea. National library of Korea digital collection. Leading the fourth industrial revolution-artificial intelligence and deep learning [Internet]. [cited 2019 Jan 02]. Available from:. http://nlcollection.nl.go.kr/front/search/searchList.do?facet=&indent=&query=%EC%9D%B8%EA%B3%B5%EC%A7%80%EB%8A%A5%2C+%EB%8D%B0%EC%9D%B4%ED%84%B0&facetField=true&wt=json&searchPageType=main&searchKeyword=%EC%9D%B8%EA%B3%B5%EC%A7%80%EB%8A%A5%2C+%EB%8D%B0%EC%9D%B4%ED%84%B0&searchSelect=all&searchFacet=&solrStart=0&solrEnd=20&solrRows=20.
6. Lee JH, Kim DH, Jeong SN, Choi SH. Detection and diagnosis of dental caries using a deep learning-based convolutional neural network algorithm. J Dent. 2018; 77:106–111.
7. National Science and Technology Information Service. Artificial intelligence, dentistry [Internet]. [cited 2019 Dec 17]. Available from:. https://www.ntis.go.kr/ThSearchTotalList.do?sort=RANK%2FDESC&ntisYn=&searchWord=%EC%9D%B8%EA%B3%B5%EC%A7%80%EB%8A%A5%2C+%EC%B9%98%EA%B3%BC.
8. Kim SJ, Ahn HC. Application of random forests to corporate credit rating prediction. The Journal of Business and Economics. 2016; 32:187–211.
9. Han EJ. Screening test data analysis for cataract happening prediction model using random forest [master’s thesis]. Seoul: Yonsei Uni-versity;2005. [Korean].
10. Yoo JH, Hong SH, Park HG, Kim DM, Kim SJ, Park SJ. Utilization of elderly stroke disease prediction using machine learning method. Korean Society for Emotion and Sensibility 2017 Annual spring conference program. 2017. 42.
11. Won SH, Lee CG, Park JM. A study on the prediction of land price with machine learning technique. Journal of the Korean Association of Professional Geographers. 2017; 51:347–355.
12. Bae SW, Yu JS. Estimation of the apartment housing price using the machine learning methods: the case of gangnam-gu, seoul. Journal of the Korea Real Estate Analysts Association. 2017; 1:293–309.
13. Choi CH, Park KH, Park HK, Lee MJ, Kim JS, Kim HS. Development of heavy rain damage prediction function for public facility using machine. J Korean Soc Hazard Mitig. 2017; 17:443–450.
14. Lee HY, Noh SC. Chaper 8. Linear regression. In: Lee HY, Noh SC. Advanced statistical analysis. 2nd ed. Seoul: Moonwoosa;2013. p. 250–339.
15. Montenegro RD, Oliveira ALI, Cabral GG, Katz CRT, Rosenblatt A. A comparative study of machine learning techniques for caries prediction. 2008 20th IEEE International Conference on Tools with Artificial Intelligence. 2008; 2:477–481.
16. Tamaki Y, Nomura Y, Katsumura S, Okada A, Yamada H, Tsuge S, et al. Construction of a dental caries prediction model by data mining. J Oral Sci. 2009; 51:61–68.
17. Gansky SA. Dental data mining: potential pitfalls and practical issues. Adv Dent Res. 2003; 17:109–114.
18. Ito A, Hayashi M, Hamasaki T, Ebisu S. Risk assessment of dental caries by using Classification and Regression Trees. J Dent. 2011; 39:457–463.
19. Churpek MM, Yuen TC, Winslow C, Meltzer DO, Kattan MW, Edel-son DP. Multicenter comparison of machine learning methods and conventional regression for predicting clinical deterioration on the wards. Crit Care Med. 2016; 44:368–374.
20. Chung SY, Cho JW, Jung YS, Kim HY, Kim JY, Choi YH, et al. Association between unmet needs for dental treatment and the DMFT index among Korean adults. J Korean Acad Oral Health. 2017; 41:267–273.
21. Shin HE, Kim HJ, Cho MJ, Choi YH, Song KB. Relationship between cancer and oral health in Korean adults determined using data from the 6th (2013-2014) Korea National Health and Nutrition Examination Survey. J Korean Acad Oral Health. 2017; 41:16–21.
22. Nyanye. [Internet]. [cited 2019 Nov 26]. Available from:. https://nyanye.com/machine-learning/2017/01/18/Decision-tree/.
23. Ahn SH, You HY, Kim MJ, Han DH, Kim JB, Jeong SH. Caries preventive effect of permanent teeth using pit and fissure sealant program and community water fluoridation program. J Korean Acad Oral Health. 2012; 36:289–296.
24. Oulis CJ, Berdouses ED, Mamai-Homata E, Polychronopoulou A. Prevalence of sealants in relation to dental caries on the permanent molars of 12 and 15-year-old Greek adolescents. A national pathfinder survey. BMC Public Health. 2011; 11:100.
25. van Loveren C, Lingstöm P. Chaper 8. Diet and dental caries. Fejerskov O, Nyvad B, Kidd E, editors. Dental Caries: the Disease and Its Clinical Management. 3rd ed.Oxford: Wiley Blackwell;2015. p. 133–154.
26. Hausen H, Baelum V. Chaper 23. How accurately can we assess the risk for developing caries lesions? Fejerskov O, Nyvad B, Kidd E, editors. Dental Caries: the Disease and Its Clinical Management. 3rd ed.Oxford: Wiley Blackwell;2015. p. 423–438.
27. van der Ploeg T, Austin PC, Steyerberg EW. Modern modelling techniques are data hungry: a simulation study for predicting dichotomous endpoints. BMC Med Res Methodol. 2014; 14:137.
28. Breiman L. Random Forests. Machine Learning. 2001; 45:5–32.
Table 1.
Table 3.
R Square: 0.119, Dependent variable: DMFT, Variables Entered with Enter method. Gender: Men=0, Women=1 / Regine: City=0, Rural area=1 / Number of pit and fissure sealant: 0-16 / Perceived oral health status: Very good-Very poor=1-5 / Dental treatment demand for the past one year: Yes=1, No=0 / Experience of toothache for the past one year: Yes=1, No=0 / Frequency of snack intake per day: No intake=1, once=2, 2 times=3, 3 times=4, 4 and over=5 / Number of oral hygiene auxiliaries using: 0-5.