|Rev Diabet Stud,
Computational Intelligence-Based Diagnosis Tool for the Detection of Prediabetes and Type 2 Diabetes in India
Shankaracharya1, Devang Odedra1, Subir Samanta2, Ambarish S. Vidyarthi1
1Department of Biotechnology, Birla Institute of Technology, Mesra, Ranchi 835215, India
2Department of Pharmaceutical Sciences, Birla Institute of Technology, Mesra, Ranchi 835215, India
Address correspondence to: Shankaracharya, e-mail: email@example.com
Manuscript submitted April 13, 2012; resubmitted April 27, 2012; accepted May 4, 2012.
Keywords: type 2 diabetes, diabetes diagnosis, prediabetes, computational intelligence, machine learning algorithm, mixture of expert
BACKGROUND: The incidence of diabetes is increasing rapidly across the globe. India has the highest proportion of diabetic patients, earning it the doubtful distinction of the 'diabetes capital of the world'. Early detection of diabetes could help to prevent or postpone its onset by taking appropriate preventive measures, including the initiation of lifestyle changes. To date, early identification of prediabetes or type 2 diabetes has proven problematic, such that there is an urgent requirement for tools enabling easy, quick, and accurate diagnosis. AIM: To develop an easy, quick, and precise tool for diagnosing early diabetes based on machine learning algorithms. METHODS: The dataset used in this study was based on the health profiles of diabetic and non-diabetic patients from hospitals in India. A novel machine learning algorithm, termed "mixture of expert", was used for the determination of a patient's diabetic state. Out of a total of 1415 subjects, 1104 were used to train the mixture of expert system. The remaining 311 data sets were reserved for validation of the algorithm. Mixture of expert was implemented in matlab to train the data for the development of the model. The model with the minimum mean square error was selected and used for the validation of the results. RESULTS: Different combinations and numbers of hidden nodes and expectation maximization (EM) iterations were used to optimize the accuracy of the algorithm. The overall best accuracy of 99.36% was achieved with an iteration of 150 and 20 hidden nodes. Sensitivity, specificity, and total classification accuracy were calculated as 99.5%, 99.07%, and 99.36%, respectively. Furthermore, a graphical user interface was developed in java script such that the user can readily enter the variables and easily use the algorithm as a tool. CONCLUSIONS: This study describes a highly precise machine learning prediction tool for identifying prediabetic, diabetic, and non-diabetic individuals with high accuracy. The tool could be used for large scale screening in hopsitals or diabetes prevention programs.
HTML , PDF
This article has been cited by other articles: