Original Data

Rev Diabet Stud, 2012, 9(1):55-62 DOI 10.1900/RDS.2012.9.55

Computational Intelligence-Based Diagnosis Tool for the Detection of Prediabetes and Type 2 Diabetes in India

Shankaracharya1, Devang Odedra1, Subir Samanta2, Ambarish S. Vidyarthi1

1Department of Biotechnology, Birla Institute of Technology, Mesra, Ranchi 835215, India
2Department of Pharmaceutical Sciences, Birla Institute of Technology, Mesra, Ranchi 835215, India
Address correspondence to: Shankaracharya, e-mail: shankaracharya@bitmesra.ac.in

Manuscript submitted April 13, 2012; resubmitted April 27, 2012; accepted May 4, 2012.

Keywords: type 2 diabetes, diabetes diagnosis, prediabetes, computational intelligence, machine learning algorithm, mixture of expert


BACKGROUND: The incidence of diabetes is increasing rapidly across the globe. India has the highest proportion of diabetic patients, earning it the doubtful distinction of the 'diabetes capital of the world'. Early detection of diabetes could help to prevent or postpone its onset by taking appropriate preventive measures, including the initiation of lifestyle changes. To date, early identification of prediabetes or type 2 diabetes has proven problematic, such that there is an urgent requirement for tools enabling easy, quick, and accurate diagnosis. AIM: To develop an easy, quick, and precise tool for diagnosing early diabetes based on machine learning algorithms. METHODS: The dataset used in this study was based on the health profiles of diabetic and non-diabetic patients from hospitals in India. A novel machine learning algorithm, termed "mixture of expert", was used for the determination of a patient's diabetic state. Out of a total of 1415 subjects, 1104 were used to train the mixture of expert system. The remaining 311 data sets were reserved for validation of the algorithm. Mixture of expert was implemented in matlab to train the data for the development of the model. The model with the minimum mean square error was selected and used for the validation of the results. RESULTS: Different combinations and numbers of hidden nodes and expectation maximization (EM) iterations were used to optimize the accuracy of the algorithm. The overall best accuracy of 99.36% was achieved with an iteration of 150 and 20 hidden nodes. Sensitivity, specificity, and total classification accuracy were calculated as 99.5%, 99.07%, and 99.36%, respectively. Furthermore, a graphical user interface was developed in java script such that the user can readily enter the variables and easily use the algorithm as a tool. CONCLUSIONS: This study describes a highly precise machine learning prediction tool for identifying prediabetic, diabetic, and non-diabetic individuals with high accuracy. The tool could be used for large scale screening in hopsitals or diabetes prevention programs.

Fulltext: HTML , PDF (224KB)

This article has been cited by other articles:

Machine Learning and Data Mining Methods in Diabetes Research

Kavakiotis I, Tsave O, Salifoglou A, Maglaveras N, Vlahavas I, Chouvarda I

Comput Struct Biotechnol J 2017. 15:104-116

A Predictive Model to Forecast and Pre-Treat Diabetes Mellitus using Clinical Big Data in Cloud

Joshitta RS, Arockiam L

Int J Appl Engin Res 2015. 10(82):55-59

Classification of Diabetes Mellitus using Modified Particle Swarm Optimization and Least Squares Support Vector Machine

Soliman OS, AboElhamd E

Int J Comp Trend Technol 2014. 8(1):38-44

Modified mixture of experts for the diagnosis of perfusion magnetic resonance imaging measures in locally rectal cancer patients

Myoung S

Healthc Inform Res 2013. 19(2):130-136