A Comprehensive Exploration to the Machine Learning Techniques for Diabetes Identification


Authors: S. Wei, X. Zhao, and C. Miao
Title: A Comprehensive Exploration to the Machine Learning Techniques for Diabetes Identification
Abstract: Diabetes mellitus, commonly known as diabetes, is a group of metabolic disorders that has affected hundreds of millions of people. Because of its severe complications, detecting diabetes is of great importance. Many studies have addressed diabetes identification, and many of them are based on the Pima Indian diabetes data set. This data set comes from a study of women in the Pima Indian population, begun in 1965, in which the onset rate of diabetes is comparatively high. Most previous studies focused on one or two particular complex techniques, while a comprehensive study across many common techniques is missing. In this paper, we comprehensively explore the most popular techniques used to identify diabetes (e.g., DNN (Deep Neural Network) and SVM (Support Vector Machine)), together with data preprocessing methods. We evaluate these techniques by their cross-validation accuracy on the Pima Indian data set. We compare the accuracy of each classifier under several data preprocessors, and we tune the parameters to improve accuracy. The best technique we find achieves 77.86% accuracy under 10-fold cross-validation. We also analyze the relevance of each feature to the classification result.
Keywords: Machine learning; Deep neural network; Classification; Diabetes identification
Conference Name: 2018 IEEE 4th World Forum on Internet of Things (WF-IoT’18)
Location: Singapore, Singapore
Publisher: IEEE
Year: 2018
Accepted PDF File: A_Comprehensive_Exploration_to_the_Machine_Learning_Techniques_for_Diabetes_accepted.pdf
Permanent Link: https://doi.org/10.1109/WF-IoT.2018.8355130
Reference: S. Wei, X. Zhao, and C. Miao, “A comprehensive exploration to the machine learning techniques for diabetes identification,” in Proceedings of the 2018 IEEE 4th World Forum on Internet of Things (WF-IoT’18). IEEE, February 2018, pp. 291–295.
BibTeX:
@inproceedings{LILY-c147,
   author    = {Wei, Sidong and Zhao, Xuejiao and Miao, Chunyan},
   title     = {A Comprehensive Exploration to the Machine Learning Techniques for Diabetes Identification},
   booktitle = {Proceedings of the 2018 IEEE 4th World Forum on Internet of Things (WF-IoT'18)},
   year      = {2018},
   month     = {February},
   pages     = {291--295},
   location  = {Singapore, Singapore},
   publisher = {IEEE},
}