CFP last date
22 April 2024
Reseach Article

Diabetic Patient’s Data Classification and Prediction using Machine Learning Ensemble Algorithm

by M. Hemalatha
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 184 - Number 19
Year of Publication: 2022
Authors: M. Hemalatha
10.5120/ijca2022922209

M. Hemalatha . Diabetic Patient’s Data Classification and Prediction using Machine Learning Ensemble Algorithm. International Journal of Computer Applications. 184, 19 ( Jun 2022), 14-18. DOI=10.5120/ijca2022922209

@article{ 10.5120/ijca2022922209,
author = { M. Hemalatha },
title = { Diabetic Patient’s Data Classification and Prediction using Machine Learning Ensemble Algorithm },
journal = { International Journal of Computer Applications },
issue_date = { Jun 2022 },
volume = { 184 },
number = { 19 },
month = { Jun },
year = { 2022 },
issn = { 0975-8887 },
pages = { 14-18 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume184/number19/32426-2022922209/ },
doi = { 10.5120/ijca2022922209 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T01:21:52.211719+05:30
%A M. Hemalatha
%T Diabetic Patient’s Data Classification and Prediction using Machine Learning Ensemble Algorithm
%J International Journal of Computer Applications
%@ 0975-8887
%V 184
%N 19
%P 14-18
%D 2022
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In this research paper, the diabetic patent dataset is collected from Indian Pima dataset for Indians. The data is understood and visualized by using Pearson correlation statistics method. According to survey 65% of this data set is non-Diabetics and 35% of Indians are Diabetics. The data is understood better by statistics and visualization. A certain pre-processing of data is performed before applying machine learning algorithms. Then machine learning algorithms are carried out on Indian diabetic data set. The ensemble (Random forest) algorithm has got good performance metrics compared to other existing algorithms. The Random forest algorithm gave outperform results compared to MLP (Multi Layer perception classifier) classifier, Support Vector Machine (SVM) classifier and LR (Logistic Regression) algorithms. The performance metrics of machine learning algorithms are calculated using confusion matrix.

References
  1. World Health Organization, Global Action Plan on Physical Activity 2018-2030: More Active People for a Healthier World, World Health Organization, Geneva, Switzerland, 2019.
  2. Marx V. Biology: the big challenges of big data. Nature Jun 13 2013;498 (7453): 255–60. http://dx.doi.org/10.1038/498255a. Server/Data Center
  3. Wilson RA, Keil FC. The MIT encyclopaedia of the cognitive sciences. MIT Press; 1999.
  4. R. Williams, S. Karuranga, B. Malanda et al., “Global and regional estimates and projections of diabetes-related health expenditure: results from the international diabetes federation diabetes atlas,” Diabetes Research and Clinical Practice, vol. 162, Article ID 108072, 2020.
  5. American Diabetes Association, “Diagnosis and classification of diabetes mellitus,” Diabetes Care, vol. 37, no. Supplement 1, pp. S81–S90, 2014.
  6. G. Acciaroli, M. Vettoretti, A. Facchinetti, and G. Sparacino, “Calibration of minimally invasive continuous glucose monitoring sensors: state-of-the-art and current perspectives,” Biosensors, vol. 8, no. 1, 2018.
  7. D. Bruen, C. Delaney, L. Florea, and D. Diamond, “Glucose sensing for diabetes monitoring: recent developments,” Sensors, vol. 17, no. 8, 2017.
  8. S. Wadhwa and K. Babber, “Artificial intelligence in health care: predictive analysis on diabetes using machine learning algorithms,” in Proceeding of the International Conference on Computational Science and Its Applications, pp. 354–366, Springer, Cagliari, Italy, July 2020.
Index Terms

Computer Science
Information Sciences

Keywords

Machine learning Random forest Ensemble and confusion matrix.