COMPARISON OF ENSEMBLE LEARNING ALGORITHM IN CLASSIFYING EARLY DIAGNOSTIC OF DIABETES

Main Article Content

Okta Jaya Harmaja
Irvan Prasetia
Yosi Victor Hutagalung
Hendra Ardanis Sirait

Abstract

Diabetes is a significant public health problem and affects millions of people worldwide. This study will perform a comparative analysis of three ensemble learning algorithms (Random Forest, AdaBoost, and XGBoost) in classifying diabetes diagnoses. Based on the research that has been carried out, it is concluded that the model with the highest accuracy is Random Forest with a value of 0.86, XGBoost with a value of 0.85, and AdaBoost with a value of 0.82. It can also be concluded that the three models perform well and can be used to classify diabetes. Based on the visualization of the results of Feature Importance that has been made, it can be concluded that the Random Forest and XGBoost algorithms have in common the 3 most important features, namely Glucose, BMI and Age. As for AdaBoost, the 3 most important features are DPF, BMI and Glucose.

Article Details

How to Cite
[1]
O. J. Harmaja, I. Prasetia, Y. V. Hutagalung, and H. A. Sirait, “COMPARISON OF ENSEMBLE LEARNING ALGORITHM IN CLASSIFYING EARLY DIAGNOSTIC OF DIABETES”, JUSIKOM PRIMA, vol. 7, no. 1, pp. 218-231, Aug. 2023.
Section
Articles

Most read articles by the same author(s)