Enhancing Diabetes Diagnosis with Confidence-Calibrated Adaptive Weighting and Multi-Model Ensemble Framework
DOI:
https://doi.org/10.54392/irjmt25415Keywords:
Diabetes Prediction, Machine Learning, Hot Deck Imputation, Confidence-Calibrated Adaptive Weighting (CCAW), Ensemble Learning, Model Confidence, Entropy-Based Weighting, Medical Diagnosis, Cross-Validation, Classification AccuracyAbstract
Metabolic conditions like diabetes often show up as high blood sugar levels, or hyperglycemia, which happens when the body doesn't make enough insulin. If this goes unnoticed or untreated, it can cause serious damage to critical organs such as the eyes, kidneys, nerves, heart, and blood vessels. Recently, machine learning and other computational techniques have shown great potential for predicting who might develop diabetes. But there's still room to improve how accurate and reliable these models are. In this work introduce a step-by-step machine learning approach to predict and diagnose diabetes using three different datasets. First, used Hot Deck Imputation (HDeckImp) to fill in missing data and make the classifications more accurate. Next, we apply K-fold cross-validation to test how well the model holds up with different data splits, ensuring robustness. Then, we incorporate multiple classifiers like Random Forest, XGBoost, AdaBoost, and Bagging to strengthen predictions. Finally, we developed a new weighting method called Confidence-Calibrated Adaptive Weighting (CCAW). This method dynamically gives more weight to models that are both accurate and confident, measured through information entropy, so the best models have more influence on the final outcome. Experiments show that this approach, especially with CCAW, reaches a top accuracy of 98.9% on the Frankfurt dataset, beating previous ensemble methods such as ADR-W. This improvement not only enhances prediction accuracy but also makes the model stable, which is essential for practical clinical use.
References
World Health Organization, (2023) Diabetes country profiles 2023: India. WHO Global Report.
P. Zimmet, K.G. Alberti, J. Shaw, Global and societal implications of the diabetes epidemic. Nature 414(6865), (2001) 782–787. https://doi.org/10.1038/414782a
R. Unnikrishnan, V. Mohan, Diabetes in South Asians: Is the phenotype different?. Nature Reviews Endocrinology, 12(1), (2016) 5–6. https://doi.org/10.1038/nrendo.2016.53
N.H. Cho, J.E. Shaw, S. Karuranga, Y. Huang, J.D. da Rocha Fernandes, A.W. Ohlrogge, B. Malanda, IDF Diabetes Atlas: Global estimates of diabetes prevalence for 2017 and projections for 2045. Diabetes Research and Clinical Practice, 138, (2018) 271–281. https://doi.org/10.1016/j.diabres.2018.02.023
N.P. Tigga, S. Garg, (2020) September. Predicting type 2 diabetes using logistic regression. In Proceedings of the Fourth International Conference on Microelectronics, Computing and Communication Systems: MCCS 2019, Springer Singapore. https://doi.org/10.1007/978-981-15-5546-6_42
B.S. Ahamed, M.S. Arya, A.O.V. Nancy, Prediction of type-2 diabetes mellitus disease using machine learning classifiers and techniques. Frontiers in Computer Science, 4, (2022) 835242. https://doi.org/10.3389/fcomp.2022.835242
H.M. Deberneh, I. Kim, Prediction of type 2 diabetes based on machine learning algorithm. International journal of environmental research and public health, 18(6), (2021) 3317. https://doi.org/10.3390/ijerph18063317
R.D. Joshi, C.K. Dhakal, Predicting type 2 diabetes using logistic regression and machine learning approaches. International Journal of Environmental Research and Public Health 18(14), (2021) 7346. https://doi.org/10.3390/ijerph18147346
B.S. Ahamed, M.S. Arya, A.O.V. Nancy, Prediction of type-2 diabetes mellitus disease using machine learning classifiers and techniques. Frontiers in Computer Science 4, (2022) 835242. https://doi.org/10.3389/fcomp.2022.835242
K. Al Sadi, W. Balachandran, Prediction Model of Type 2 Diabetes Mellitus for Oman Prediabetes Patients Using Artificial Neural Network and Six Machine Learning Classifiers. Applied Sciences, 13(4), (2023) 2344. https://doi.org/10.3390/app13042344
K.K. Patro, J.P. Allam, U. Sanapala, C.K. Marpu, N.A. Samee, M. Alabdulhafith, P. Plawiak, An effective correlation-based data modeling framework for automatic diabetes prediction using machine and deep learning techniques. BMC Bioinformatics, 24(1), (2023) 372. https://doi.org/10.1186/s12859-023-05488-6
K.M. Kuo, P. Talley, Y. Kao, C.H. Huang, A multi-class classification model for supporting the diagnosis of type II diabetes mellitus. Peer J, 8, (2020) e9920. https://doi.org/10.7717/peerj.9920
B.S. Ahamed, M.S. Arya, S.K.B. Sangeetha, N.V. Auxilia Osvin, Diabetes Mellitus Disease Prediction and Type Classification Involving Predictive Modeling Using Machine Learning Techniques and Classifiers. Applied Computational Intelligence and Soft Computing, 2022, (2022) 2022. https://doi.org/10.1155/2022/7899364
C. Mallika, S. Selvamuthukumaran, Hybrid Online Model for Predicting Diabetes Mellitus. Intelligent Automation & Soft Computing, 31(3), (2022) 1873-1885. https://doi.org/10.32604/iasc.2022.020543
M. Kumari, V. Kumar, S.K. Verma, R.K. Mahli, Evaluation of lipid profile and glycated haemoglobin in type 2 diabetic patients: A retrospective study. International Journal of Health Sciences, (II), (2022) 4372-4383. https://doi.org/10.53730/ijhs.v6nS2.6009
Z.M. Alhakeem, H. Hakim, O.A. Hasan, A.A. Laghari, A.K. Jumani, M.N. Jasm, Prediction of diabetic patients in Iraq using binary dragonfly algorithm with long-short term memory neural network. AIMS Electronics & Electrical Engineering, 7(3), (2023) 217-230. https://doi.org/10.3934/electreng.2023013
M.O. Edeh, O.I. Khalaf, C.A. Tavera, S. Tayeb, S. Ghouali, G.M. Abdulsahib, N.E. Richard-Nnabu, A. Louni, A classification algorithm-based hybrid diabetes prediction model. Frontiers in Public Health 10, (2022) 829519. https://doi.org/10.3389/fpubh.2022.829519
T. Beghriche, M. Djerioui, Y. Brik, B. Attallah, S.B. Belhaouari, An efficient prediction system for diabetes disease based on deep neural network. Complexity, 2021 (2021) 1-14. https://doi.org/10.1155/2021/6053824
UCI Machine Learning Repository. (2025) Pima Indians Diabetes Database. https://www.kaggle.com/datasets/uciml/pima-indians-diabetes-database
Mendeley Data, (2025) Iraqi Patient Dataset for Diabetes.
Frankfurt Diabetes Dataset, (2025) Frankfurt Diabetes Dataset.
K. Abnoosian, R. Farnoosh, M. H. Behzadi, Prediction of diabetes disease using an ensemble of machine learning multi-classifier models. BMC Bioinformatics, 24(1), (2023) 337. https://doi.org/10.1186/s12859-023-05465-z
H. Naz, S. Ahuja, Deep learning approach for diabetes prediction using PIMA Indian dataset," Journal of Diabetes & Metabolic Disorders 19, (2020) 391-403. https://doi.org/10.1007/s40200-020-00520-5
M.T. García-Ordás, C. Benavides, J.A. Benítez-Andrades, H. Alaiz-Moretón, I. García-Rodríguez, Diabetes detection using deep learning techniques with oversampling and feature augmentation. Computer Methods and Programs in Biomedicine, 202, (2021) 105968. https://doi.org/10.1016/j.cmpb.2021.105968
A. Kishor, C. Chakraborty, Early and accurate prediction of diabetics based on FCBF feature selection and SMOTE. International Journal of System Assurance Engineering and Management, (2024) 4649–4657. https://doi.org/10.1007/s13198-021-01174-z
M. Shrestha, O.H. Alsadoon, A. Alsadoon, T. Al-Dala’in, T.A. Rashid, P.W.C. Prasad, A. Alrubaie, A novel solution of deep learning for enhanced support vector machine for predicting the onset of type 2 diabetes. Multimedia Tools and Applications, 82(4), (2023) 6221-6241. https://doi.org/10.1007/s11042-022-13582-9
S.M. Ganie, M.B. Malik, An ensemble machine learning approach for predicting type-II diabetes mellitus based on lifestyle indicators. Healthcare Analytics 2, (2022) 100092. https://doi.org/10.1016/j.health.2022.100092
P. Madan, V. Singh, V. Chaudhari, Y. Albagory, A. Dumka, R. Singh, A. Gehlot, M. Rashid, S.S. Alshamrani, A.S. AlGhamdi, An optimization-based diabetes prediction model using CNN and Bi-directional LSTM in real-time environment. Applied Sciences, 12(8), (2022) 3989. https://doi.org/10.3390/app12083989
K. Azbeg, M. Boudhane, O. Ouchetto, S. Jai Andaloussi, Diabetes emergency cases identification based on a statistical predictive model. Journal of Big Data, 9(1), (2022) 1-25. https://doi.org/10.1186/s40537-022-00582-7
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Usha V, Rajalakshmi N.R (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.