Article section
Performance Evaluation of Machine Learning Models for Cardiovascular Disease Prediction
Abstract
Cardiovascular disease (CVD) is a deadly health issue that requires urgent attention due to the increasing global population. Using technological solutions, Machine Learning (ML) could be used to tackle health challenges in medical institutions. However, CVD diagnosis often involves multiple diagnostic procedures, leading to high medical errors. Using high-frequency and gamma rays in diagnostic devices exposes patients to high risks of other diseases. The acquisition of these devices is challenging in Low and Middle-Income Countries (LMICs), where patients are less privileged, and families lose their lives due to the inability to afford them. Healthcare institutions are utilizing the adoption of Artificial Intelligence (AI) to provide innovative solutions to these issues. In this research, the dataset was obtained from Kaggle. The Cross Industry Standard Process for Data Mining (CRISP-DM) framework was used. We selected five phases excluding the Deployment phase from the framework. Then, experiments conducted starting with exploratory data analysis, data cleansing, data visualization, transforming the dataset then splitting the CVD dataset, allocating 80% of the dataset to training and the remaining 20% as a testing dataset. We applied the different ML algorithms, where we achieved the best accuracy from Random Forest, Extra Tree Classifier, and Decision Tree all with 98.18% on the training dataset, for further experimental selection we tested the dataset where we obtained the following percentages: Random Forest 79.22%, Extra Tree Classifier 78.68% and Decision Tree 73.48%. Finally, the experiment featured the Random Forest algorithm as the best classifier compared to the rest as mentioned above, as it is more robust with the ability to handle non-linear and complex relationships making it more effective for CVD models.
Keywords:
Cardiovascular Disease CRISP-DM Framework Machine Learning Models Performance Evaluation Prediction Models
Article information
Journal
Journal of Computer, Software, and Program
Volume (Issue)
2(1), (2025)
Pages
39-48
Published
Copyright
Copyright (c) 2025 Nuraini Usman, Babawuro Usman, Zahriah Binti Sahri (Author)
Open access

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
References
Ahmad, H. F., Mukhtar, H., Alaqail, H., Seliaman, M., & Alhumam, A. (2021). Investigating health-related features and their impact on the prediction of diabetes using machine learning. Applied Sciences, 11(3), 1173.
Alić, B., Gurbeta, L., & Badnjević, A. (2017, June). Machine learning techniques for classification of diabetes and cardiovascular diseases. In 2017 6th mediterranean conference on embedded computing (MECO) (pp. 1-4). IEEE.
Balakrishnan, M., Arockia Christopher, A. B., Ramprakash, P., & Logeswari, A. (2021). Prediction of Cardiovascular Disease using Machine Learning. Journal of Physics: Conference Series, 1767(1), 1–7. https://doi.org/10.1088/1742-6596/1767/1/012013
Behera, S., Pramanik, K., & K Nayak, M. (2015). Recent advancement in the treatment of cardiovascular diseases: Conventional therapy to nanotechnology. Current pharmaceutical design, 21(30), 4479-4497. https://doi.org/10.2174/1381612821666150817104635
Bosnjak, Z., Grljevic, O., & Bosnjak, S. (2009, May). CRISP-DM as a framework for discovering knowledge in small and medium sized enterprises’ data. In 2009 5th International Symposium on Applied Computational Intelligence and Informatics (pp. 509-514). IEEE. https://doi.org/10.1109/SACI.2009.5136302
Cardiovascular Diseases - Repatriation Medical Authority. (n.d.). Retrieved July 2, 2025, from https://www.rma.gov.au/sops/category/cardiovascular-diseases
Chunko, M. T., Hayunga, E., Rothenberg, K., Rudick, J., & Pinn, V. W. (1996). Office of Research on Women’s Health National Institutes of Health. Journal of Women’s Health, 5(4), 317–323. https://doi.org/10.1089/jwh.1996.5.317
Dasgupta, A., & Wendler, S. (2019). AI adoption strategies. Retrieved September, 18, 2022.
Ej, B., Muntner, P., & Alonso, A. (2019). AHA 2019 Heart Disease and Stroke Statistics. American College of Cardiology, 2–5.
Ekwonwune, E. N., Ubochi, C. I., Duruoha, A. E., Duru, N. A., Ekwonwune, E. N., Ubochi, C. I., Duruoha, A. E., & Duru, N. A. (2023). Development of a Data Mining Model to Detect Cardiovascular Disease. International Journal of Communications, Network and System Sciences, 16(5), 77–96. https://doi.org/10.4236/IJCNS.2023.165006
Gürfidan, R., & Ersoy, M. (2021). Classification of death related to heart failure by machine learning algorithms. Advances in Artificial Intelligence Research, 1(1), 13-18.
Hussain, L., Aziz, W., Khan, I. R., Alkinani, M. H., & Alowibdi, J. S. (2021). Machine learning based congestive heart failure detection using feature importance ranking of multimodal features. Mathematical Biosciences and Engineering, 18(1), 69-91. https://doi.org/10.3934/mbe.2021004
Junwei, K., Yang, H., Junjiang, L., & Zhijun, Y. (2019). Dynamic prediction of cardiovascular disease using improved LSTM. International Journal of Crowd Science, 3(1), 14-25. https://doi.org/10.1108/IJCS-01-2019-0002
Kanikar, P., & Shah, D. R. (2016). Prediction of cardiovascular diseases using support vector machine and Bayesian classification. International Journal of Computer Applications, 156(2), 9-13.
Jevin, M. J., Jayant, H., Sanjay, R., Hemasai, V., & Venkatasrinivas, P. V. (2023). Heart disease identification method using machine learning classification in e-healthcare. Heart Disease, 10(3). https://doi.org/10.1109/ACCESS.2020.3001149
Khan Mamun, M. M. R., & Elfouly, T. (2023). Detection of Cardiovascular Disease from Clinical Parameters Using a One-Dimensional Convolutional Neural Network. Bioengineering, 10(7). https://doi.org/10.3390/BIOENGINEERING10070796,
Kolukisa, B., Yavuz, L., Soran, A., Bakir-Gungor, B., Tuncer, D., Onen, A., & Gungor, V. C. (2020). Coronary artery disease diagnosis using optimized adaptive ensemble machine learning algorithm. International Journal of Bioscience, Biochemistry and Bioinformatics, 10(1), 58-65. https://doi.org/10.17706/ijbbb.2020.10.1.58-65
Konieczny, R., & Idczak, R. (2016). Mössbauer study of Fe-Re alloys prepared by mechanical alloying. Hyperfine Interactions, 237(1), 1–8. https://doi.org/10.1007/s10751-016-1232-6
Lopez, E. O., Ballard, B. D., & Jan, A. (2023). Cardiovascular Disease. StatPearls Publishing. https://www.ncbi.nlm.nih.gov/books/NBK535419/
Louridi, N., Amar, M., & Ouahidi, B. El. (2019). Identification of Cardiovascular Diseases Using Machine Learning. 7th Mediterranean Congress of Telecommunications 2019, CMT 2019 (1–6). https://doi.org/10.1109/CMT.2019.8931411
Nasteski, V. (2017). An overview of the supervised machine learning methods. Horizons. b, 4(51-62), 56. https://doi.org/10.20544/horizons.b.04.1.17.p05
Nilsson, N. J. (2005). Introduction to machine learning an early draft of a proposed textbook Department of Computer Science. Machine Learning, 56(2), 387–399.
Radiation risk from medical imaging - Harvard Health. (n.d.). Retrieved July 2, 2025, from https://www.health.harvard.edu/cancer/radiation-risk-from-medical-imaging
Ravindhar, N. V., Anand, K., Hariharan, S., Ragavendran, & Winster, S. G. (2019). Intelligent diagnosis of cardiac disease prediction using machine learning. International Journal of Innovative Technology and Exploring Engineering, 8(11), 1417–1421. https://doi.org/10.35940/ijitee.J9765.0981119
Rong, G., Mendez, A., Bou, E., Zhao, B., & Sawan, M. (2020). Artificial Intelligence in Healthcare : Review and Prediction Case Studies. Engineering, 6(3), 291–301. https://doi.org/10.1016/j.eng.2019.08.015
Ruwanpathirana, T., Owen, A., & Reid, C. M. (2015). Review on Cardiovascular Risk Prediction. Cardiovascular Therapeutics, 33(2), 62–70. https://doi.org/10.1111/1755-5922.12110
Sabab, S. A., Munshi, M. A. R., Pritom, A. I., & Shihabuzzaman, S. (2017). Cardiovascular disease prognosis using effective classification and feature selection technique. 1st International Conference on Medical Engineering, Health Informatics and Technology, MediTec 2016 (pp. 2–7). https://doi.org/10.1109/MEDITEC.2016.7835374
Samoili, S., López Cobo, M., Gómez, E., De Prato, G., Martínez-Plumed, F., & Delipetrev, B. (2020). AI Watch - Defining Artificial Intelligence. Towards an operational definition and taxonomy of artificial intelligence. In Joint Research Centre (European Commission). https://doi.org/10.2760/382730
Spiegel, P. K. (1995). The first clinical X-ray made in America--100 years. American Journal of Roentgenology, 164(1), 241–243. https://doi.org/10.2214/AJR.164.1.7998549
Srivenkatesh, M. (2020). Prediction of cardiovascular disease using machine learning algorithms. International Journal of Engineering and Advanced Technology, 9(3), 2404-2414. https://doi.org/10.35940/ijeat.B3986.029320
Terrada, O., Hamida, S., Cherradi, B., Raihani, A., & Bouattane, O. (2020). Supervised machine learning based medical diagnosis support system for prediction of patients with heart disease. Advances in Science, Technology and Engineering Systems, 5(5), 269–277. https://doi.org/10.25046/AJ050533
Tripathi, K., Garg, H., & Sharma, H. (2020). A comprehensive survey on cardiovascular disease. International Journal of Grid and Distributed Computing, 13(1), 1772-1780.
Tsipouras, M. G., Exarchos, T. P., Fotiadis, D. I., Kotsia, A. P., Vakalis, K. V., Naka, K. K., & Michalis, L. K. (2008). Automated diagnosis of coronary artery disease based on data mining and fuzzy modeling. IEEE Transactions on information technology in biomedicine, 12(4), 447-458.
Ware, S., Rakesh, S., & Choudhary, B. (2020). Heart Attack Prediction by using Machine Learning Techniques. International Journal of Recent Technology and Engineering, 8(5), 1577–1580. https://doi.org/10.35940/ijrte.d9439.018520
Yadav, S. S., Jadhav, S. M., Nagrale, S., & Patil, N. (2020). Application of Machine Learning for the Detection of Heart Disease. 2nd International Conference on Innovative Mechanisms for Industry Applications, ICIMIA 2020 - Conference Proceedings, Icimia (pp. 165–172). https://doi.org/10.1109/ICIMIA48430.2020.9074954
Yang, L., Wu, H., Jin, X., Zheng, P., Hu, S., Xu, X., Yu, W., & Yan, J. (2020). Study of cardiovascular disease prediction model based on random forest in eastern China. Scientific Reports, 10(1), 1–8. https://doi.org/10.1038/s41598-020-62133-5
Zhang, C., Qu, J., Li, W., & Zheng, L. (2020, December). Predicting cardiovascular events by machine learning. In Journal of Physics: Conference Series (Vol. 1693, No. 1, p. 012093). IOP Publishing. https://doi.org/10.1088/1742-6596/1693/1/012093