Research Article

Performance Evaluation of Machine Learning Models for Cardiovascular Disease Prediction

Authors

  • Nuraini Usman Department of International Programmes, Jigawa State Polytechnic for Information and Communication Technology (JSPICT), Kazaure, Nigeria https://orcid.org/0009-0008-4287-7456

    message2nuraini@gmail.com

  • Babawuro Usman Department of Computer Science, Faculty of Computing and Mathematical Sciences (FACMS), Aliko Dangote University of Science and Technology (ADUSTECH), Wudil, Kano State, Nigeria
  • Zahriah Binti Sahri Department of Intelligence Computing, Universiti Teknikal Malaysia Melaka (UTeM), Durian Tunggal, Melaka, Malayisa

Abstract

Cardiovascular disease (CVD) is a deadly health issue that requires urgent attention due to the increasing global population. Using technological solutions, Machine Learning (ML) could be used to tackle health challenges in medical institutions. However, CVD diagnosis often involves multiple diagnostic procedures, leading to high medical errors. Using high-frequency and gamma rays in diagnostic devices exposes patients to high risks of other diseases. The acquisition of these devices is challenging in Low and Middle-Income Countries (LMICs), where patients are less privileged, and families lose their lives due to the inability to afford them. Healthcare institutions are utilizing the adoption of Artificial Intelligence (AI) to provide innovative solutions to these issues. In this research, the dataset was obtained from Kaggle. The Cross Industry Standard Process for Data Mining (CRISP-DM) framework was used. We selected five phases excluding the Deployment phase from the framework. Then, experiments conducted starting with exploratory data analysis, data cleansing, data visualization, transforming the dataset then splitting the CVD dataset, allocating 80% of the dataset to training and the remaining 20% as a testing dataset. We applied the different ML algorithms, where we achieved the best accuracy from Random Forest, Extra Tree Classifier, and Decision Tree all with 98.18% on the training dataset, for further experimental selection we tested the dataset where we obtained the following percentages: Random Forest 79.22%, Extra Tree Classifier 78.68% and Decision Tree 73.48%. Finally, the experiment featured the Random Forest algorithm as the best classifier compared to the rest as mentioned above, as it is more robust with the ability to handle non-linear and complex relationships making it more effective for CVD models.

Keywords:

Cardiovascular Disease CRISP-DM Framework Machine Learning Models Performance Evaluation Prediction Models

Article information

Journal

Journal of Computer, Software, and Program

Volume (Issue)

2(1), (2025)

Pages

39-48

Published

12-08-2025

How to Cite

Usman, N., Usman, B., & Sahri, Z. B. (2025). Performance Evaluation of Machine Learning Models for Cardiovascular Disease Prediction. Journal of Computer, Software, and Program, 2(1), 39-48. https://doi.org/10.69739/jcsp.v2i1.744

References

Ahmad, H. F., Mukhtar, H., Alaqail, H., Seliaman, M., & Alhumam, A. (2021). Investigating health-related features and their impact on the prediction of diabetes using machine learning. Applied Sciences, 11(3), 1173.

Alić, B., Gurbeta, L., & Badnjević, A. (2017, June). Machine learning techniques for classification of diabetes and cardiovascular diseases. In 2017 6th mediterranean conference on embedded computing (MECO) (pp. 1-4). IEEE.

Balakrishnan, M., Arockia Christopher, A. B., Ramprakash, P., & Logeswari, A. (2021). Prediction of Cardiovascular Disease using Machine Learning. Journal of Physics: Conference Series, 1767(1), 1–7. https://doi.org/10.1088/1742-6596/1767/1/012013

Behera, S., Pramanik, K., & K Nayak, M. (2015). Recent advancement in the treatment of cardiovascular diseases: Conventional therapy to nanotechnology. Current pharmaceutical design, 21(30), 4479-4497. https://doi.org/10.2174/1381612821666150817104635

Bosnjak, Z., Grljevic, O., & Bosnjak, S. (2009, May). CRISP-DM as a framework for discovering knowledge in small and medium sized enterprises’ data. In 2009 5th International Symposium on Applied Computational Intelligence and Informatics (pp. 509-514). IEEE. https://doi.org/10.1109/SACI.2009.5136302

Cardiovascular Diseases - Repatriation Medical Authority. (n.d.). Retrieved July 2, 2025, from https://www.rma.gov.au/sops/category/cardiovascular-diseases

Chunko, M. T., Hayunga, E., Rothenberg, K., Rudick, J., & Pinn, V. W. (1996). Office of Research on Women’s Health National Institutes of Health. Journal of Women’s Health, 5(4), 317–323. https://doi.org/10.1089/jwh.1996.5.317

Dasgupta, A., & Wendler, S. (2019). AI adoption strategies. Retrieved September, 18, 2022.

Ej, B., Muntner, P., & Alonso, A. (2019). AHA 2019 Heart Disease and Stroke Statistics. American College of Cardiology, 2–5.

Ekwonwune, E. N., Ubochi, C. I., Duruoha, A. E., Duru, N. A., Ekwonwune, E. N., Ubochi, C. I., Duruoha, A. E., & Duru, N. A. (2023). Development of a Data Mining Model to Detect Cardiovascular Disease. International Journal of Communications, Network and System Sciences, 16(5), 77–96. https://doi.org/10.4236/IJCNS.2023.165006

Gürfidan, R., & Ersoy, M. (2021). Classification of death related to heart failure by machine learning algorithms. Advances in Artificial Intelligence Research, 1(1), 13-18.

Hussain, L., Aziz, W., Khan, I. R., Alkinani, M. H., & Alowibdi, J. S. (2021). Machine learning based congestive heart failure detection using feature importance ranking of multimodal features. Mathematical Biosciences and Engineering, 18(1), 69-91. https://doi.org/10.3934/mbe.2021004

Junwei, K., Yang, H., Junjiang, L., & Zhijun, Y. (2019). Dynamic prediction of cardiovascular disease using improved LSTM. International Journal of Crowd Science, 3(1), 14-25. https://doi.org/10.1108/IJCS-01-2019-0002

Kanikar, P., & Shah, D. R. (2016). Prediction of cardiovascular diseases using support vector machine and Bayesian classification. International Journal of Computer Applications, 156(2), 9-13.

Jevin, M. J., Jayant, H., Sanjay, R., Hemasai, V., & Venkatasrinivas, P. V. (2023). Heart disease identification method using machine learning classification in e-healthcare. Heart Disease, 10(3). https://doi.org/10.1109/ACCESS.2020.3001149

Khan Mamun, M. M. R., & Elfouly, T. (2023). Detection of Cardiovascular Disease from Clinical Parameters Using a One-Dimensional Convolutional Neural Network. Bioengineering, 10(7). https://doi.org/10.3390/BIOENGINEERING10070796,

Kolukisa, B., Yavuz, L., Soran, A., Bakir-Gungor, B., Tuncer, D., Onen, A., & Gungor, V. C. (2020). Coronary artery disease diagnosis using optimized adaptive ensemble machine learning algorithm. International Journal of Bioscience, Biochemistry and Bioinformatics, 10(1), 58-65. https://doi.org/10.17706/ijbbb.2020.10.1.58-65

Konieczny, R., & Idczak, R. (2016). Mössbauer study of Fe-Re alloys prepared by mechanical alloying. Hyperfine Interactions, 237(1), 1–8. https://doi.org/10.1007/s10751-016-1232-6

Lopez, E. O., Ballard, B. D., & Jan, A. (2023). Cardiovascular Disease. StatPearls Publishing. https://www.ncbi.nlm.nih.gov/books/NBK535419/

Louridi, N., Amar, M., & Ouahidi, B. El. (2019). Identification of Cardiovascular Diseases Using Machine Learning. 7th Mediterranean Congress of Telecommunications 2019, CMT 2019 (1–6). https://doi.org/10.1109/CMT.2019.8931411

Nasteski, V. (2017). An overview of the supervised machine learning methods. Horizons. b, 4(51-62), 56. https://doi.org/10.20544/horizons.b.04.1.17.p05

Nilsson, N. J. (2005). Introduction to machine learning an early draft of a proposed textbook Department of Computer Science. Machine Learning, 56(2), 387–399.

Radiation risk from medical imaging - Harvard Health. (n.d.). Retrieved July 2, 2025, from https://www.health.harvard.edu/cancer/radiation-risk-from-medical-imaging

Ravindhar, N. V., Anand, K., Hariharan, S., Ragavendran, & Winster, S. G. (2019). Intelligent diagnosis of cardiac disease prediction using machine learning. International Journal of Innovative Technology and Exploring Engineering, 8(11), 1417–1421. https://doi.org/10.35940/ijitee.J9765.0981119

Rong, G., Mendez, A., Bou, E., Zhao, B., & Sawan, M. (2020). Artificial Intelligence in Healthcare : Review and Prediction Case Studies. Engineering, 6(3), 291–301. https://doi.org/10.1016/j.eng.2019.08.015

Ruwanpathirana, T., Owen, A., & Reid, C. M. (2015). Review on Cardiovascular Risk Prediction. Cardiovascular Therapeutics, 33(2), 62–70. https://doi.org/10.1111/1755-5922.12110

Sabab, S. A., Munshi, M. A. R., Pritom, A. I., & Shihabuzzaman, S. (2017). Cardiovascular disease prognosis using effective classification and feature selection technique. 1st International Conference on Medical Engineering, Health Informatics and Technology, MediTec 2016 (pp. 2–7). https://doi.org/10.1109/MEDITEC.2016.7835374

Samoili, S., López Cobo, M., Gómez, E., De Prato, G., Martínez-Plumed, F., & Delipetrev, B. (2020). AI Watch - Defining Artificial Intelligence. Towards an operational definition and taxonomy of artificial intelligence. In Joint Research Centre (European Commission). https://doi.org/10.2760/382730

Spiegel, P. K. (1995). The first clinical X-ray made in America--100 years. American Journal of Roentgenology, 164(1), 241–243. https://doi.org/10.2214/AJR.164.1.7998549

Srivenkatesh, M. (2020). Prediction of cardiovascular disease using machine learning algorithms. International Journal of Engineering and Advanced Technology, 9(3), 2404-2414. https://doi.org/10.35940/ijeat.B3986.029320

Terrada, O., Hamida, S., Cherradi, B., Raihani, A., & Bouattane, O. (2020). Supervised machine learning based medical diagnosis support system for prediction of patients with heart disease. Advances in Science, Technology and Engineering Systems, 5(5), 269–277. https://doi.org/10.25046/AJ050533

Tripathi, K., Garg, H., & Sharma, H. (2020). A comprehensive survey on cardiovascular disease. International Journal of Grid and Distributed Computing, 13(1), 1772-1780.

Tsipouras, M. G., Exarchos, T. P., Fotiadis, D. I., Kotsia, A. P., Vakalis, K. V., Naka, K. K., & Michalis, L. K. (2008). Automated diagnosis of coronary artery disease based on data mining and fuzzy modeling. IEEE Transactions on information technology in biomedicine, 12(4), 447-458.

Ware, S., Rakesh, S., & Choudhary, B. (2020). Heart Attack Prediction by using Machine Learning Techniques. International Journal of Recent Technology and Engineering, 8(5), 1577–1580. https://doi.org/10.35940/ijrte.d9439.018520

Yadav, S. S., Jadhav, S. M., Nagrale, S., & Patil, N. (2020). Application of Machine Learning for the Detection of Heart Disease. 2nd International Conference on Innovative Mechanisms for Industry Applications, ICIMIA 2020 - Conference Proceedings, Icimia (pp. 165–172). https://doi.org/10.1109/ICIMIA48430.2020.9074954

Yang, L., Wu, H., Jin, X., Zheng, P., Hu, S., Xu, X., Yu, W., & Yan, J. (2020). Study of cardiovascular disease prediction model based on random forest in eastern China. Scientific Reports, 10(1), 1–8. https://doi.org/10.1038/s41598-020-62133-5

Zhang, C., Qu, J., Li, W., & Zheng, L. (2020, December). Predicting cardiovascular events by machine learning. In Journal of Physics: Conference Series (Vol. 1693, No. 1, p. 012093). IOP Publishing. https://doi.org/10.1088/1742-6596/1693/1/012093

Downloads

Views

67

Downloads

16