CFP last date
20 June 2025
Reseach Article

Comparative Analysis of Machine Learning Algorithms for Heart Disease Prediction

by Mohammad Belal Aziz, Syed Wajahat Abbas Rizvi
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 187 - Number 5
Year of Publication: 2025
Authors: Mohammad Belal Aziz, Syed Wajahat Abbas Rizvi
10.5120/ijca2025924890

Mohammad Belal Aziz, Syed Wajahat Abbas Rizvi . Comparative Analysis of Machine Learning Algorithms for Heart Disease Prediction. International Journal of Computer Applications. 187, 5 ( May 2025), 62-65. DOI=10.5120/ijca2025924890

@article{ 10.5120/ijca2025924890,
author = { Mohammad Belal Aziz, Syed Wajahat Abbas Rizvi },
title = { Comparative Analysis of Machine Learning Algorithms for Heart Disease Prediction },
journal = { International Journal of Computer Applications },
issue_date = { May 2025 },
volume = { 187 },
number = { 5 },
month = { May },
year = { 2025 },
issn = { 0975-8887 },
pages = { 62-65 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume187/number5/comparative-analysis-of-machine-learning-algorithms-for-heart-disease-prediction/ },
doi = { 10.5120/ijca2025924890 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2025-05-29T00:02:58.447887+05:30
%A Mohammad Belal Aziz
%A Syed Wajahat Abbas Rizvi
%T Comparative Analysis of Machine Learning Algorithms for Heart Disease Prediction
%J International Journal of Computer Applications
%@ 0975-8887
%V 187
%N 5
%P 62-65
%D 2025
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Diagnosing heart disease in medical facilities is of high importance as it is the leading cause of death and requires precise predictive algorithms. The seven classification techniques considered in the research are Logistic Regression, Decision Tree, Random Forest, SVM, XGBoost, Stacking Ensemble, Neural Network. The study measured models based on ROC-AUC, accuracy and recall among the chief metrics across both the model training and test evaluations. All models saw their peak of accuracy at 84.53%, with Logistic Regression proving to be the most accurate of the Random Forest and SVM. During evaluation, heart disease cases detection was far from optimal – Decision Trees reported 27% recall and Neural Networks a paltry 10%. The study reveals that organizations experience a tradeoff between accuracy and recall which underscores the need of using tactics such as ensemble learning and data augmentation to achieve superior sensitivity performance. Investigations will attempt to enhance the case detection rates while still upholding high predictive power applicable to medicine.

References
  1. Detrano, R., et al., “International application of a new probability algorithm for the diagnosis of coronary artery disease,” The American Journal of Cardiology, 1989.
  2. Aha, D. W., Kibler, D., & Albert, M. K., “Instance-based learning algorithms,” Machine Learning, 1991.
  3. Breiman, L., “Random forests,” Machine Learning, 2001.
  4. LeCun, Y., Bengio, Y., & Hinton, G., “Deep learning,” Nature, 2015.
  5. Wolpert, D. H., “Stacked generalization,” Neural Networks, 1992.
  6. Quinlan, J. R., “Induction of decision trees,” Machine Learning, 1986.
  7. Chen, T., &Guestrin, C., “XGBoost: A scalable tree boosting system,” Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.
  8. Ribeiro, M. T., Singh, S., &Guestrin, C., “Why should I trust you? Explaining the predictions of any classifier,” Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.
  9. Rokach, L., “Ensemble-based classifiers,” Artificial Intelligence Review, 2010.
  10. Weng, S. F., Reps, J., Kai, J., et al., “Can machine-learning improve cardiovascular risk prediction using routine clinical data?” PLOS ONE, 2017.
  11. H. J. Escudero et al., “A Machine Learning-Based Model for Early Detection of Cardiovascular Diseases,” IEEE Journal of Biomedical and Health Informatics, vol. 24, no. 8, pp. 2342–2351, Aug. 2020.
  12. M. M. R. Krishnan and S. V. Thomas, “Feature Selection and Ensemble Learning for Heart Disease Prediction,” in Proc. IEEE Int. Conf. Bioinformatics and Biomedicine (BIBM), 2021, pp. 980–985.
  13. A. R. Fariha, M. I. Hossain, and K. Andersson, “Comparative Analysis of Deep Learning and Machine Learning Approaches for Cardiovascular Disease Prediction,” IEEE Access, vol. 9, pp. 39136–39148, 2021.
  14. B. S. Karthikeyan and R. Manavalan, “Hybrid Deep Learning Model for Accurate Prediction of Heart Disease,” IEEE Transactions on Computational Social Systems, vol. 9, no. 2, pp. 345–356, Feb. 2022.
  15. J. H. Friedman, “Greedy Function Approximation: A Gradient Boosting Machine,” Annals of Statistics, vol. 29, no. 5, pp. 1189–1232, 2001.
  16. S. Shickel, P. J. Tighe, A. Bihorac, and P. Rashidi, “Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record Analysis,” IEEE Journal of Biomedical and Health Informatics, vol. 22, no. 5, pp. 1589–1604, Sep. 2018.
  17. A. Alotaibi, “Implementation of Machine Learning Model to Predict Heart Disease,” International Journal of Advanced Computer Science and Applications, vol. 10, no. 6, pp. 1–7, 2019.
  18. Y. Zhang, R. Yao, L. Sun, and Z. He, “A Hybrid Intelligent System for Predicting Heart Disease Using Machine Learning Algorithms,” Computational and Mathematical Methods in Medicine, vol. 2021, Article ID 3272531, pp. 1–12, 2021.
  19. M. H. Mozaffarian et al., “Cardiovascular Disease Prediction Using Feature Selection and Machine Learning Techniques,” Expert Systems with Applications, vol. 198, pp. 116923, 2022.
  20. P. Ghosh, S. B. Kundu, and R. Kumar, “A Novel Deep Learning-Based Approach for Early Detection of Heart Disease,” in Proc. IEEE Int. Conf. Emerging Technologies and Intelligent Systems (ICETIS), 2021, pp. 345–350.
Index Terms

Computer Science
Information Sciences

Keywords

Heart Disease Prediction Machine Learning Neural Networks Random Forest Comparative Analysis Framingham Dataset