Call for Paper - March 2023 Edition
IJCA solicits original research papers for the March 2023 Edition. Last date of manuscript submission is February 20, 2023. Read More

Dimensionality Reduction Techniques for Improved Diagnosis of Heart Disease

International Journal of Computer Applications
© 2013 by IJCA Journal
Volume 61 - Number 5
Year of Publication: 2013
Swati Shilaskar
Ashok Ghatol

Swati Shilaskar and Ashok Ghatol. Article: Dimensionality Reduction Techniques for Improved Diagnosis of Heart Disease. International Journal of Computer Applications 61(5):1-8, January 2013. Full text available. BibTeX

	author = {Swati Shilaskar and Ashok Ghatol},
	title = {Article: Dimensionality Reduction Techniques for Improved Diagnosis of Heart Disease},
	journal = {International Journal of Computer Applications},
	year = {2013},
	volume = {61},
	number = {5},
	pages = {1-8},
	month = {January},
	note = {Full text available}


Medical diagnosis is an important task that needs to be executed accurately and efficiently. Medical domain complexities are represented by multidimensional heterogeneous datasets. Computer aided diagnosis must deal with processing and analyzing high dimensional data. Optimization of features in datasets reduces time and memory complexity of learning algorithms. It is necessary to have a tool that gives relationship between features and eliminate redundant ones. Feature selection or feature extraction reduce dimensions and essentially influence the performance of classifier. Many techniques have been used to determine essential features of medical data. We investigate two feature extraction techniques, Principal component analysis (PCA) and common Factor Analysis (FA) techniques for classification of heart disease. These techniques expose the structure, while maintaining the integrity of the data, thus improving diagnosis performance.


  • Alok Sharma, Godfrey C. Onwubolu, 2007. A Hybrid Approach for Modeling High Dimensional Medical Data . International Workshop on Inductive Modelling. Prague, 39-45.
  • Asha Gowda Karegowda, M. A. Jayaram, A. S . Manjunath, 2011 . Feature Subset Selection using Cascaded GA & CFS: A Filter Approach in Supervised Learning. International Journal of Computer Applications . 23–2, 10.
  • Asha. T , Dr. S. Natarajan, Dr. K. N. B. Murthy, 2010 . Diagnosis of Tuberculosis using Ensemble methods. IEEE Catalog Number: ISBN: CFP1057E-PRT . 978-1-4244-5537-9, 409-412.
  • Donald Jackson, 1993 . Stopping rules in Principal Component Analysis: A Comparison of Heuristical and Statistical Approaches . Ecology. 74, 2204-2214.
  • Hasmarina Hasan, Nooritawati Md Tahir , 2010. Feature Selection of Breast Cancer Based on Principal Component Analysis. 6th International Colloquium on Signal Processing & Its Applications (CSPA), 242-245.
  • J. Padmavathi 2012 . A Comparative Study on Logistic Regression Model and PCA-Logistic Regression Model in Medical Diagnosis. International Journal of Engineering and Technology . 2 - 8, 1372-1378 .
  • Jerome Fan, Suneel Upadhye, Andrew Worster 2006. Understanding receiver operating characteristic (ROC) curves Pedagogical Tools and Methods. Can J Emerg Med 8(1):19-20 .
  • Mykola Pechenizkiy, Alexey Tsymbal, Seppo Puuronen , 2004. PCA-based Feature Transformation for Classification: Issues in Medical Diagnostics. 17th IEEE Symposium on Computer-Based Medical Systems .
  • Philip Sedgwick , 2012. Statistical Question Parametric v non-parametric statistical tests. BMJ Journal. 344 doi: http://dx. doi. org/10. 1136/bmj. e1753
  • Ranjana Raut, S. V. Duddul, 2010 . Intelligent Diagnosis of Heart Diseases using Neural Network Approach. International Journal of Computer Applications . 1 – 2, 97-102.
  • Sudarshan Nandy, Partha Pratim Sarkar, Kalyani, Nadia, Achintya Das. 2012 An Improved Gauss-Newtons Method based Back-propagation Algorithm for Fast Convergence. International Journal of Computer Applications (0975 – 8887) vol39– No. 8, February 2012. 1-7.
  • Sunila Godara, Nirmal . Intelligent and Effective Decision Support System Using Multilayer Perceptron. International Journal of Engineering Research and Applications. 1-3, 513-518.
  • Yonghong Peng, Zhiqing Wu, Jianmin Jiang ,2010 . A novel feature selection approach for biomedical data classification. Journal of Biomedical Informatics, 43. No. 1,15-23.
  • Zweig MH, Campbell G. , 1993 . Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. Clin Chem. Apr 39-4:561-77
  • Charles X. Ling Jin Huang Harry Zhang, 2003 . AUC: a Statistically Consistent and more Discriminating Measure than Accuracy . 18th International Conference On Artificial Intelligence .
  • Gang Zheng, Ya-Lou Huang, Peng-Tao Wang , Guang-Fu Shu, 2005. Study Of Classification Rules On Weighted Coronary Heart Disease Data. Fourth International Conference On Machine Learning And Cybernetics, Guangzhou, 18-21 , 1845-1850. ISBN
  • Kyungim Baek and Bruce A. Draper, 2002. Factor Analysis for Background Suppression . International Conference on Pattern Recognition Colorodo.
  • Naeem Seliya, Taghi M. Khoshgoftaar, Jason Van Hulse, 2009. Aggregating Performance Metrics for Classifier Evaluation . IEEE IRI , July 10-12,, Las Vegas, Nevada, USA, 35-40.
  • Shivajirao M. Jadhav, Sanjay L. Nalbalwar, Ashok A. Ghatol, 2010. Generalized Feedforward Neural Network based Cardiac Arrhythmia Classification from ECG Signal Data. Advanced Information Management and Service (IMS)', 6th International Conference, 351-356.
  • V. Garc´?a, R. A. Mollineda and J. S. S´anchez, 2010. Theoretical Analysis of a Performance Measure for Imbalanced Data . International Conference on Pattern Recognition Pub in IEEE Computer society, 617-620.
  • Deepak Mishra, Abhishek Yadav, Sudipta Ray, and Prem K. Kalra . Levenberg-Marquardt Learning Algorithm for Integrate-and-Fire Neuron Model. Neural Information Processing - Letters and Reviews Vol. 9, No. 2, 41-51
  • Jonathon Shlens , 2009. A Tutorial on Principal Component Analysis . April 22 . Version 3. 01, tutorial available at http://www. snl. salk. edu/~shlens/pca. pdf.
  • UCI ML Repository. http://archive. ics. uci. edu/ml/
  • Fredric M. Ham, Ivica Kostanic , 2002. Principles of Neurocomputing for science and Engineering. Tata McGraw – Hill Edition.
  • Vojislav Kecman , 2004. Learning and Soft Computing. The MIT Press, Cambridge.
  • Principal Component Analysis and Factor Analysis. Therese Leinonen. Seminar in Statistics and Methodology, 25th February, 2009 http://www. let. rug. nl/ ~nerbonne/teach/rema-stats-meth-seminar/presentations/ leinonen-pcafa-2009. pdf