CFP last date
22 April 2024
Reseach Article

Dimensionality Reduction Techniques for Improved Diagnosis of Heart Disease

by Swati Shilaskar, Ashok Ghatol
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 61 - Number 5
Year of Publication: 2013
Authors: Swati Shilaskar, Ashok Ghatol
10.5120/9921-4538

Swati Shilaskar, Ashok Ghatol . Dimensionality Reduction Techniques for Improved Diagnosis of Heart Disease. International Journal of Computer Applications. 61, 5 ( January 2013), 1-8. DOI=10.5120/9921-4538

@article{ 10.5120/9921-4538,
author = { Swati Shilaskar, Ashok Ghatol },
title = { Dimensionality Reduction Techniques for Improved Diagnosis of Heart Disease },
journal = { International Journal of Computer Applications },
issue_date = { January 2013 },
volume = { 61 },
number = { 5 },
month = { January },
year = { 2013 },
issn = { 0975-8887 },
pages = { 1-8 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume61/number5/9921-4538/ },
doi = { 10.5120/9921-4538 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:08:15.479305+05:30
%A Swati Shilaskar
%A Ashok Ghatol
%T Dimensionality Reduction Techniques for Improved Diagnosis of Heart Disease
%J International Journal of Computer Applications
%@ 0975-8887
%V 61
%N 5
%P 1-8
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Medical diagnosis is an important task that needs to be executed accurately and efficiently. Medical domain complexities are represented by multidimensional heterogeneous datasets. Computer aided diagnosis must deal with processing and analyzing high dimensional data. Optimization of features in datasets reduces time and memory complexity of learning algorithms. It is necessary to have a tool that gives relationship between features and eliminate redundant ones. Feature selection or feature extraction reduce dimensions and essentially influence the performance of classifier. Many techniques have been used to determine essential features of medical data. We investigate two feature extraction techniques, Principal component analysis (PCA) and common Factor Analysis (FA) techniques for classification of heart disease. These techniques expose the structure, while maintaining the integrity of the data, thus improving diagnosis performance.

References
  1. Alok Sharma, Godfrey C. Onwubolu, 2007. A Hybrid Approach for Modeling High Dimensional Medical Data . International Workshop on Inductive Modelling. Prague, 39-45.
  2. Asha Gowda Karegowda, M. A. Jayaram, A. S . Manjunath, 2011 . Feature Subset Selection using Cascaded GA & CFS: A Filter Approach in Supervised Learning. International Journal of Computer Applications . 23–2, 10.
  3. Asha. T , Dr. S. Natarajan, Dr. K. N. B. Murthy, 2010 . Diagnosis of Tuberculosis using Ensemble methods. IEEE Catalog Number: ISBN: CFP1057E-PRT . 978-1-4244-5537-9, 409-412.
  4. Donald Jackson, 1993 . Stopping rules in Principal Component Analysis: A Comparison of Heuristical and Statistical Approaches . Ecology. 74, 2204-2214.
  5. Hasmarina Hasan, Nooritawati Md Tahir , 2010. Feature Selection of Breast Cancer Based on Principal Component Analysis. 6th International Colloquium on Signal Processing & Its Applications (CSPA), 242-245.
  6. J. Padmavathi 2012 . A Comparative Study on Logistic Regression Model and PCA-Logistic Regression Model in Medical Diagnosis. International Journal of Engineering and Technology . 2 - 8, 1372-1378 .
  7. Jerome Fan, Suneel Upadhye, Andrew Worster 2006. Understanding receiver operating characteristic (ROC) curves Pedagogical Tools and Methods. Can J Emerg Med 8(1):19-20 .
  8. Mykola Pechenizkiy, Alexey Tsymbal, Seppo Puuronen , 2004. PCA-based Feature Transformation for Classification: Issues in Medical Diagnostics. 17th IEEE Symposium on Computer-Based Medical Systems .
  9. Philip Sedgwick , 2012. Statistical Question Parametric v non-parametric statistical tests. BMJ Journal. 344 doi: http://dx. doi. org/10. 1136/bmj. e1753
  10. Ranjana Raut, S. V. Duddul, 2010 . Intelligent Diagnosis of Heart Diseases using Neural Network Approach. International Journal of Computer Applications . 1 – 2, 97-102.
  11. Sudarshan Nandy, Partha Pratim Sarkar, Kalyani, Nadia, Achintya Das. 2012 An Improved Gauss-Newtons Method based Back-propagation Algorithm for Fast Convergence. International Journal of Computer Applications (0975 – 8887) vol39– No. 8, February 2012. 1-7.
  12. Sunila Godara, Nirmal . Intelligent and Effective Decision Support System Using Multilayer Perceptron. International Journal of Engineering Research and Applications. 1-3, 513-518.
  13. Yonghong Peng, Zhiqing Wu, Jianmin Jiang ,2010 . A novel feature selection approach for biomedical data classification. Journal of Biomedical Informatics, 43. No. 1,15-23.
  14. Zweig MH, Campbell G. , 1993 . Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. Clin Chem. Apr 39-4:561-77
  15. Charles X. Ling Jin Huang Harry Zhang, 2003 . AUC: a Statistically Consistent and more Discriminating Measure than Accuracy . 18th International Conference On Artificial Intelligence .
  16. Gang Zheng, Ya-Lou Huang, Peng-Tao Wang , Guang-Fu Shu, 2005. Study Of Classification Rules On Weighted Coronary Heart Disease Data. Fourth International Conference On Machine Learning And Cybernetics, Guangzhou, 18-21 , 1845-1850. ISBN
  17. Kyungim Baek and Bruce A. Draper, 2002. Factor Analysis for Background Suppression . International Conference on Pattern Recognition Colorodo.
  18. Naeem Seliya, Taghi M. Khoshgoftaar, Jason Van Hulse, 2009. Aggregating Performance Metrics for Classifier Evaluation . IEEE IRI , July 10-12,, Las Vegas, Nevada, USA, 35-40.
  19. Shivajirao M. Jadhav, Sanjay L. Nalbalwar, Ashok A. Ghatol, 2010. Generalized Feedforward Neural Network based Cardiac Arrhythmia Classification from ECG Signal Data. Advanced Information Management and Service (IMS)', 6th International Conference, 351-356.
  20. V. Garc´?a, R. A. Mollineda and J. S. S´anchez, 2010. Theoretical Analysis of a Performance Measure for Imbalanced Data . International Conference on Pattern Recognition Pub in IEEE Computer society, 617-620.
  21. Deepak Mishra, Abhishek Yadav, Sudipta Ray, and Prem K. Kalra . Levenberg-Marquardt Learning Algorithm for Integrate-and-Fire Neuron Model. Neural Information Processing - Letters and Reviews Vol. 9, No. 2, 41-51
  22. Jonathon Shlens , 2009. A Tutorial on Principal Component Analysis . April 22 . Version 3. 01, tutorial available at http://www. snl. salk. edu/~shlens/pca. pdf.
  23. UCI ML Repository. http://archive. ics. uci. edu/ml/
  24. Fredric M. Ham, Ivica Kostanic , 2002. Principles of Neurocomputing for science and Engineering. Tata McGraw – Hill Edition.
  25. Vojislav Kecman , 2004. Learning and Soft Computing. The MIT Press, Cambridge.
  26. Principal Component Analysis and Factor Analysis. Therese Leinonen. Seminar in Statistics and Methodology, 25th February, 2009 http://www. let. rug. nl/ ~nerbonne/teach/rema-stats-meth-seminar/presentations/ leinonen-pcafa-2009. pdf
Index Terms

Computer Science
Information Sciences

Keywords

Dimensionality reduction PCA FA neural networks AUC heart disease