CFP last date
20 May 2024
Reseach Article

Comparison of Neural Networks and Support Vector Machines using PCA and ICA for Feature Reduction

by J. Sripriya, E. S. Samundeeswari
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 40 - Number 16
Year of Publication: 2012
Authors: J. Sripriya, E. S. Samundeeswari
10.5120/5066-7434

J. Sripriya, E. S. Samundeeswari . Comparison of Neural Networks and Support Vector Machines using PCA and ICA for Feature Reduction. International Journal of Computer Applications. 40, 16 ( February 2012), 31-36. DOI=10.5120/5066-7434

@article{ 10.5120/5066-7434,
author = { J. Sripriya, E. S. Samundeeswari },
title = { Comparison of Neural Networks and Support Vector Machines using PCA and ICA for Feature Reduction },
journal = { International Journal of Computer Applications },
issue_date = { February 2012 },
volume = { 40 },
number = { 16 },
month = { February },
year = { 2012 },
issn = { 0975-8887 },
pages = { 31-36 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume40/number16/5066-7434/ },
doi = { 10.5120/5066-7434 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:28:15.479821+05:30
%A J. Sripriya
%A E. S. Samundeeswari
%T Comparison of Neural Networks and Support Vector Machines using PCA and ICA for Feature Reduction
%J International Journal of Computer Applications
%@ 0975-8887
%V 40
%N 16
%P 31-36
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Web page classification provides an efficient information search to internet users. However, presently most of the web directories are still being classified manually or semi-automatically. This paper analyses the concept of the statistical analysis methods known as Principal Component Analysis (PCA) and Independent Component Analysis (ICA). The main purpose for using integration of PCA and ICA in Web News Classification is to perform feature separation and reduction. The feature vectors are applied to Neural Networks (NN) and Support Vector Machines (SVM) classifiers. F-measure is used to measure the classification effectiveness and found SVM is better than Neural Networks (NN). For the classification-ability experiment, sports news web page section was used.

References
  1. S. Ali and O. Sigeru, Web page feature selection and classification using neural networks, Inf. Sci. Inf. Comput. Sci., vol. 158, pp. 69-88, 2004.
  2. J. Brank & M. Grobelnik, N. Milic-Frayling, D. Mladenic, Training text classifiers with SVM on very few positive examples, Microsoft Research Technical Report MSR-TR-2003-34, 2003.
  3. Burges, C.J.C, A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and Knowledge Discovery Vol.2 No.2 (1998) 121-167.
  4. R. A. Calvo, M. Partridge, and M. A. Jabri, A Comparative Study of Principal Component Analysis Techniques, presented at In Proc. Ninth Australian Conf. on Neural Networks, Brisbane, 1998.
  5. G. G. Chowdhury, Introduction to modern information retrieval. London: Library Association Publishing, 1999.
  6. S. T. Dumais, Improving the retrieval of information from extemal sources. Behavior Research Methods, Instruments and Computers, 23(2), 229-236, 1991.
  7. T. Joachims, Probabilistic analysis of the Rocchio algorithm with TFIDF for text categorization, Proceedings of International Conference on Machine Learning (ICML), 1997.
  8. Joachims, T. Advances in Kernal Methods-Support Vector Learning, chapter Making Large-Scale SVM Learning Practical MIT-Press, 1999.
  9. T. Joachims, Learning to Classify Text using Support Vector Machines, Kluwer, 2002.
  10. G. Karypis and E.H. Sam, Concept indexing: a fast dimensionality reduction algorithm with applications IO document retrieval & categorization, ClKM 2000 (2000).
  11. S. L.Y. Lam and D. L. Lee, Feature reduction for neural network based text categorization, Proceedings of the 6th International Conference on Database Systems for Advanced Applications 19 - 22 April Hsinchu, Taiwan, 1999.
  12. Lee Zhi Sam, Mohd Aizaini bin Maarof, Ali Selamat, Feature Extraction for Illicit We Pages Identifications Using Independent Component Analysis, International Conference on Intelligence and Advanced Systems, 2007.
  13. Prof.Dr.Markus Borschbach, A Hierarchical ICA-based Text Classifier, Institut fur Informatik, 2010.
  14. H. Mase and H. Tsuji. Experiments on automatic web page categorization for information retrieval system, Joumal of Information Processing, IPSJ Journal, Feb. 2001, pg. 334-347, 2001.
  15. Miao Zhang, De-xian Zhang, Trained SVMs based rules extraction method for text classification, IEEE International Symposium on IT in Medicine and Education, 2008, ITME 2008, 12-14 Dec. 2008.
  16. F. Sebastini, Machine learning in automated text categorization, ACM Computing Surveys, 34(1), 2002.
Index Terms

Computer Science
Information Sciences

Keywords

Independent Component Analysis Neural Networks Principal Component Analysis Support Vector Machine