CFP last date
20 May 2024
Reseach Article

Text Categorization Comparison between Simple BPNN and Combinatorial Method of LSI and BPNN

by Hemlata Tekwani, Mahak Motwani
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 97 - Number 22
Year of Publication: 2014
Authors: Hemlata Tekwani, Mahak Motwani
10.5120/17138-7723

Hemlata Tekwani, Mahak Motwani . Text Categorization Comparison between Simple BPNN and Combinatorial Method of LSI and BPNN. International Journal of Computer Applications. 97, 22 ( July 2014), 15-21. DOI=10.5120/17138-7723

@article{ 10.5120/17138-7723,
author = { Hemlata Tekwani, Mahak Motwani },
title = { Text Categorization Comparison between Simple BPNN and Combinatorial Method of LSI and BPNN },
journal = { International Journal of Computer Applications },
issue_date = { July 2014 },
volume = { 97 },
number = { 22 },
month = { July },
year = { 2014 },
issn = { 0975-8887 },
pages = { 15-21 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume97/number22/17138-7723/ },
doi = { 10.5120/17138-7723 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:24:48.154254+05:30
%A Hemlata Tekwani
%A Mahak Motwani
%T Text Categorization Comparison between Simple BPNN and Combinatorial Method of LSI and BPNN
%J International Journal of Computer Applications
%@ 0975-8887
%V 97
%N 22
%P 15-21
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper proposed a text categorization comparison between simple BPNN and Combinatorial method of LSI and BPNN. In the traditional error back propagation network, the weight adjustment process gets block in a local minima and also the training speed of such network is very slow which leads to reduced performance and reduced efficiency of the network. Also the Learning time of overall network is very high. Hence, to improve the categorization accuracy, a new combinatorial method of LSI (latent semantic Indexing) and BPNN (back propagation neural network) is proposed. The latent semantics demonstration is an accurate data structure in low-dimensional space in which documents, terms and queries are rooted and also compared. Singular value decomposition (SVD) technique is used in Latent semantic Analysis in which large term-document matrix is decomposed into a set of k orthogonal factors by which the original textual data is changed to a smaller semantic space. New document vectors are found in reduced k-dimensional space. Also new coordinates of the queries are found. Here we implement combinatorial method of LSI and BPNN based technique for the classification of 20Newsgroup dataset which include categories of Sports, CS, and Medicine. The proposed technique implemented is compared with the existing BPNN technique. . Hence, this new method greatly reduces the dimension and better classification results can be achieved.

References
  1. Zenglin Xu, Rong Jin, Kaizhu Huang, Michael R. Lyu, Irwin King "Semi-supervised Text Categorization by Active Search", Proceedings of the 17th ACM conference on Information and knowledge management, pp. 1517-1518, 2008.
  2. Robert E Uhrig "introduction to artificial neural network", Proceedings of the IEEE 21st International Conference on Industrial Electronics, Control, and Instrumentation (IECON -1995), vol. 1, pp. 33-37, 1995.
  3. Richard P Lippmann "Pattern Classification Using Neural Networks", IEEE Communications Magazine, Volume 27, Issue: 11, pp. 47 – 50, 1989.
  4. Mikhail Krivenko, Vitaly Vasilyev " Sequential Latent Semantic Indexing" ,proceedings of the 2nd Workshop on Data Mining using Matrices and Tensors ,Article no3 , 2009
  5. Berry M. W. , Dumais S. T. , O'Brien G. W. "Using linear algebra for intelligent information retrieval" // SIAM Review, Vol. 37. No. 4, pp-573–595, 1995
  6. Jennifer Farkas "Documents, Concepts and Neural Networks" proceedings of the 1993 conference of the Centre for Advanced Studies on Collaborative research: distributed computing - Volume 2, pp- 1021-1031
  7. Ram Dayal Goyal "Knowledge Based Neural Network for Text Classification" GRC '07 Proceedings of the 2007 IEEE International Conference on Granular Computing, pp-542-547, 2007
  8. Zhihang Chen, Chengwen Ni and Yi L. Murphey "Neural Network Approaches for Text Document Categorization" IEEE 2006 International Joint Conference on Neural Networks,pp-1054-1060, 2006
  9. Zhan Wang, Yifan He and Minghu Jiang "A Comparison among Three Neural Networks for Text Classification" published in Signal Processing, 2006 8th International Conference on (Volume:3 ), pp-102-107 ,2006
  10. Cheng Hua Li and Soon Cheol Park "Neural Network for Text Classification Based on Singular Value Decomposition", IEEE Computer and Information Technology, 2007. CIT 2007. 7th IEEE International Conference, pp. 47-52, 2007
  11. Tim Andersen, Wei Zhang "Features for Neural Net Based Region Identification of Newspaper Documents" IEEE Published in Document Analysis and Recognition,vol-1 2003. Proceedings. Seventh International Conference, pp-403-407, 2003
  12. Suvad Selman, Alma Husagic-Selman "Multilayered Feedforward Neural Networks as aTool for Distinction of the Authors of Texts" Published in Information, Communication and Automation Technologies (ICAT), 2011 XXIII International Symposium on, pp-1-6, 2011
  13. Qiuming Ma, Zhiguang Qin, Fengli Zhang, Qiao Liu "Text Spam Neural Network Classification Algorithm" published in:Communications, Circuits and Systems (ICCCAS), 2010 International Conference on,pp-466-469, 2010
  14. Diganta Saha "Web Text Classification Using a Neural Network" 2011 Second International Conference on Emerging Applications of Information Technology,pp-57-60, 2011
  15. Jingwen Tian, Meijuan Gao, and Yang Sun "Study on Web Classification Mining Method Based on Fuzzy Neural Network" Published in Automation and Logistics, 2009. ICAL '09. IEEE International Conference on pp-1781 – 178, 2009
Index Terms

Computer Science
Information Sciences

Keywords

Text categorization Latent semantic Indexing Singular value decomposition Neural network Back propagation neural network.