Call for Paper - May 2023 Edition
IJCA solicits original research papers for the May 2023 Edition. Last date of manuscript submission is April 20, 2023. Read More

A Query based Text Categorization using K-Nearest Neighbor Approach

Print
PDF
International Journal of Computer Applications
© 2011 by IJCA Journal
Number 1 - Article 1
Year of Publication: 2011
Authors:
Suneetha Manne
Sita Kumari Kotha
Dr. S. Sameen Fatima
10.5120/3915-5513

Suneetha Manne, Sita Kumari Kotha and Dr. Sameen S Fatima. Article:A Query based Text Categorization using K-Nearest Neighbor Approach. International Journal of Computer Applications 32(7):16-21, October 2011. Full text available. BibTeX

@article{key:article,
	author = {Suneetha Manne and Sita Kumari Kotha and Dr. S. Sameen Fatima},
	title = {Article:A Query based Text Categorization using K-Nearest Neighbor Approach},
	journal = {International Journal of Computer Applications},
	year = {2011},
	volume = {32},
	number = {7},
	pages = {16-21},
	month = {October},
	note = {Full text available}
}

Abstract

World Wide Web is the store house of abundant information available in various electronic forms. In the past two decades, the increase in the performance of computers in handling large quantity of text data led researchers to focus on reliable and optimal retrieval of information already exist in the huge resources. Though the existing search engines, answering machines has succeeded in retrieving the data relative to the user query, the relevancy of the text data is not appreciable of the huge set. It is hence binding the range of resultant text data for a given user query with appreciable ranking to each document stand as a major challenge. In this paper, we propose a Query based k-Nearest Neighbor method to access relevant documents for a given query finding the most appropriate boundary to related documents available on web and rank the document on the basis of query rather than customary Content based classification. The experimental results will elucidate the categorization with reference to closeness of the given query to the document.

Reference

  • Sebastiani, F.,Machine learning in automated text categorization. ACM Computing Surveys, 34(1), pp. 1–47, 2002.
  • XiuboGeng, Tie-YanLiu, TaoQin, AndrewArnold, HangLi and Heung-YeungShum, “Query Dependent Ranking Using K-Nearest Neighbor,” ACM, SIGIR08, July20–24,2008,Singapore.
  • Dik L. Lee, uei Chuang, H Ent Seamons,“ Document Ranking and the Vector-Space Model”,a research theisis, March-April,1997.
  • T.Y.Liu,Y.Yang,H.Wan,H.Zeng,Z.Chen,andW.Y.Ma,“Support Vector machines classification with a very large scale taxonomy. SIGKDD Explor. Newsl,7(1):36–43.
  • Gongde Guo , Hui Wang , David Bell , Yaxin Bi , and Kieran Greer, “Using kNN Model-based Approach for Automatic Text Categorization”.
  • Stavros Papadopoulos, Lixing Wang, Yin Yang, Dimitris Papadias, Panagiotis Karras, “Authenticated Multi-Step Nearest Neighbor Search”
  • Yang, Y. & Pedersen, J.O., A comparative study on feature selection in text categorization. Proceedings of ICML-97, 14th International Conference on Machine Learning, ed.D.H. Fisher,Morgan Kaufmann Publishers, San Francisco, US: Nashville, US, pp. 412–420, 1997.
  • Guru, D. S., Harish B. S., and Manjunath, S. 2009. “Clustering of Textual Data: A Brief Survey”, In the Proceedings of International Conference on Signal and Image Processing, pp. 409 – 413.
  • Dr. Riyad Al-Shalabi , Dr. Ghassan Kanaan and Manaf H. Gharaibeh “Arabic Text Categorization Using kNN Algorithm”
  • K. Aas, L. Eikvil, Text Categorization: A Survey. Norwegian Computation Center, Oslo, 1999
  • R.M. Duwairi,