CFP last date
20 May 2024
Call for Paper
June Edition
IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper
Know more
Reseach Article

Analysis of Vector Space Model in Information Retrieval

Published on November 2012 by Jitendra Nath Singh, Sanjay Kumar Dwivedi
National Conference on Communication Technologies & its impact on Next Generation Computing 2012
Foundation of Computer Science USA
CTNGC - Number 2
November 2012
Authors: Jitendra Nath Singh, Sanjay Kumar Dwivedi
51f0caf6-9990-4dfe-aeae-d2095c41d83b

Jitendra Nath Singh, Sanjay Kumar Dwivedi . Analysis of Vector Space Model in Information Retrieval. National Conference on Communication Technologies & its impact on Next Generation Computing 2012. CTNGC, 2 (November 2012), 14-18.

@article{
author = { Jitendra Nath Singh, Sanjay Kumar Dwivedi },
title = { Analysis of Vector Space Model in Information Retrieval },
journal = { National Conference on Communication Technologies & its impact on Next Generation Computing 2012 },
issue_date = { November 2012 },
volume = { CTNGC },
number = { 2 },
month = { November },
year = { 2012 },
issn = 0975-8887,
pages = { 14-18 },
numpages = 5,
url = { /proceedings/ctngc/number2/9056-1016/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 National Conference on Communication Technologies & its impact on Next Generation Computing 2012
%A Jitendra Nath Singh
%A Sanjay Kumar Dwivedi
%T Analysis of Vector Space Model in Information Retrieval
%J National Conference on Communication Technologies & its impact on Next Generation Computing 2012
%@ 0975-8887
%V CTNGC
%N 2
%P 14-18
%D 2012
%I International Journal of Computer Applications
Abstract

Information retrieval is great technology behind web search services. In information retrieval, it is common to model index terms and documents as vectors in a suitably defined vector space. The vector space model is one of the classical and widely applied retrieval models to evaluate relevance of web page. The retrieval operation consists of computing the cosine similarity function between a given query vector and the set of documents vector and then ranking documents accordingly. In this paper, we present different approaches of vector space model to compute similarity score of hits from search engine and more importantly, it is felt that this investigation will lead to a clearer understanding of the issues and problems in using the vector space model in information retrieval and our work intends to discuss the main aspects of Vector space models and provide a comprehensive comparison for Term- Count model, Tf-Idf model and Vector space model based on normalization.

References
  1. Shalton, G; Wong, A; Yang, C. S. : A vector space Model for automatic indexing Communications of The ACM, Volume 18, Issue 11(November1975).
  2. Sanjay K. Dwivedi,Jitendra Nath Singh, Rajesh Gotam "Information Retrieval Evaluative Model" FTICT 2011: Proceedings of the 2011, International conference on "Future Trend in Information & Communication Technology, Ghaziabad, India, Feb -2011.
  3. Yi Shang Longzhuang Li: Precision Evaluation of Search Engines. World Wide Web (2002).
  4. D. L. Lee, H. Chuang, and K. Seamons. Document ranking and the vector space model. IEEE Transactions on Software, 14(2): 1997.
  5. Chris Buckley. The importance of proper weighting methods. In M. Bates, editor, Human Language Technology. Morgan Kaufman, 1993.
  6. Longzhuang Li, Yi Shang A new statistical method for performance evaluation of search engines. ICTAI 2000.
  7. Longzhuang Li, Yi Shang A new method for automatic performance comparison of search engines. World Wide Web (2000).
  8. Chu, H. & Rosenthal: "Search engines for the World Wide Web: A comparative study and evaluation methodology". In Proceedings of the 59th Annual Meeting of the American Society for Information Science, Baltimore, 1996.
  9. Jinbiao Hou: "Research on Design of an Automatic Evaluation System of Search Engine" . In proceeding of ETP International Conference on Future Computer and Communication . FCC/2009.
  10. Gerald Salton and Chris Buckley. Term weighting approaches in automatic text retrieval. Information Processing and Management, 24(5): Is-sue 5. 1988.
  11. G. Salton and C. Buckley, "Improving Retrieval Performance by Relevance Feedback," J. Amer. Soc. for Information Science, Vol. 41, No. 4, 1990
Index Terms

Computer Science
Information Sciences

Keywords

Vector Space Model Information Retrieval Tf-idf Term Frequency Cosine Similarity