CFP last date
20 May 2024
Reseach Article

Efficient Document Retrieval using Annotation, Searching and Ranking

by Sonal Kutade, Poonam Dhamal
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 108 - Number 5
Year of Publication: 2014
Authors: Sonal Kutade, Poonam Dhamal
10.5120/18904-0198

Sonal Kutade, Poonam Dhamal . Efficient Document Retrieval using Annotation, Searching and Ranking. International Journal of Computer Applications. 108, 5 ( December 2014), 1-3. DOI=10.5120/18904-0198

@article{ 10.5120/18904-0198,
author = { Sonal Kutade, Poonam Dhamal },
title = { Efficient Document Retrieval using Annotation, Searching and Ranking },
journal = { International Journal of Computer Applications },
issue_date = { December 2014 },
volume = { 108 },
number = { 5 },
month = { December },
year = { 2014 },
issn = { 0975-8887 },
pages = { 1-3 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume108/number5/18904-0198/ },
doi = { 10.5120/18904-0198 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:42:09.984331+05:30
%A Sonal Kutade
%A Poonam Dhamal
%T Efficient Document Retrieval using Annotation, Searching and Ranking
%J International Journal of Computer Applications
%@ 0975-8887
%V 108
%N 5
%P 1-3
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

It is always difficult to find relevant information in unstructured text documents. In this paper we study the methods of fuzzy search, instant search and proximity ranking and how they can be used in the process of annotation of documents. These various methods can be integrated to give better search results and to achieve efficient space and time complexities. We propose a novel alternative approach which facilitates the generation of the structured metadata automatically using OpenNLP, methods of Instant-fuzzy search and Proximity ranking. It is done by identifying documents which are likely to contain the information of interest. And this information will be subsequently useful for querying the database.

References
  1. Vagelis Hristidis, Panagiotis G. Ipeirotis, Eduardo J. Ruiz, "Facilitating Document Annotation Using Content and Querying Value", IEEE Transactions On Knowledge And Data Engineering, volume 6, no 2, IEEE 2014
  2. Cetindil, I. , Esmaelnezhad, J. , Taewoo Kim, Chen Li, "Efficient instant fuzzy search with proximity ranking", Data Engineering (ICDE), 30th International Conference , IEEE 2014
  3. Akshay Shingote, Nikhil Vispute, Priyanka Dhikale, "Facilitating Document Annotation Using Content & Querying Value", IJCTT, vol 9, March 2014
  4. Vagelis Hristidis, Eduardo Ruiz," CADS: A Collaborative Adaptive Data Sharing Platform", SCIS, International University, Florida, 2009
  5. R. Schenkel, A. Broschart, S. Won Hwang, G. Weikum, M. Theobald, "Efficient text proximity search," SPIRE, 2007, pp. 287–299.
  6. H. Yan, S. Shi, F. Zhang, T. Suel, and J. -R. Wen, "Efficient term proximity search with the term-pair indexes,"CIKM, 2010, pp. 1229– 1238.
  7. H. Bast, F. Suchanek, A. Chitea, I. Weber, , "Ester : efficient search on text, entities, and relations," SIGIR, 2007, pp. 671– 678.
  8. J. Feng, G Li, S. Ji, C. Li, , "Efficient interactive fuzzy keyword search," WWW, 2009, pp. 371–380.
  9. C. Li, G. Li, J. Feng S. Ji, and, "Efficient type-ahead search on the relational data: a tastier approach" , SIGMOD, 2009, pp. 695–706.
  10. M. Hadjieleftheriou and C. Li, "Efficient approximate search on string collections," PVLDB, vol. 2, no. 2, pp. 1660–1661, 2009.
  11. D. Xin, K. Chakrabarti, V. Ganti, S. Chaudhuri, "An efficient filter for approximate membership checking," SIGMOD Conf, 2008, pp. 805–818.
  12. R. Motwani S. Chaudhuri, and V. Ganti, "Robust identification of fuzzy duplicates," ICDE, 2005.
  13. J. Lu, S. Ji, A. Behm, , C. Li, " Space- constrained gram-based indexing for efficient approximate string search," ICDE, 2009, pp. 604–615.
  14. M. Zhu, S. Shi, J. -R. Wen, and N. Yu, "Can phrase indexing help to process non-phrase queries?" CIKM, 2008, pp. 679–688.
  15. R. Song, M. J. Taylor Y. Yu, , J. R. Wen, H. Hon, "Viewing term proximity from a different perspective," ECIR, 2008, pp. 346–357
Index Terms

Computer Science
Information Sciences

Keywords

Document Retrieval Document Annotation Instant-fuzzy search Proximity Ranking OpenNLP.