CFP last date
20 June 2024
Call for Paper
July Edition
IJCA solicits high quality original research papers for the upcoming July edition of the journal. The last date of research paper submission is 20 June 2024

Submit your paper
Know more
Reseach Article

Investigation of the Issues Related to Word Level Features of Marathi Language for Searching Web Content

by Harshali B. Patil, Ajay S. Patil
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 154 - Number 9
Year of Publication: 2016
Authors: Harshali B. Patil, Ajay S. Patil
10.5120/ijca2016912211

Harshali B. Patil, Ajay S. Patil . Investigation of the Issues Related to Word Level Features of Marathi Language for Searching Web Content. International Journal of Computer Applications. 154, 9 ( Nov 2016), 14-18. DOI=10.5120/ijca2016912211

@article{ 10.5120/ijca2016912211,
author = { Harshali B. Patil, Ajay S. Patil },
title = { Investigation of the Issues Related to Word Level Features of Marathi Language for Searching Web Content },
journal = { International Journal of Computer Applications },
issue_date = { Nov 2016 },
volume = { 154 },
number = { 9 },
month = { Nov },
year = { 2016 },
issn = { 0975-8887 },
pages = { 14-18 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume154/number9/26518-2016912211/ },
doi = { 10.5120/ijca2016912211 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:59:46.865451+05:30
%A Harshali B. Patil
%A Ajay S. Patil
%T Investigation of the Issues Related to Word Level Features of Marathi Language for Searching Web Content
%J International Journal of Computer Applications
%@ 0975-8887
%V 154
%N 9
%P 14-18
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Now-a-days Internet is an important source of information. So everyone uses Internet in their day-to-day life for doing various activities. Text search is one of the important activities performed daily by the Internet users. The dramatic growth in textual data available on Internet in regional languages gave birth to natural language search. Natural language content retrieval poses certain problems due to the word level features of that language such as spelling variation, morphological variation, etc. The unavailability of tools and techniques for these regional languages can be the reason for the low recall level for these natural languages information retrieval. This paper addresses the issues related to Marathi language word level features for textual content retrieval. This paper describes different types of problems with examples and also suggests solutions to these problems.

References
  1. Mhaske N, and Patil A. 2016. Issues and Challenges in Analyzing Opinions in Marathi Text. International Journal of Computer Science Issues, Volume 13, Issue 2, pp- 19-25.
  2. Strzalkowski, T. and Vauthey, B. 1992. Information Retrieval Using Robust Natural Language Processing. In Proceedings of ACL-92, pp 104–111, Newark, Deleware, USA.
  3. Brants T. 2003. Natural Language Processing in Information Retrieval, 14th meeting of computational linguistics in the Netherlands.
  4. Majumder P., Mitra M. 2009. Indian Language Information Retrieval. Guide to OCR for Indic Scripts, pp 301-314.
  5. Kimmo Kettunen. 2007. Reductive and Generative Approaches to Morphological Variation of Keywords in Monolingual Information Retrieval, Doctoral Thesis. University of Tampere.
  6. Pal D., Majumder P., Mitra M., Mitra S., and Sen A. 2008. Issues in Searching for Indian language Web Content. In iNEWS '08 Proceedings of the 2nd ACM Workshop on Improving non-English Web Searching Pages 93-96.
  7. Soundalgekar M. Internet search for Indian Languages. M.Tech dissertation, IIT Bombay.
  8. Patil H. B., Pawar B. V., Patil A. S., 2016. A Comprehensive Analysis of Stemmers Available for Indic Languages. International Journal on Natural Language Computing (IJNLC) Volume 5 – No.1, pp 45-55.
  9. Patil H. B., Patil A. S., Pawar B. V., 2014. Part-of-Speech Tagger for Marathi Language using Limited Training Corpora. IJCA Proceedings on National Conference on Recent Advances in Information Technology NCRAIT(4), 2014, pp. 33-37.
  10. Vaishali. B. Patil & B. V. Pawar 2015. Modeling Complex Sentences for Parsing through Marathi Link Grammar Parser, Int. Journal of Computer Science Issues, Vol. 12, Issue 1, No. 2, pp 108-113.
  11. Patil N. V., Patil A. S., Pawar B. V. 2016. Survey of Named Entity Recognition Systems with respect to Indian and Foreign languages. International Journal of Computer Applications (0975 – 8887) Volume 134 – No.16, pp 21-26.
  12. Patil N. V., Patil A. S., Pawar B. V. 2016. Issues and Challenges in Marathi Named Recognition. International Journal on Natural Language Computing (IJNLC) Volume 5 – No.1, pp 15-30.
  13. Feldman S. 1999. NLP meets the Jabberwocky: Natural Language Processing in Information Retrieval, http://www.scism.lsbu.ac.uk/inmandw/ir/jaberwocky.htm
  14. V, Mari and Pedraza-Jimenez 2007, Rafael Natural Language Processing in Textual Information Retrieval and Related Topics. Hipertext.net, n. 5.
  15. Govilkar L. Marathiche Wyakaran, Mehata Publishing House.
  16. Bhagwat S. Tumche Aamche Marathi Vyakaran Vidyabharati Prakashan
Index Terms

Computer Science
Information Sciences

Keywords

Natural language processing text retrieval morphology Marathi