CFP last date
20 March 2024
Reseach Article

Part of Speech Tagging of Punjabi Language using N Gram Model

by Sumeer Mittal, Navdeep Singh Sethi, Sanjeev Kumar Sharma
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 100 - Number 19
Year of Publication: 2014
Authors: Sumeer Mittal, Navdeep Singh Sethi, Sanjeev Kumar Sharma
10.5120/17634-8229

Sumeer Mittal, Navdeep Singh Sethi, Sanjeev Kumar Sharma . Part of Speech Tagging of Punjabi Language using N Gram Model. International Journal of Computer Applications. 100, 19 ( August 2014), 19-23. DOI=10.5120/17634-8229

@article{ 10.5120/17634-8229,
author = { Sumeer Mittal, Navdeep Singh Sethi, Sanjeev Kumar Sharma },
title = { Part of Speech Tagging of Punjabi Language using N Gram Model },
journal = { International Journal of Computer Applications },
issue_date = { August 2014 },
volume = { 100 },
number = { 19 },
month = { August },
year = { 2014 },
issn = { 0975-8887 },
pages = { 19-23 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume100/number19/17634-8229/ },
doi = { 10.5120/17634-8229 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:30:23.486102+05:30
%A Sumeer Mittal
%A Navdeep Singh Sethi
%A Sanjeev Kumar Sharma
%T Part of Speech Tagging of Punjabi Language using N Gram Model
%J International Journal of Computer Applications
%@ 0975-8887
%V 100
%N 19
%P 19-23
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

POS tagger is the process of assigning a correct tag to each word of the sentence. We attempted to improve the accuracy of existing Punjabi POS tagger. This POS tagger lacks in resolving the ambiguity of a no of words as it uses only hand written Rules. A Bi-gram Model has been used to solve the part of speech tagging problem. An annotated corpus was used for training and estimating of bi gram probabilities.

References
  1. Dinesh Kumar and Gurpreet Singh Josan,(2010), "Part of Speech Taggers for Morphologically Rich Indian Languages: A Survey", International Journal of Computer Applications (0975 – 8887) Volume6–No. 5, September, 2010, www. ijcaonline. org/ volume6/number5 /pxc3871409 . pdf. .
  2. Vijayalaxmi . F. Patil (2010), "Designing POS Tagset for Kannada, Linguistic Data Consortium for Indian Languages (LDC-IL), Organized by Central Institute of Indian Languages, Department of Higher Education Ministry of Human Resource Development, Government of India, March 2010. .
  3. Hammad Ali (2010), "An Unsupervised Parts-of-Speech Tagger for the Bangla language", Department of Computer Science, University of British Columbia. 2010.
  4. Nidhi Mishra Amit Mishra (2011), "Part of Speech Tagging for Hindi Corpus", International Conference on Communication Systems and Network Technologies.
  5. Aniket Dalal, Kumar Nagaraj, Uma Sawant and Sandeep Shelke, "Hindi Part of Speech Tagging and Chunking: A Maximum Entropy Approach", In Proceeding of the NLPAI Machine Learning Competition, 2006.
  6. Antony P. J, Santhanu P Mohan, Soman K. P,"SVM Based Part of Speech Tagger for Malayalam", IEEE International Conference on Recent Trends in Information, Telecommunication and Computing, pp. 339-341, 2010
  7. Agarwal Himashu, Amni Anirudh," Part of Speech Tagging and Chunking with Conditional Random Fields" in the proceedings of NLPAI Contest, 2006
  8. Brants, TnT – A statistical part-of-speech tagger. In Proc. Of the 6th Applied NLP Conference, pp. 224-231, 2000
  9. Sanjeev Kumar Sharma and Dr G S Lehal "Improving Existing Punjabi POS tagger Using Hidden Markov Model"
  10. Jyoti Singh, Nisheeth Joshi and Iti Mathur in 2013 "Part Of Speech Tagging of Marathi text Using Trigram Model" in International Journal of Advanced Information Technology (IJAIT) Vol. 3, No. 2, April2013 pp. 35-41.
Index Terms

Computer Science
Information Sciences

Keywords

POS tagger bi-gram n-gram Punjabi tag set