Part of Speech Tagging of Punjabi Language using N Gram Model

International Journal of Computer Applications
© 2014 by IJCA Journal
Volume 100 - Number 19
Year of Publication: 2014
Sumeer Mittal
Navdeep Singh Sethi
Sanjeev Kumar Sharma

Sumeer Mittal, Navdeep Singh Sethi and Sanjeev Kumar Sharma. Article: Part of Speech Tagging of Punjabi Language using N Gram Model. International Journal of Computer Applications 100(19):19-23, August 2014. Full text available. BibTeX

	author = {Sumeer Mittal and Navdeep Singh Sethi and Sanjeev Kumar Sharma},
	title = {Article: Part of Speech Tagging of Punjabi Language using N Gram Model},
	journal = {International Journal of Computer Applications},
	year = {2014},
	volume = {100},
	number = {19},
	pages = {19-23},
	month = {August},
	note = {Full text available}


Part-of-speech (POS) tagging is the process of assigning the correct tag to each word of a sentence. We attempted to improve the accuracy of the existing Punjabi POS tagger, which fails to resolve the ambiguity of a number of words because it relies only on handwritten rules. A bigram model has been used to solve the part-of-speech tagging problem. An annotated corpus was used for training and for estimating the bigram probabilities.
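The idea of estimating bigram probabilities from an annotated corpus can be illustrated with a minimal sketch. This is not the authors' implementation: the toy English corpus, the tag names, the add-one smoothing, and the function names `train_bigram`, `log_prob`, and `viterbi` are all assumptions made for illustration. The sketch counts tag-to-tag transitions and tag-to-word emissions, then picks the most probable tag sequence with Viterbi decoding.

```python
import math
from collections import Counter, defaultdict

START = "<s>"  # hypothetical sentence-start marker


def train_bigram(tagged_sentences):
    """Count tag bigrams and word emissions from an annotated corpus."""
    transitions = defaultdict(Counter)  # previous tag -> counts of next tag
    emissions = defaultdict(Counter)    # tag -> counts of words seen with it
    for sentence in tagged_sentences:
        prev = START
        for word, tag in sentence:
            transitions[prev][tag] += 1
            emissions[tag][word] += 1
            prev = tag
    return transitions, emissions


def log_prob(counter, key, n_outcomes, alpha=1.0):
    """Add-alpha smoothed log probability (smoothing choice is an assumption)."""
    total = sum(counter.values())
    return math.log((counter[key] + alpha) / (total + alpha * n_outcomes))


def viterbi(words, transitions, emissions):
    """Return the most probable tag sequence under the bigram model."""
    if not words:
        return []
    tags = sorted(emissions)
    vocab = {w for counts in emissions.values() for w in counts}
    n_words = len(vocab) + 1  # one extra slot for unseen words
    n_tags = len(tags)

    # best[i][tag] = (log score of best path ending in tag, previous tag)
    best = [{}]
    for t in tags:
        score = (log_prob(transitions[START], t, n_tags)
                 + log_prob(emissions[t], words[0], n_words))
        best[0][t] = (score, None)

    for i in range(1, len(words)):
        best.append({})
        for t in tags:
            emit = log_prob(emissions[t], words[i], n_words)
            best[i][t] = max(
                (best[i - 1][p][0] + log_prob(transitions[p], t, n_tags) + emit, p)
                for p in tags
            )

    # Follow back-pointers from the best final tag.
    last = max(tags, key=lambda t: best[-1][t][0])
    path = [last]
    for i in range(len(words) - 1, 0, -1):
        last = best[i][last][1]
        path.append(last)
    return path[::-1]


# Toy annotated corpus, invented for illustration only.
corpus = [
    [("the", "DET"), ("dog", "NOUN"), ("runs", "VERB")],
    [("the", "DET"), ("run", "NOUN"), ("ends", "VERB")],
    [("dogs", "NOUN"), ("run", "VERB")],
]
transitions, emissions = train_bigram(corpus)
print(viterbi(["the", "run", "ends"], transitions, emissions))
```

In this toy corpus the word "run" appears both as a noun and as a verb, so the emission counts alone cannot decide its tag. The transition counts supply the context that a purely word-level lookup lacks, which is the kind of ambiguity the abstract says handwritten rules fail to resolve.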

