Part of Speech Tagging of Punjabi Language using N Gram Model

Sumeer Mittal; Navdeep Singh Sethi; Sanjeev Kumar Sharma

Call for Paper

June Edition

IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2026

Submit your paper

Know more

The week's pick

REVENUE FORECASTING IN INTELLIGENT WATER MANAGEMENT SYSTEMS USING ARIMA TIME SERIES MODEL

Coraina Y. Torar Gloria Manggala Meilani J. Ngantung

Random Articles

Speech Synthesis - Automatic Segmentation

July

2014

Analysis and Implementation of Encapsulation Schemes for Baseband Frame of DVB-S2 Satellite Modulator

June

2015

On the Internal Workings of Botnets: A Review

March

2016

Leader Election using Modified Heap Tree Method

July

2012

Reseach Article

Part of Speech Tagging of Punjabi Language using N Gram Model

by Sumeer Mittal, Navdeep Singh Sethi, Sanjeev Kumar Sharma

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 100 - Number 19

Year of Publication: 2014

Authors: Sumeer Mittal, Navdeep Singh Sethi, Sanjeev Kumar Sharma

10.5120/17634-8229

Sumeer Mittal, Navdeep Singh Sethi, Sanjeev Kumar Sharma . Part of Speech Tagging of Punjabi Language using N Gram Model. International Journal of Computer Applications. 100, 19 ( August 2014), 19-23. DOI=10.5120/17634-8229

@article{ 10.5120/17634-8229,

author = { Sumeer Mittal, Navdeep Singh Sethi, Sanjeev Kumar Sharma },

title = { Part of Speech Tagging of Punjabi Language using N Gram Model },

journal = { International Journal of Computer Applications },

issue_date = { August 2014 },

volume = { 100 },

number = { 19 },

month = { August },

year = { 2014 },

issn = { 0975-8887 },

pages = { 19-23 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume100/number19/17634-8229/ },

doi = { 10.5120/17634-8229 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T22:30:23.486102+05:30

%A Sumeer Mittal

%A Navdeep Singh Sethi

%A Sanjeev Kumar Sharma

%T Part of Speech Tagging of Punjabi Language using N Gram Model

%J International Journal of Computer Applications

%@ 0975-8887

%V 100

%N 19

%P 19-23

%D 2014

%I Foundation of Computer Science (FCS), NY, USA

Abstract

POS tagger is the process of assigning a correct tag to each word of the sentence. We attempted to improve the accuracy of existing Punjabi POS tagger. This POS tagger lacks in resolving the ambiguity of a no of words as it uses only hand written Rules. A Bi-gram Model has been used to solve the part of speech tagging problem. An annotated corpus was used for training and estimating of bi gram probabilities.

References

Dinesh Kumar and Gurpreet Singh Josan,(2010), "Part of Speech Taggers for Morphologically Rich Indian Languages: A Survey", International Journal of Computer Applications (0975 – 8887) Volume6–No. 5, September, 2010, www. ijcaonline. org/ volume6/number5 /pxc3871409 . pdf. .
Vijayalaxmi . F. Patil (2010), "Designing POS Tagset for Kannada, Linguistic Data Consortium for Indian Languages (LDC-IL), Organized by Central Institute of Indian Languages, Department of Higher Education Ministry of Human Resource Development, Government of India, March 2010. .
Hammad Ali (2010), "An Unsupervised Parts-of-Speech Tagger for the Bangla language", Department of Computer Science, University of British Columbia. 2010.
Nidhi Mishra Amit Mishra (2011), "Part of Speech Tagging for Hindi Corpus", International Conference on Communication Systems and Network Technologies.
Aniket Dalal, Kumar Nagaraj, Uma Sawant and Sandeep Shelke, "Hindi Part of Speech Tagging and Chunking: A Maximum Entropy Approach", In Proceeding of the NLPAI Machine Learning Competition, 2006.
Antony P. J, Santhanu P Mohan, Soman K. P,"SVM Based Part of Speech Tagger for Malayalam", IEEE International Conference on Recent Trends in Information, Telecommunication and Computing, pp. 339-341, 2010
Agarwal Himashu, Amni Anirudh," Part of Speech Tagging and Chunking with Conditional Random Fields" in the proceedings of NLPAI Contest, 2006
Brants, TnT – A statistical part-of-speech tagger. In Proc. Of the 6th Applied NLP Conference, pp. 224-231, 2000
Sanjeev Kumar Sharma and Dr G S Lehal "Improving Existing Punjabi POS tagger Using Hidden Markov Model"
Jyoti Singh, Nisheeth Joshi and Iti Mathur in 2013 "Part Of Speech Tagging of Marathi text Using Trigram Model" in International Journal of Advanced Information Technology (IJAIT) Vol. 3, No. 2, April2013 pp. 35-41.

Index Terms

Computer Science

Information Sciences

Keywords

POS tagger bi-gram n-gram Punjabi tag set