Evaluation of Punjabi Named Entity Recognition using Context Word Feature

International Journal of Computer Applications
© 2014 by IJCA Journal
Volume 96 - Number 20
Year of Publication: 2014
Authors:
Amandeep Kaur
Gurpreet Singh Josan
DOI: 10.5120/16913-7011

Amandeep Kaur and Gurpreet Singh Josan. Evaluation of Punjabi Named Entity Recognition using Context Word Feature. International Journal of Computer Applications 96(20):32-38, June 2014. Full text available.

BibTeX

@article{key:article,
	author = {Amandeep Kaur and Gurpreet Singh Josan},
	title = {Evaluation of Punjabi Named Entity Recognition using Context Word Feature},
	journal = {International Journal of Computer Applications},
	year = {2014},
	volume = {96},
	number = {20},
	pages = {32-38},
	month = {June},
	note = {Full text available}
}

Abstract

Named Entity Recognition is the task of identifying and classifying named entities in a given text. This paper evaluates Named Entity Recognition for the Punjabi language using the context word feature. The words preceding and succeeding a target word are very helpful in determining its category. In this work, context word features with word windows of 7, 5 and 3 have been used. Experiments have been performed using different training and test sets. The evaluation uses a Named Entity tagset of 14 tags, namely PERSON, ORGANIZATION, LOCATION, FACILITY, EVENT, RELATIONSHIP, TIME, DATE, DESIGNATION, TITLE-PERSON, NUMBER, MEASURE, ABBREVIATION and ARTIFACT. It has been observed that word windows 7 and 5 give better results than word window 3. Although the F-scores and precision values of word window 7 are slightly higher than those of word window 5, the recall of word window 7 was found to be lower than that of word window 5.
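
To make the context word feature concrete, the following minimal Python sketch collects the words around a target token for a given window size. It is an illustration only, not the authors' implementation: it assumes that a word window of size n is symmetric (the target word plus (n - 1) / 2 words on each side), and the function name `context_word_features`, the `PAD` placeholder, and the `w[offset]` feature keys are hypothetical names introduced here.

```python
from typing import Dict, List

# Hypothetical placeholder for positions that fall outside the sentence.
PAD = "<PAD>"


def context_word_features(tokens: List[str], index: int, window: int) -> Dict[str, str]:
    """Return the words in a symmetric window centred on tokens[index].

    Assumption: a "word window" of size n means the target word plus
    (n - 1) / 2 words before it and (n - 1) / 2 words after it.
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd number")
    half = window // 2
    features: Dict[str, str] = {}
    for offset in range(-half, half + 1):
        pos = index + offset
        # Pad positions before the first or after the last token.
        word = tokens[pos] if 0 <= pos < len(tokens) else PAD
        features[f"w[{offset:+d}]"] = word
    return features


if __name__ == "__main__":
    sentence = ["t0", "t1", "t2", "t3", "t4"]  # placeholder tokens
    for size in (3, 5, 7):
        print(size, context_word_features(sentence, 2, size))
```

Under this reading, a larger window gives the classifier more surrounding words per token but also more sparse features, which is one plausible reason the precision and recall of windows 7 and 5 can move in different directions.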
