CFP last date
22 April 2024
Reseach Article

Named Entity Recognition in Assamese

by Padmaja Sharma, Utpal Sharma, Jugal Kalita
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 142 - Number 8
Year of Publication: 2016
Authors: Padmaja Sharma, Utpal Sharma, Jugal Kalita
10.5120/ijca2016909885

Padmaja Sharma, Utpal Sharma, Jugal Kalita . Named Entity Recognition in Assamese. International Journal of Computer Applications. 142, 8 ( May 2016), 1-8. DOI=10.5120/ijca2016909885

@article{ 10.5120/ijca2016909885,
author = { Padmaja Sharma, Utpal Sharma, Jugal Kalita },
title = { Named Entity Recognition in Assamese },
journal = { International Journal of Computer Applications },
issue_date = { May 2016 },
volume = { 142 },
number = { 8 },
month = { May },
year = { 2016 },
issn = { 0975-8887 },
pages = { 1-8 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume142/number8/24913-2016909885/ },
doi = { 10.5120/ijca2016909885 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:44:24.593714+05:30
%A Padmaja Sharma
%A Utpal Sharma
%A Jugal Kalita
%T Named Entity Recognition in Assamese
%J International Journal of Computer Applications
%@ 0975-8887
%V 142
%N 8
%P 1-8
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Named Entity Recognition is a process through which a program extracts proper nouns in texts and associates them with a proper tag. NER has made significant progress in European languages, but in Indian languages due to the lack of effort as well as proper resources, it remains a challenging task. Recognizing ambiguities and assigning the correct tags to the names is the main goal of NER. Thus NER can be defined as identification of the proper nouns and the classification of these nouns into classes such as person, location, organization and miscellaneous including date, time and year. The main aim of this work is to develop a computational system that can perform NER in text in Assamese, which is a resource poor Indo-Aryan language. This article present an overview of NER and its issues in the context of Assamese, and also the work done in Assamese using various approaches.

References
  1. Borthwick Andrew. A Maximum Entropy Approach to NER. In Ph.D thesis,Computer Science Dept, New York University, 1999.
  2. D Appelt, R Hobbs, J Bear, D Israel, M Kaymeyama, A Kehler, D Martin, K Myers, andMTyson. SRI International FASTUS system MUC-6 Test Results and Analysis. In Proceedings of the Sixth Message Understanding Conference, pages 237–48, Columbia, Maryland, 1995.
  3. Nancy Chinchor, Eric Brown, Lisa Ferro, and Patty Robinson. Named Entity Recognition Task Definition. August-27 1999.
  4. Cortes and Vapnik. Support Vector Network. Machine Learning, pages 273–297, 1995.
  5. R. Grishman and B Sundheim. Message Understanding Conference-6: A Brief History. In Proceedings of the 16th International Conference on Computational Linguistics (COLING), pages 466–71, Copenhagen, Denmark, 1996.
  6. Humphreys. K, Gaizauskas. R, Azzam. S, Huyck. C, Mitchell. B, Cunningham. H, and Wilks. Y. Description of the Lasieii System as Used for MUC-7. In Proceedings of the 7th Message Understanding Conference, Fairfax, VA, 1998.
  7. Kaufmann.M, Gaizauskas.R, Wakao. T, Humphreys. K, Cunningham.H, , and Wilks. Y. Description of the Lasie System as Used for MUC-6. In Proceedings of the Sixth Message Understanding Conference, pages 207–220, Columbia, Maryland, 1995.
  8. John Lafferty, Andrew McCallum, and Fernando Pereira. Probabilistic Models for Segmenting and Labelling Sequence Data. In Proceedings of the Eighteenth International Conference on Machine Learning (ICML-2001), pages 282– 289, Williams College, Williamstown, MA, USA, 2001.
  9. Bikel Daniel M, Miller Scott, Schwartz Richard, and Weischedel Ralph. A High Performance Learning Namefinder. In Proceedings of the fifth Conference on Applied Natural language Processing, pages 194–201, Washington, DC, USA, 1997.
  10. Scott Miller, Michael Crystal, Heidi Fox, Lance Ramshaw, Richard Schwartz, Rebecca Stone, Ralph Weischedel, and the Annotation Group. BBN: Description of the SIFT System as Used for MUC-7. In Proceedings of Seventh Message Understanding Conference (MUC-7), pages 1–17, Fairfax,Virginia, 1998.
  11. Grishman Ralph. The New York University System MUC-6 or Where’s the syntax. In Proceedings of the Sixth Message Understanding Conference, pages 167–175, Columbia, Maryland, 1995.
  12. Nina Wacholder, Yael Ravin, and Misook Choi. Disambiguation of Proper Names in Text. In Proceedings of the Fifth Conference on Applied Natural Language, pages 202–208, Washington Marriott Hotel, Washington, DC, USA, 1997.
  13. Takahiro Wakao, Robert Gaizauskas, and Yorick Wilks. Evaluation of an Algorithm for the Recognition and Classification of Proper Names. In Proceedings of COLING- 96, pages 418–423, Copenhagen, Denmark, 1996.
  14. Shihong Yu, Shuanhu Bai, and Paul Wu. Description of the Kent Ridge Digital Labs System Used for MUC-7. In Proceedings of Seventh Message Understanding Conference (MUC-7), Fairfox, Virginia.
Index Terms

Computer Science
Information Sciences

Keywords

Named Entity Recognition Assamese Corpora