CFP last date
20 May 2024
Reseach Article

A Novel Data Driven Algorithm for Tamil Morphological Generator

by Anand Kumar M, Dhanalakshmi V, Soman K.P, Rajendran S
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 6 - Number 12
Year of Publication: 2010
Authors: Anand Kumar M, Dhanalakshmi V, Soman K.P, Rajendran S
10.5120/1121-1470

Anand Kumar M, Dhanalakshmi V, Soman K.P, Rajendran S . A Novel Data Driven Algorithm for Tamil Morphological Generator. International Journal of Computer Applications. 6, 12 ( September 2010), 52-56. DOI=10.5120/1121-1470

@article{ 10.5120/1121-1470,
author = { Anand Kumar M, Dhanalakshmi V, Soman K.P, Rajendran S },
title = { A Novel Data Driven Algorithm for Tamil Morphological Generator },
journal = { International Journal of Computer Applications },
issue_date = { September 2010 },
volume = { 6 },
number = { 12 },
month = { September },
year = { 2010 },
issn = { 0975-8887 },
pages = { 52-56 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume6/number12/1121-1470/ },
doi = { 10.5120/1121-1470 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T19:55:16.619354+05:30
%A Anand Kumar M
%A Dhanalakshmi V
%A Soman K.P
%A Rajendran S
%T A Novel Data Driven Algorithm for Tamil Morphological Generator
%J International Journal of Computer Applications
%@ 0975-8887
%V 6
%N 12
%P 52-56
%D 2010
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Tamil is a morphologically rich language with agglutinative nature. Being agglutinative language most of the word features are postpositionally affixed to the root word. The morphological generator takes lemma, POS category and morpho-lexical description as input and gives a word-form as output. It is a reverse process of morphological analyzer. In any natural language generation system, morphological generator is an essential component in post processing stage. Morphological generator system implemented here is based on a new algorithm, which is simple, efficient and does not require any rules and morpheme dictionary. A paradigm classification is done for noun and verb based on Dr.S.Rajendran’s paradigm classification. Tamil verbs are classified into 32 paradigms with 1884 inflected forms. Like verbs, nouns are classified into 25 paradigms with 325 word forms. This approach requires only minimum amount of data. So this approach can be easily implemented to less resourced and morphologically rich languages.

References
  1. Anandan, P., Geetha, T.V., and Paratasarathy, R. 2001.“ Morphological Generator for Tamil ”, In Proceedings of the Tamil Inayam Conference, Malaysia, 46-54.
  2. A. G. Menon, S. Saravanan, R. Loganathan, Dr. K. P. Soman,“ Amrita Morph Analyzer and Generator for Tamil: A Rule-Based Approach ” Proceedings of Tamil Internet Conference 2009 , Cologne, Germany,October 2009.
  3. Garrido, Alicia, Amaia Iturraspe, Sandra Montserrat, Herm´ınia Pastor, and Mikel L. Forcada. 1999. “A compiler for morphological analysers and generators based on finite-state transducers ”. Procesamiento del Lenguaje Natural, 25:93–98.
  4. Goyal, V, Singh Lehal, G. “Hindi Morphological Analyzer and Generator ” Emerging Trends in Engineering and Technology, 2008. ICETET '08.
  5. Guido Minnen, John Carroll, and Darren Pearce. 2000. “Robust applied morphological generation.” Proceedings of the First International Natural Language Generation Conference, pages 201.208, 12.16 June.
  6. Irimia, E. ROG - A Paradigmatic Morphological Generator for Romanian.,2007, In Proceedings of the 3rd Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics. Poznań, Poland.
  7. M Anand kumar, V Dhanalakshmi. , K P Soman, S Rajendran ,“A Novel Apporach For Tamil Morphological Analyzer” Proceedings of Tamil Internet Conference 2009 , Cologne, Germany, Page no: 23-35, October 2009.
  8. Madhavi Ganapathiraju and Lori Levin, 2006, - “TelMore: Morphological Generator for Telugu Nouns and Verbs ”. Proc. Second International Conference on Universal Digital Library, Vol Alexandria, Egypt, Nov 17-19, 2006
  9. S.Rajendran, Arulmozi, S., Ramesh Kumar, Viswanathan, S. 2001. “Computational morphology of verbal complex “. Language in india Volume 3 : 4 April 2003
  10. Thomas Lehmann, 1992 second edition. “A Grammar of Modern Tamil ”. Pondicherry Institute of Linguistics and Culture, Pondicherry.
Index Terms

Computer Science
Information Sciences

Keywords

Paradigm Suffix table word-forms Morpho-lexical information Tamil morphological generator