CFP last date
22 April 2024
Reseach Article

Transmuter: An Approach to Rule-based English to Marathi Machine Translation

by G V Garje, G K Kharate, Harshad Kulkarni
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 98 - Number 21
Year of Publication: 2014
Authors: G V Garje, G K Kharate, Harshad Kulkarni
10.5120/17309-7782

G V Garje, G K Kharate, Harshad Kulkarni . Transmuter: An Approach to Rule-based English to Marathi Machine Translation. International Journal of Computer Applications. 98, 21 ( July 2014), 33-37. DOI=10.5120/17309-7782

@article{ 10.5120/17309-7782,
author = { G V Garje, G K Kharate, Harshad Kulkarni },
title = { Transmuter: An Approach to Rule-based English to Marathi Machine Translation },
journal = { International Journal of Computer Applications },
issue_date = { July 2014 },
volume = { 98 },
number = { 21 },
month = { July },
year = { 2014 },
issn = { 0975-8887 },
pages = { 33-37 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume98/number21/17309-7782/ },
doi = { 10.5120/17309-7782 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:26:48.838518+05:30
%A G V Garje
%A G K Kharate
%A Harshad Kulkarni
%T Transmuter: An Approach to Rule-based English to Marathi Machine Translation
%J International Journal of Computer Applications
%@ 0975-8887
%V 98
%N 21
%P 33-37
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper describes the architecture of a Machine Translation System with source language as English and target language as Marathi. The basic approach used for the development of this system is Rule Based Machine Translation. The basic algorithm for obtaining the correct word order in the target language was developed based on specific traversals of the parse tree. One of the special features of the system is a Word Sense Disambiguation model. Presently only prepositions will be disambiguated and work is going on for verbs and nouns. The model is a generalized approach based on the categories/domains a word belongs to. Another feature is the target language generation module. The focus is on the grammar structure of the target language that will produce better and smoother translations. The architecture though developed specifically for English – Marathi language pair, may be extended to other language pairs with similar structure. The architecture is partially implemented in the form of Machine Translation system. A lexicon is built for morphological and semantic properties. The results, even at partial implementation stage, are really encouraging.

References
  1. Wren P. and Martin H. High School English Grammar and Composition. S Chand Publication
  2. Technology Development for Indian Languages, DIT, Government of India Also available at: http://www. tdil-dc. in/index. php
  3. http://www. saakava. com
  4. Jenny Rose Finkel, Trond Grenager, and Christopher Manning. 2005. Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling. In proceedings of the 43nd Annual Meeting of the Association for Computational Linguistics (ACL 2005), pp. 363-370.
  5. http://www. nlp. stanford. edu/software/lex-parser. shtml
  6. Esha Palta. 2006-07. Word Sense Disambiguation. Master of Technology First Stage Report, IIT Bombay.
  7. Walker D. and Amsler R. 1986. The Use of Machine Readable Dictionaries in Sublanguage Analysis. In Analyzing Language in Restricted Domains, Grishman and Kittredge (eds), LEA Press, pp. 69-83
  8. Tarkhadkar Dwarakanath. Tarkhadkar Bhashantar Pathmala. Raj Prakashan, 1st edition
  9. Walimbe M. R. 2013. Sugam Marathi Vyakran Lekhan. Nitin Prakashan, revised edition
  10. Dan Klein and Christopher D. Manning. 2003. Accurate Unlexicalized Parsing. Proceedings of the 41st Meeting of the Association for Computational Linguistics, pp. 423-430.
  11. Marie-Catherine de Marneffe, Bill MacCartney and Christopher D. Manning. 2006. Generating Typed Dependency Parses from Phrase Structure Parses. In proceedings of LREC 2006.
  12. Malarkodi C. S, Pattabhi RK Rao and Sobha Lalitha Devi, 2012. Tamil NER – Coping with Real Time Challenges. In proceedings of Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), 24th International Conference on Computer Linguistics
  13. Jayan V. , Sunil R. , Bhadran V. K. 2012. Disambiguation of pre/post positions in English-Malyalam Text Translation Proceedings of Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), 24th International Conference on Computer Linguistics
  14. Papineni, K. Roukos, S. Ward, T. ; Zhu, W. J. , 2002. BLEU: a method for automatic evaluation of machine translation. ACL-2002: 40th Annual meeting of the Computational Linguistics. pp. 311–318.
Index Terms

Computer Science
Information Sciences

Keywords

Machine Translation Word Sense Disambiguation Parser Transliteration Marathi Case-suffixes