Research Article

An Approach to Handle Idioms and Phrasal Verbs in English-Tamil Machine Translation System

by Thiruumeni P G, Anand Kumar M, Dhanalakshmi V, Soman K P
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 26 - Number 10
Year of Publication: 2011
Authors: Thiruumeni P G, Anand Kumar M, Dhanalakshmi V, Soman K P
DOI: 10.5120/3139-4328

Thiruumeni P G, Anand Kumar M, Dhanalakshmi V, Soman K P. An Approach to Handle Idioms and Phrasal Verbs in English-Tamil Machine Translation System. International Journal of Computer Applications. 26, 10 (July 2011), 36-41. DOI=10.5120/3139-4328

@article{ 10.5120/3139-4328,
author = { Thiruumeni P G and Anand Kumar M and Dhanalakshmi V and Soman K P },
title = { An Approach to Handle Idioms and Phrasal Verbs in English-Tamil Machine Translation System },
journal = { International Journal of Computer Applications },
issue_date = { July 2011 },
volume = { 26 },
number = { 10 },
month = { July },
year = { 2011 },
issn = { 0975-8887 },
pages = { 36-41 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume26/number10/3139-4328/ },
doi = { 10.5120/3139-4328 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:12:26.686218+05:30
%A Thiruumeni P G
%A Anand Kumar M
%A Dhanalakshmi V
%A Soman K P
%T An Approach to Handle Idioms and Phrasal Verbs in English-Tamil Machine Translation System
%J International Journal of Computer Applications
%@ 0975-8887
%V 26
%N 10
%P 36-41
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In this paper, we report our work on incorporating a technique to handle phrasal verbs and idioms in English to Tamil machine translation. When translating from English to Tamil, phrasal verbs and idioms are especially likely to be rendered in the wrong sense. This happens when each word of the idiom or phrasal verb is translated with its individual meaning, instead of the expression being treated as a group of words that conveys a single meaning. This in turn reduces the accuracy of the translation. The proposed technique handles idioms and phrasal verbs during the translation process and thereby increases translation accuracy. The BLEU and NIST scores calculated before and after handling the phrasal verbs and idioms show a significant increase in the accuracy of the translation. Although proposed for an English to Tamil machine translation system, the technique can be incorporated into a machine translation system from English to any other language.
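
The idea of treating an idiom or phrasal verb as one unit rather than as separate words can be illustrated with a small preprocessing sketch. This is not the authors' implementation, which the abstract does not describe. The lexicon entries, the function name `group_multiword_expressions`, and the convention of joining words with underscores are all assumptions made for illustration. The sketch scans an English sentence and merges any known multiword expression into a single token, so that a downstream translation step could look it up as a whole.

```python
# Hypothetical lexicon of idioms and phrasal verbs. The entries and the
# underscore-joining convention are illustrative assumptions, not taken
# from the paper.
MWE_LEXICON = {
    ("kick", "the", "bucket"),
    ("give", "up"),
    ("look", "after"),
}
MAX_LEN = max(len(entry) for entry in MWE_LEXICON)


def group_multiword_expressions(tokens):
    """Greedily merge the longest lexicon match at each position into one token."""
    grouped = []
    i = 0
    while i < len(tokens):
        # Try the longest possible span first, down to two words.
        for span in range(min(MAX_LEN, len(tokens) - i), 1, -1):
            candidate = tuple(t.lower() for t in tokens[i:i + span])
            if candidate in MWE_LEXICON:
                grouped.append("_".join(candidate))
                i += span
                break
        else:
            # No multiword match starts here, so keep the word unchanged.
            grouped.append(tokens[i])
            i += 1
    return grouped


if __name__ == "__main__":
    print(group_multiword_expressions("He will give up smoking".split()))
```

The sketch matches only exact, contiguous surface forms. Inflected forms such as "gave up" and separable phrasal verbs such as "give it up" would need morphological analysis or pattern rules that are beyond this example.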

Index Terms

Computer Science
Information Sciences

Keywords

Phrasal verbs, Idioms, Statistical Machine Translation, Rule-based Machine Translation