CFP last date
22 April 2024
Reseach Article

Machine Translation from English to Tamil using Hybrid Technique

by Lakshmana Pandian S, Kumanan Kadhirvelu
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 46 - Number 16
Year of Publication: 2012
Authors: Lakshmana Pandian S, Kumanan Kadhirvelu
10.5120/6997-9596

Lakshmana Pandian S, Kumanan Kadhirvelu . Machine Translation from English to Tamil using Hybrid Technique. International Journal of Computer Applications. 46, 16 ( May 2012), 36-42. DOI=10.5120/6997-9596

@article{ 10.5120/6997-9596,
author = { Lakshmana Pandian S, Kumanan Kadhirvelu },
title = { Machine Translation from English to Tamil using Hybrid Technique },
journal = { International Journal of Computer Applications },
issue_date = { May 2012 },
volume = { 46 },
number = { 16 },
month = { May },
year = { 2012 },
issn = { 0975-8887 },
pages = { 36-42 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume46/number16/6997-9596/ },
doi = { 10.5120/6997-9596 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:39:57.180687+05:30
%A Lakshmana Pandian S
%A Kumanan Kadhirvelu
%T Machine Translation from English to Tamil using Hybrid Technique
%J International Journal of Computer Applications
%@ 0975-8887
%V 46
%N 16
%P 36-42
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The corpus based techniques in Machine Translation involves parallel corpora, but it is not applicable for the languages for which there are less or no parallel corpora available. In such case the Rule based machine Translation suits best. The main objective of our work is to build a translation system that translates English sentences to Tamil Sentences. Due to the less availability of parallel corpora for English to Tamil the system is implemented using a Hybrid Technique (the combination of both Rule Based Technique and Statistical Technique). The system is first implemented in a Rule Based approach which involves segmentation and tagging, Rule Based Reordering, Morphological Analyzing, and dictionary based translation to the Target language. Then the errors in the translated sentences are corrected by applying Statistical technique.

References
  1. Hutchins, W. John; and Harold L. Somers "An Introduction to Machine Translation" London: Academic Press, pp. ISBN: 0-12-362830-x. March 1992.
  2. Hui Yang, Anne de Roeck, Vincenzo Gervasi, Alistair Willis , and Bashar Nuseibeh, "Analysing anaphoric ambiguity in natural language requirements", © Springer-Verlag London Limited, 2011.
  3. RemyaRajan, Remya Sivan, RemyaRavindran and K. P Soman, "Rule Based Machine Translation from English to Malayalam", in the proceedings of the International Conference on Advances in Computing, Control, and Telecommunication Technologies, pp. 439-441, 2009.
  4. Poornima C , Dhanalakshmi V, Anand Kumar M and Soman K P, "Rule based Sentence Simplification for English to Tamil Machine Translation System", International Journal of Computer Applications (0975 – 8887),Volume 25– No. 8, July 2011.
  5. R. Harshawardhan, Mridula Sara Augustine and K. P. Soman, "Phrase based English – Tamil Translation System by Concept Labeling using Translation Memory", International Journal of Computer Applications (0975 – 8887), Volume 20– No. 3, April 2011.
  6. Rahul. C, Dinunath. K, RemyaRavindran, K. P. Soman, Department of Computational Engineering &NetworkingAmritaVishwaVidyapeetham, "Rule Based Reordering and Morphological Processing For English-Malayalam Statistical Machine Translation", in the proceedings of the International Conference on Advances in Computing, Control, and Telecommunication Technologies, pp. 458-460, 2009.
  7. S. Sripirakas, A. R. Weerasinghe, D. L. Herath, University of Colombo School of Computing, "Statistical Machine Translation of Systems for Sinhala - Tamil", In the proceedings of the International Conference on Advances in ICT for Emerging Regions (ICTer), pp. 62-68, 2010.
  8. Felipe Sánchez-Martínez ,Juan Antonio Pérez-Ortiz and Mikel L . Forcada, "Using target-language information to train part-of-speech taggers for machine translation", Springer Science+Business Media B. V,. 2008.
  9. Fu-Hua Liu, Liang Gu, YuqingGao and Michael Picheny, IBM T. J. Watson Research Center Yorktown Heights, NY 10598, U. S. A, "Use of Statistical N-Gram Models in Natural Language Generation for Machine Translation", In the proceedings of the International Conference on Computer Science and Engineering, pp. 636-639, 2003.
  10. ArwaHatem, Nazlia Omar, Khalid Shaker, School of Computer Science, Faculty of Information Science and Technology, "Morphological Analysis for Rule based Machine Translation", in the proceedings of the International Conference on Semantic Technology and Information Retrieval, Putrajaya, Malaysia, pp. 260-263, june,2011.
  11. MuyunYang Sch. of Comput. Sci. & Technol. , Harbin Eng. Univ. ,Harbin Junguo Zhu, Jufeng Li, Lixin Wang Haoliang Qi , Sheng Li, Liu Daxin , "Extending BLEU Evaluation Method with Linguistic Weight", in the proceedings of the 9th International Conference on young computer scientist, ICYCS 2008, pp. 1683 - 1688 ,2008.
  12. Yang Liu, Dept. of Comput. Sci. , Texas Univ. , Richardson, TX, Shriberg, E. "Comparing Evaluation Metrics for Sentence Boundary Detection", IEEE International Conference onAcoustics, Speech and Signal Processing, 2007. ICASSP 2007.
  13. Wang, Y. , Acero, A. , and Chelba, C, "Is Word Error Rate a Good Indicator for Spoken Language Understanding Accuracy", IEEE Workshop on Automatic Speech Recognition and Understanding, St. Thomas, US Virgin Islands, 2003.
  14. Subramanian, K. BBN Technol. , Cambridge, Stallard, D. ,Prasad, R. , Saleem, S. , Natarajan, P. , "Semantic translation error rate for evaluating translation systems", IEEE Workshop on Automatic Speech Recognition and Understanding, 2007.
  15. Myers, Jerome L. , Well, Arnold D, "Research Design and Statistical Analysis",Second Edition, Lawrence Erlbaum,pp. 508, ISBN 0-8058-4037-0,2003.
  16. Maritz. J. S. "Distribution-Free Statistical Methods", Chapman & Hall. ISBN 0-412-15940-6, page 217, 1981.
  17. http://turner. faculty. swau. edu/mathematics/math241/materials/anova/
Index Terms

Computer Science
Information Sciences

Keywords

Segmentation And Tagging Rule Based Reordering Chunking Morphological Analyzer