Automated Transcription System for Malayalam Language

Cini Kurian; Kannan Balakrishnan

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 21 July 2025

Submit your paper

Know more

The week's pick

Navigating the Future of Cybersecurity: A Strategic Approach to Crypto Agility for Modern Enterprises

Aditya Gupta

Random Articles

Automatic Speaker Age Estimation and Gender Dependent Emotion Recognition

May

2015

A Hybrid Data Model to Share Medical Images

Mar

2017

A RFID based Inventory Control System for Nigerian Supermarkets

April

2015

Article:Analyzing EAP TLS & ERP Protocol with varying processor speed

October

2010

Reseach Article

Automated Transcription System for Malayalam Language

by Cini Kurian, Kannan Balakrishnan

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 19 - Number 5

Year of Publication: 2011

Authors: Cini Kurian, Kannan Balakrishnan

10.5120/2360-3091

Cini Kurian, Kannan Balakrishnan . Automated Transcription System for Malayalam Language. International Journal of Computer Applications. 19, 5 ( April 2011), 5-10. DOI=10.5120/2360-3091

@article{ 10.5120/2360-3091,

author = { Cini Kurian, Kannan Balakrishnan },

title = { Automated Transcription System for Malayalam Language },

journal = { International Journal of Computer Applications },

issue_date = { April 2011 },

volume = { 19 },

number = { 5 },

month = { April },

year = { 2011 },

issn = { 0975-8887 },

pages = { 5-10 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume19/number5/2360-3091/ },

doi = { 10.5120/2360-3091 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T20:06:10.780747+05:30

%A Cini Kurian

%A Kannan Balakrishnan

%T Automated Transcription System for Malayalam Language

%J International Journal of Computer Applications

%@ 0975-8887

%V 19

%N 5

%P 5-10

%D 2011

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Malayalam is one of the 22 scheduled languages in India with more than 130 million speakers. This paper presents a report on the development of a speaker independent, continuous transcription system for Malayalam. The system employs Hidden Markov Model (HMM) for acoustic modeling and Mel Frequency Cepstral Coefficient (MFCC) for feature extraction. It is trained with 21 male and female speakers in the age group ranging from 20 to 40 years. The system obtained a word recognition accuracy of 87.4% and a sentence recognition accuracy of 84%, when tested with a set of continuous speech data.

References

A.Ganapatiraju , J. Hamaker and J. Picones, “Support vector machines for Speech Recogntion “ Proceedings of the International Conference on Spoken Language Processing , pp 292-296, Sydney, Australia , November, 1999.
A. Sperduti and Starita , “ Supervised Neural Networks for Classification of structures “IEEE Transactions on Neural Networks, 8(3) , pp 714-735, May 1997.
Behrman, L. Nash,J . Steck, V. Chandrashekar and S.Skinner, Simulations of Quantum Neural Networks”, Information Sciences, 128(3-4): pp 257-269, October 2000.
Baum, L.E, T. Petrie , G. Soules and N. Weiss, (1970), A maximization technique occurring in the statistical analysis of probabilistic functions of Markov Chains, Ann. Math , Statist, vol 41, no, 1, pp 164-171.
Chegalvarayan, R. and L. Deng , (1997), “ HMM based speech recognition using state-dependent discriminatively derived transforms on mel-warped DFT features”, IEEE Trans. Speech, Audio Processing, vol.5.pp 243-256.
Cini Kurian , Kannan BalaKrishnan, (2009), “ Speech Recognition of Malayalam Numbers”, IEEE Transaction of Nature and Biologically Inspired Computing ( NABIC-2009) pp 1475-1479.
C.J.C. Burges, A tutorial on Support Vector Machines on Pattern “knowledge Discovery Data Mining, vol 2, no, 2 , pp. 121-167 , 1998.
Davis S and Mermelstein P, “Comparative parametric representations of monosyllabic word recognition in continuously spoken sentences” IEEE Trans. ASSP vol 28 pp 57-336.
Dimov, D., and Azamonov , I (2005). “Experimental specifics using HMM in isolated word speech recognition” International conference on Computer Systems and Technologies – CompSysTech , 2005.
F. Felinek, “Statistical Methods for Speech Recognition” MIT Press , Cambridge, Massachusetts, USA, 1997.
Forney, G.D., (1973), The Viterbi Algorithm, Proc. IEEE, vol . 61, pp. 268-277.
Huang, X., Alex, A., and Hon, H.W (2001). “Spoken Language Processing; A Guide to Theory, Algorithm and System Development”, Pentice Hall, Upper Saddle River, New Jersey .
Jankowski , C.H , D.V and Lippman, (1995), A comparison of signal Processing front ends for Automatic word recognition , IEEE Trans. Speech , Audio, Processing, vol, 2, pp. 286-293.
Jurasky, D., and Martin, J.H (2007). “Speech and Language Processing: An introduction to Natural Language Processing, Computational linguistics, and speech recognition” 2nd Edition .
Kai-Fu Lee “ Context-Dependent phonetic Hidden Markov Models for speaker Independent Continuous speech recognition, IEEE Transaction on Acoustics, Speech and Signal Processing vol 38, No. 4 , April 1990.
Krishnan, V.R ; V. Jayakumar A, Anto P.B (2008) , “Speech Recognition of isolated Malayalam words using wavelet features and Artificial Neural Networks “ DELTA 2008. 4th IEEE International symposium on Electronic Design, Test and Applications, 2008 volume Issue 23-25 Jan, 2008. Page(s) 240 – 243
Mosur K, Ravishankar , Kevin A. Lenzo , Sphinx II User Guide CMU, 2001.
Pallett et al., D, 1990. Tools for the analysis of bench mark speech recognition tests in ICASSP, volume 1
P.Boersma, “Praat a system for doing phonetics by computer”, Glot International, vol 5, 9/10, pp 341-345, 2005

Index Terms

Computer Science

Information Sciences

Keywords

HMM MFCC Speech Recognition Transcription systems