BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System

Arun Soman; Sachin Kumar S.; Hemanth V. K.

Call for Paper

June Edition

IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2026

Submit your paper

Know more

The week's pick

REVENUE FORECASTING IN INTELLIGENT WATER MANAGEMENT SYSTEMS USING ARIMA TIME SERIES MODEL

Coraina Y. Torar Gloria Manggala Meilani J. Ngantung

Random Articles

A Kalman Filter based Sway Velocity Estimation for Rudder Roll Control of Ships

February

2013

Analysis of Propagation Delay Deviation under Process Induced Threshold Voltage Variation

March

2011

An Efficient Feature Selection Technique using Supervised Fuzzy Information Theory

January

2014

Analysis of Risk Management Assessment on Customer Services using COBIT 5 Framework

Mar

2022

Reseach Article

BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System

by Arun Soman, Sachin Kumar S., Hemanth V. K.

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 29 - Number 4

Year of Publication: 2011

Authors: Arun Soman, Sachin Kumar S., Hemanth V. K.

10.5120/3551-4871

Arun Soman, Sachin Kumar S., Hemanth V. K. . BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System. International Journal of Computer Applications. 29, 4 ( September 2011), 41-46. DOI=10.5120/3551-4871

@article{ 10.5120/3551-4871,

author = { Arun Soman, Sachin Kumar S., Hemanth V. K. },

title = { BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System },

journal = { International Journal of Computer Applications },

issue_date = { September 2011 },

volume = { 29 },

number = { 4 },

month = { September },

year = { 2011 },

issn = { 0975-8887 },

pages = { 41-46 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume29/number4/3551-4871/ },

doi = { 10.5120/3551-4871 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T20:14:56.119721+05:30

%A Arun Soman

%A Sachin Kumar S.

%A Hemanth V. K.

%T BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System

%J International Journal of Computer Applications

%@ 0975-8887

%V 29

%N 4

%P 41-46

%D 2011

%I Foundation of Computer Science (FCS), NY, USA

Abstract

In a text-to-speech system, spoken utterances are automatically produced from text. In this paper, we present a corpus-driven Malayalam text-to-speech (TTS) system based on the concatenative synthesis approach. The most important qualities of a synthesized speech are naturalness and intelligibility. In this system, words and syllables are used as the basic units for synthesis. Our corpus consists of speech waveforms that are collected for most frequently used words in different domains. The speaker is selected through subjective and objective evaluation of natural and synthesized waveform. The proposed Malayalam text-to-speech system is implemented in Java multimedia framework (JMF) and runs on both in Windows and Linux platforms. The proposed system provides utility to save the synthesized output. The output generated by the proposed Malayalam text-to-speech synthesis system resembles natural human voice. Our text to speech reader software converts a Malayalam text to speech wav file that has high rates of intelligibility and comprehensibility.

References

Marian Macchi, Bellcore,”Issues in text-to-speech synthesis” In Proc. EEE International Joint Symposia on Intelligence and Systems, pp.318-325, 1998
N.P. Narendra, K. Sreenivasa Rao ,Krishnendu Ghosh, Ramu Reddy Vempada, Sudhamay Maity “Development of syllable-based text to speech synthesis system in Bengali” DOI: 10.1007/s10772-011-9094-4
C. Pornpanomchai, N. Soontharanont, C. Langla, N. Wongsawat, "A dictionary-based approach for Thai text to speech (TTTS)," icmtma, In Proc. Third Int. Conference on Measuring Technology and Mechatronics Automation, vol. 1, pp.40-43, 2011
M. N. Rao, S. Thomas, T. Nagarajan, and H. A. Murthy, “Text-to-speech synthesis using syllable like units”. In National conference on communication, IIT Kharagpur, pp. 227–280, 2005.
D. H. Klatt, “Review of text-to-speech conversion for English”. The Journal of the Acoustical Society of America, 82, 737–793, 1987
M. Sreekanth, & A.G. Ramakrishnan, “Festival based maiden tts system for Tamil language”. In Proc. 3rd language and technology conf., Poznan, Poland, October pp. 187–191, 2007.
S. P. Kishore and A. W. Black, “Unit size in unit selection speech synthesis,” in Proc. Eurospeech 2003, Sept. 2003.
N.S. Krishna, P.P. Talukdar, K. Bali, A.G. Ramakrishnan, “Duration modeling for Hindi text-to-speech synthesis system”, in Proc. Of Int. Conference on Spoken Language Processing (ICSLP’04), Korea, 2004
A. Hunt, & A. Black, “Unit selection in a concatenative speech synthesis system using a large speech database”, In Proc. of EEE int. Conference acoust, speech, and signal processing, vol. 1, pp. 373–376, 1996
N. Sridhar Krishna, Hema A.Murthy and Timothy A. Gonsalves, “Text-to-Speech (TTS) in Indian Languages”, Int. Conference on Natural Language Processing, ICON-2002, Mumbai, pp. 317.326, 2002.
Robert J. Utama, Ann K. Syrdal, Alistair Conkie,” Six approaches to limited domain concatenative speech synthesis”. INTERSPEECH, ICSLP, 2006
S.D. Shirbahadurkar, D.S. Bormane, R.L. Kazi, “Subjective and spectrogram analysis of speech synthesizer for Marathi tts using concatenative synthesis”. Recent Trends in Information, Telecommunication and Computing (ITC), 2010
K. P. Mohanan, T. Mohanan “Lexical phonology of the consonant system in Malayalam” Linguistic Inquiry The MIT Press, volume 15, 1984.
K. Panchapagesan, P.P Talukdar, N.S. Krishna, K. Bali and A.G. Ramakrishnan, “Hindi text normalization”, Fifth International Conference on Knowledge Based Computer Systems (KBCS), Hyderabad, India, 2004.
T.Charoenporn,A.Chotimongkol,V.Sornlertlamvanich, “Automatic romanization for Thai”, In Proc. of the 2nd Int. Workshop on East-Asian Language Resources and Evaluation, 1999
M. Cernak, M. Rusko, “An evaluation of synthetic speech using the PESQ measure”, In Proc. Forum Acusticum, Budapest, 2725-2728, 2005

Index Terms

Computer Science

Information Sciences

Keywords

Text-to-Speech Concatenation Speech Synthesis Text Normalization Romanization