Estimation of Spectral Mismatch for Joint Cost Evaluation in Marathi TTS

Smita P. Kawachale; Janardan S. Chitode

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 20 July 2026

Submit your paper

Know more

The week's pick

ReLeaf: A MobileNetV2-Based Mobile Application for Real-Time Waste Classification with LLM-Assisted Recycling Guidance

Fatimah H. Alyami Nadeen N. Abduljabbar Ghadi T. Alzahrani Dana B. Alakeel Amal S. Almirsal Atheer S. Algherairy

Random Articles

Transmit Power Minimization using Fuzzy Rule based System in Relay Assisted Cognitive Radio Networks

November

2015

An Optimized Classifier Frame Work based on Rough Set and Random Tree

Feb

2017

An Intelligent approach to enhance the help messages for a compiler - An expert system

February

2010

Advanced Algorithm for Detection and Prevention of Cooperative Black and Gray Hole Attacks in Mobile Ad Hoc Networks

February

2010

Reseach Article

Estimation of Spectral Mismatch for Joint Cost Evaluation in Marathi TTS

by Smita P. Kawachale, Janardan S. Chitode

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 65 - Number 17

Year of Publication: 2013

Authors: Smita P. Kawachale, Janardan S. Chitode

10.5120/11019-6387

Smita P. Kawachale, Janardan S. Chitode . Estimation of Spectral Mismatch for Joint Cost Evaluation in Marathi TTS. International Journal of Computer Applications. 65, 17 ( March 2013), 43-50. DOI=10.5120/11019-6387

@article{ 10.5120/11019-6387,

author = { Smita P. Kawachale, Janardan S. Chitode },

title = { Estimation of Spectral Mismatch for Joint Cost Evaluation in Marathi TTS },

journal = { International Journal of Computer Applications },

issue_date = { March 2013 },

volume = { 65 },

number = { 17 },

month = { March },

year = { 2013 },

issn = { 0975-8887 },

pages = { 43-50 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume65/number17/11019-6387/ },

doi = { 10.5120/11019-6387 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T21:19:08.268000+05:30

%A Smita P. Kawachale

%A Janardan S. Chitode

%T Estimation of Spectral Mismatch for Joint Cost Evaluation in Marathi TTS

%J International Journal of Computer Applications

%@ 0975-8887

%V 65

%N 17

%P 43-50

%D 2013

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Among different methods of speech synthesis, Concatenative Speech Synthesis is widely used due to its naturalness and less signal processing requirement. But concatenative TTS has problems like requirement of large database and resulting spectral mismatch in output speech. In concatenative TTS position of syllable plays very important role while carrying out segmentation. If proper position syllable is used while forming new words from existing syllables, resulting spectral mismatch is less. If position of syllable is not considered during concatenation of speech units, resulting synthesis end up in more concatenation cost. This paper presents different techniques like PSD, Wavelet and DTW to find spectral mismatch in concatenated segments. In all these three techniques PSD results are more superior who shows spectral mismatch in graphical form. With direct formant modification we can overcome spectral mismatch and smooth some of the frames which helps to reduce glitch type of sound at concatenation point. Wavelet based audio results shows more naturalness compare to other two methods. In proposed work the discontinuities at the cutting point are smoothed by changing the spectral characteristics before and after the cutting point so that the spectral mismatch is equally distributed over the number of adjacent frames. This work throws light on how spectral mismatch calculation and reduction increases naturalness of concatenative Marathi TTS.

References

"Objective distance measure for spectral discontinuities in concatenative speech synthesis. "—J. Vepa, S. King and P. Taylor, in proc. ICSLP, Denver, co, 2002.
"The minimum phase signal derived from the magnitude spectrum and its applications to speech segmentation" – T. Nagarajan, V. Kamakshi Prasad and Hema A. Murthy, Sixth Biennial conference of signal processing and communications, July 2001.
"A comparision of spectral smoothing methods for segment concatenation based speech synthesis", -David T. Chappell, John H. L. Hansen.
"Context-Adaptive Smoothing for concatenative speech synthesis", - Ki-Seung Lee and Sang-Ryong Kim, IEEE signal processing letters, vol. 9, No. 12, December 2002.
"Refining segmental boundaries for TTS Database using fine contextual dependent boundary models", - Lijuan Wang, Yong Zhao, Min Chu, Jianlai Zhou and Zhigang Cao.
"Subjective evaluation of joint cost and smoothing methods for unit selection speech synthesis", - Jithendra Vepa and Simon King, IEEE transactions on Audio, Speech, and Language Processing, Vol. 14, No. 5, September 2006.
"New Objective Distance measures for Spectral Discontinuities in Concatenative speech synthesis. ", - Jithendra Vepa, Simon King and Paul Taylor, IEEE 0- 7803-7395-2/2002.
"Concatenative Speech Synthesis for European Portuguese", -Pedro M. Carvalho, Luis C. Oliveira, Isabel M. Trancoso, M. Ceu Viana, INESC/IST.
"Sub-band based group delay segmentation ofspontaneous speech into syllable like units", -T. Nagarajan, H. A. Murthy, I. I. T. Madras.
"A Study on the Performance of Wavelet Packets for Spectral Analysis" M. K. Lakshmanan et. al, IRCTR, Dept of Electrical Engg, Delft University, Netherlands.

Index Terms

Computer Science

Information Sciences

Keywords

TTS-Text to Speech System Spectral Smoothing Concatenative TTS Speech Synthesizer