Speech Synthesis System for Marathi Accent using FESTVOX

Sangramsing N.Kayte; Monica Mundada; Charansing Kayte

Call for Paper

May Edition

IJCA solicits high quality original research papers for the upcoming May edition of the journal. The last date of research paper submission is 20 April 2026

Submit your paper

Know more

The week's pick

A Unified NIST SP 800-90B Validation Framework for CMOS True Random Number Generators and Quantum Random Number Generators

Che-Ping Lin

Random Articles

Reseach Article

Speech Synthesis System for Marathi Accent using FESTVOX

by Sangramsing N.Kayte, Monica Mundada, Charansing Kayte

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 130 - Number 6

Year of Publication: 2015

Authors: Sangramsing N.Kayte, Monica Mundada, Charansing Kayte

10.5120/ijca2015907024

Sangramsing N.Kayte, Monica Mundada, Charansing Kayte . Speech Synthesis System for Marathi Accent using FESTVOX. International Journal of Computer Applications. 130, 6 ( November 2015), 38-42. DOI=10.5120/ijca2015907024

@article{ 10.5120/ijca2015907024,

author = { Sangramsing N.Kayte, Monica Mundada, Charansing Kayte },

title = { Speech Synthesis System for Marathi Accent using FESTVOX },

journal = { International Journal of Computer Applications },

issue_date = { November 2015 },

volume = { 130 },

number = { 6 },

month = { November },

year = { 2015 },

issn = { 0975-8887 },

pages = { 38-42 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume130/number6/23214-2015907024/ },

doi = { 10.5120/ijca2015907024 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T23:24:39.123218+05:30

%A Sangramsing N.Kayte

%A Monica Mundada

%A Charansing Kayte

%T Speech Synthesis System for Marathi Accent using FESTVOX

%J International Journal of Computer Applications

%@ 0975-8887

%V 130

%N 6

%P 38-42

%D 2015

%I Foundation of Computer Science (FCS), NY, USA

Abstract

A Text To Speech synthesis (TTS) is the production of artificial speech by a machine for the given text as input. This field of study is known both as Speech Synthesis that is the “synthetic” (computer) generation of speech, and Text-To-Speech or TTS. It is the process of converting written text into speech. In the process of speech synthesis, mainly two processing components are used; they are NLP (natural language processing) and DSP (digital signal processing) modules. The speech synthesis has enormous applications such as reading for blind people, telecommunication services, language education, and aid to handicapped persons, talking books and toys, call center automation etc. The main aim of the project is to develop a TTS system producing a voice with Indian accent for the given input text. In this project, for the conversion of text to speech, we use Festival in Linux environment. Festival is a general pre-packaged tool for development of multi-language speech synthesis systems; and it will support most of the languages in the text to speech conversion. In this project, the speech generation process is done by using Festival frame work and speech tools. The voice model is generated by using festvox frame work, festival and speech tools. The required speech data for generating voice is recorded in noise less environment. The voice models can be generated by unit selection or clustergen modules present in festvox. It is observed from the generated voices that clustergen voices are better than unit selection voices.

References

Ramani Boothalingam, V Sherlin Solomi, Anushiya Rachel Gladston, Lilly Christina, P Vijayalakshmi, Nagarajan Thangavelu, Hema A Murthy, “Development and Evaluation of Unit Selection and HMM-Based Speech Synthesis Systems for Tamil” 978-1-4673-5952-8/13/$31.00 ⃝c 2013 IEEE
Samuel Thomas, “Natural Sounding Text-To-Speech Synthesis Based on Syllable-Like Units”, ms thesis, Indian Institute of Technology, Madras, May-2007
Paul Taylor, a text book on “Text to Speech Synthesis”, University of Cambridge, United Kingdom
T.Dutoit, “High-quality text-to-speech synthesis: an overview.” Faculte Polytechnique de Mons, TCTS Lab, 31, bvd Dolez, B-7000 MONS (Belgium).
Sami Lemmetty “Review of Speech Synthesis Technology” M.Tech., Helsinki University of Technology, Finland, 1999
Amdal, T. Svendsen: “Unit Selection Synthesis Database Development Using Utterance Verification”, Proc. Interspeech 2005, Lisbon, Portugal, Sept. 2005
A.J. Hunt and A. Black: “Unit selection in a Concatenative speech synthesis system using a large speech database”, Proc. ICASSP 1996, (Atlanta, USA), pp.373-376, 1996.
Möbius: Corpus-based speech synthesis: Methods and challenges, Arbeitspapiere des Institutes für Machinelle Sprachverarbeitung, Univ. Stuttgart, AIMS 6 (4), pp. 87-116, 2000.
Simon King, “A beginners’ guide to statistical parametric speech synthesis” The Centre for Speech Technology Research, University of Edinburgh, UK
A. Black, P. Taylor, and R. Caley, “The Festival speech synthesis system,” http://festvox.org/festival, 1999.
K. Prahallad, N. K. Elluru, V. Keri, S. Rajendran, and A. W. Black, "The IIIT-H Indic speech databases", in Proceedings of INTERSPEECH, Portland, Oregon, USA, 2012.
Sri Rama Murty K, B. Yegnanarayana, Anand Joseph Xavier M, “Characterization of Glottal Activity From Speech Signals”, IEEE Signal Processing Letters, vol. 16, no. 8, pp. 469-472, June 2009.
A. Black and K. Lenzo, “Building voices in the Festival speech synthesis system,” http://festvox.org/bsv/, 2000.
Roman Timofe, “Classification and Regression Trees (CART) Theory and Applications” A Master Thesis, CASE, Berlin, December 20, 2004.
Sangramsing Kayte, Monica Mundada "Study of Marathi Phones for Synthesis of Marathi Speech from Text" International Journal of Emerging Research in Management &Technology ISSN: 2278-9359 (Volume-4, Issue-10) October 2015
Sangramsing Kayte, Dr. Bharti Gawali “Marathi Speech Synthesis: A review” International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169 Volume: 3 Issue: 6 3708 – 3711
Sangramsing Kayte, Monica Mundada, Santosh Gaikwad, Bharti Gawali "PERFORMANCE EVALUATION OF SPEECH SYNTHESIS TECHNIQUES FOR ENGLISH LANGUAGE " International Congress on Information and Communication Technology 9-10 October, 2015
Sangramsing N.kayte “Marathi Isolated-Word Automatic Speech Recognition System based on Vector Quantization (VQ) approach” 101th Indian Science Congress Jammu University 03th Feb to 07 Feb 2014.
Monica Mundada, Sangramsing Kayte “Classification of speech and its related fluency disorders Using KNN” ISSN2231-0096 Volume-4 Number-3 Sept 2014
Monica Mundada, Bharti Gawali, Sangramsing Kayte "Recognition and classification of speech and its related fluency disorders" International Journal of Computer Science and Information Technologies (IJCSIT)
http://tcts.fpms.ac.be/synthesis/introtts_old.html
http://www.festvox.org/
http://www.cstr.ed.ac.uk/
http://en.wikipedia.org/wiki/Speech_synthesis
http://hts.sp.nitech.ac.jp/
http://festvox.org/11752/packed/
http://audacity.sourceforge.net/
http://www.speech.kth.se/wavesurfer/man.html

Index Terms

Computer Science

Information Sciences

Keywords

TTS Festival Festvox speech syntheses.