Research Article

A Text-To-Speech Synthesis for Marathi Language using Festival and Festvox

by Sangramsing Kayte, Bharti Gawali
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 132 - Number 3
Year of Publication: 2015
Authors: Sangramsing Kayte, Bharti Gawali
10.5120/ijca2015907356

Sangramsing Kayte, Bharti Gawali. A Text-To-Speech Synthesis for Marathi Language using Festival and Festvox. International Journal of Computer Applications. 132, 3 (December 2015), 35-41. DOI=10.5120/ijca2015907356

@article{ 10.5120/ijca2015907356,
author = { Sangramsing Kayte and Bharti Gawali },
title = { A Text-To-Speech Synthesis for Marathi Language using Festival and Festvox },
journal = { International Journal of Computer Applications },
issue_date = { December 2015 },
volume = { 132 },
number = { 3 },
month = { December },
year = { 2015 },
issn = { 0975-8887 },
pages = { 35-41 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume132/number3/23577-2015907356/ },
doi = { 10.5120/ijca2015907356 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:28:11.393252+05:30
%A Sangramsing Kayte
%A Bharti Gawali
%T A Text-To-Speech Synthesis for Marathi Language using Festival and Festvox
%J International Journal of Computer Applications
%@ 0975-8887
%V 132
%N 3
%P 35-41
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This research paper describes the implementation of the first usable Marathi text-to-speech (TTS) system for Maharashtra Marathi, built on the open source Festival TTS engine. It also discusses a few practical applications that use this system. The system uses a di-phone concatenation approach in its waveform generation phase. The paper describes the construction of the di-phone database and the implementation of the natural language processing modules. These modules cover text processing, tokenizing, and grapheme-to-phoneme (G2P) conversion, and were written in Festival's format. Finally, a test was conducted to evaluate the intelligibility of the synthesized speech.
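As a rough illustration of the pipeline the abstract outlines (tokenizing, G2P conversion, then selecting di-phone units for concatenation), the following Python sketch may help. It is not the authors' Festival code: the romanized lexicon entry, the phone labels, the `pau` silence label, and all function names are assumptions made for this example, and a real system would apply G2P rules to Devanagari text rather than look words up in a toy table.

```python
import re
from typing import Dict, List

# Hypothetical romanized lexicon standing in for the G2P module.
# The phone labels are illustrative, not the paper's phone set.
TOY_LEXICON: Dict[str, List[str]] = {
    "namaskar": ["n", "a", "m", "a", "s", "k", "aa", "r"],
}

# Assumed label for the silence phone that pads an utterance.
SILENCE = "pau"


def tokenize(text: str) -> List[str]:
    """Lowercase the text and keep only runs of Latin letters."""
    return re.findall(r"[a-z]+", text.lower())


def to_phonemes(word: str) -> List[str]:
    """Look a token up in the toy lexicon (raises KeyError if missing)."""
    return list(TOY_LEXICON[word])


def to_diphones(phones: List[str]) -> List[str]:
    """Pair each phone with its successor, padding both ends with silence."""
    padded = [SILENCE] + phones + [SILENCE]
    return [f"{left}-{right}" for left, right in zip(padded, padded[1:])]


def plan_utterance(text: str) -> List[str]:
    """Return the di-phone units a concatenative synthesizer would fetch."""
    phones: List[str] = []
    for word in tokenize(text):
        phones.extend(to_phonemes(word))
    return to_diphones(phones)


if __name__ == "__main__":
    print(plan_utterance("Namaskar!"))
```

An utterance of n phones yields n + 1 di-phone units, because each unit spans the transition between two neighbouring phones and the utterance is padded with silence on both sides.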

Index Terms

Computer Science
Information Sciences

Keywords

Marathi Speech Synthesis, Text-To-Speech (TTS), Hidden-Markov-Model (HMM), Marathi HTS, TTS, speech synthesis, di-phone, Unit Selection.