CFP last date
20 June 2024
Reseach Article

Speech Synthesis System for Telugu Language

by G. Swathi, C. Kiran Mai, B. Raveendra Babu
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 81 - Number 5
Year of Publication: 2013
Authors: G. Swathi, C. Kiran Mai, B. Raveendra Babu

G. Swathi, C. Kiran Mai, B. Raveendra Babu . Speech Synthesis System for Telugu Language. International Journal of Computer Applications. 81, 5 ( November 2013), 25-30. DOI=10.5120/14009-2060

@article{ 10.5120/14009-2060,
author = { G. Swathi, C. Kiran Mai, B. Raveendra Babu },
title = { Speech Synthesis System for Telugu Language },
journal = { International Journal of Computer Applications },
issue_date = { November 2013 },
volume = { 81 },
number = { 5 },
month = { November },
year = { 2013 },
issn = { 0975-8887 },
pages = { 25-30 },
numpages = {9},
url = { },
doi = { 10.5120/14009-2060 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
%0 Journal Article
%1 2024-02-06T21:55:17.415906+05:30
%A G. Swathi
%A C. Kiran Mai
%A B. Raveendra Babu
%T Speech Synthesis System for Telugu Language
%J International Journal of Computer Applications
%@ 0975-8887
%V 81
%N 5
%P 25-30
%D 2013
%I Foundation of Computer Science (FCS), NY, USA

A system which takes input as a sequence of words and converts them to speech. Vowels and consonants are most important in Telugu language. The voices are sampled from real recorded speech. The speech synthesis is handheld by computers and mobile phones. To build a natural sounding speech synthesis system, it is essential that text processing component produce an appropriate sequence of phonemic units. Generation of sequence of phonetic units for a given standard word is referred to as letter to phoneme rule or text to phoneme rule. The complexity of these rules and their derivation depends upon the nature of the language. In Telugu TTS the input is Telugu text in Unicode. Speech synthesis is the technique of converting given input text to synthetic speech. Speech synthesis can be used to read written text as in e-mail, SMS, newspapers and can be used by blinds people. Speech synthesis has been widely researched in last four decades. The quality and intelligibility of the synthetic speech produced using the latest methods have been remarkably well for most of the applications. This project focuses primarily on the process of creating a voice for a concatenative Text-To-Speech system, or altering the TTS systems own standard output voice to sound more like the target voice.

  1. C. Bickley, A. Syrdal, and J. Schroeter, ''Speech Synthesis,'' in The Acoustics of Speech Communication, J. M. Picket, Ed. , Boston, NY: Allyn and Bacon, 1998.
  2. T. Dutoit,An Introduction to Text-to-Speech Synthesis, Dordrecht/Boston/London: Kluwer Academic Publishers, 1997.
  3. Lakshmi A, Hema A Murthy. A Syllable Based Continuous Speech Recognizer for Tamil. In Proc. of the 2nd Int. Workshop on East-Asian Language Resources and Evaluation,2009.
  4. Ö. Salor, B. Pellom and M. Demirekler, "Implementation and Evaluation of a Text-to-Speech Synthesis System for Turkish", Proceedings of Eurospeech-Interspeech 2003, Geneva, Switzerland, 2003, pp. 1573-1576.
  5. S. Lemmetty, Review of Speech Synthesis Technology, MSc. thesis, Helsinki University of Technology, 1999.
  6. K. Ishizaka and J. L. Flanagan, ''Synthesis of voiced sounds from a two-mass model of the vocal cords,'' Bell Syst. Tech. J. , vol. 51, no. 6, pp. 133–1268, 1972.
  7. van Santen J. P . H. (1994): " Assignment of seg-mental duration in text-to-speech synthesis". Com-puter Speech and Language 8, 95-128
  8. Sproat R. (1995): " A finite-state architecture for tokenization and grapheme-to-phoneme conver-sion for multilingual text analysis". In F rom text to tags: Issues in multilingual language analysis. Proc. ACL SIGDAT W orkshop (Dublin, Ireland), 65-72
  9. Sproat R. , Olive J. (1995): "Text to speech syn-thesis". AT&T T echnical Journal 74(2), 35-44
  10. Sproat R. , Olive J. (1996): " A modular architec-ture for multi-lingual text-to-speech". In J. van Santen, R. Sproat, J. Olive and J. Hirschberg (eds. ), Progress in speech synthesis (Springer , New Y ork).
  11. T alkin D. , Rowley J. (1990): "Pitch-syn-chronous analysis and synthesis for TTS systems". Proc. ESCA W orkshop on Speech Synthesis (Autrans,France), 55-58.
  12. A. M. Zeki and N. Azizah, "A Speech Synthesizer for Malay Language", National Conference on Research and Development in Computer Science, Selangor, Malaysia, October 2001.
  13. S P Kishore, Rohit Kumar and Rajeev Sangal, "A Data Driven Synthesis Approach For Indian Languages using Syllables as BasicUnit", in Proceedings of Intl. Conf. on NLP (ICON) 2002, pp. 311-316, Mumbai, India, 200.
  14. O. Fujimura and J. Lovins, ''Syllables as concatenative phonetic elements,'' inSyllables and Segments, A. Bell and J. B. Hooper, Eds. , New York: North-Holland, 107–120, 1978.
  15. BlackA. W. ,ZenH. ,andTokudaK. ,"Statistical parametric speech synthesis," in Proceeding sofIEEEInt. Conf. Acoust. , Speech,and Signal Processing, Honolulu,USA, 2007.
  16. Alan W Black, Paul Taylor, "Automatically Clustering similar units for unit selection in speech synthesis", Proceedings of Eurospeech 97.
  17. ZenH. ,NoseT. ,YamagishiJ. ,SakoS. ,MasukoT. ,Black A. W. , andTokudaK. ,"The hmm-based speech synthe sis system version2. 0," in Proc. ofISCASSW6, Bonn, Germany,2007.
  18. A. W. Black, and K. A. Lenzo, Building Synthetic Voices, Language Technologies Institute, Carnegie Mellon University and Cepstral LLC.
  19. B. Williams, R. J. Jones and I. Uemlianin, "Tools and Resources for Speech Synthesis Arising from a Welsh TTS Project", Fifth Language Resources and Evaluation Conference (LREC), Genoa, Italy, 2006.
  20. C. Kamisetty and S. M. Adapa, Telugu Festival Text-to-Speech System.
  21. A. Wasala, R. Weerasinghe and K. Gamage, "Sinhala Grapheme-to-Phoneme Conversion and Rules for Schwa epenthesis", Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, Sydney, Australia, 2006, pp. 890-897.
  22. J. B. Disanayaka. 1991. The Structure of Spoken Sinhala, National Institute of Education, Maharagama.
  23. Marian Macchi, Bellcore,"Issues in text-to-speech Synthesis" In Proc. EEE International Joint Symposia on Intelligence and Systems, pp. 318-325, 1998.
  24. A. Hunt, & A. Black, "Unit selection in a concatenative speech synthesis system using a large speech database", In Proc. of EEE int. Conference acoust, speech, and signal processing, vol. 1, pp. 373–376, 1996.
  25. Carlson, R. , & Nord, L. "Vowel dynamics in a text-to-speech system - some considerations". In Proceedings Eurospeech '93 (pp. 1911-1914). Berlin, 1993.
  26. Anupam Basu, Debasish Sen , Shiraj Sen and Soumen Chakraborty "An Indian Language Speech Synthesizer –Techniques and Applications" National Systems Conference, Indian Institute of Technology, Kharagpur, december 17-19, 2003.
Index Terms

Computer Science
Information Sciences


Text processing speech generation phoneme Speech synthesis