State of the Art Research for Bangla Text to Speech on Android Platform

International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Year of Publication: 2017
Sheikh Abujar, M. S. I. Shahin, Anisur Rahman, Abdus Sattar

Sheikh Abujar, M S I Shahin, Anisur Rahman and Abdus Sattar. State of the Art Research for Bangla Text to Speech on Android Platform. International Journal of Computer Applications 170(1):19-23, July 2017. BibTeX

	author = {Sheikh Abujar and M. S. I. Shahin and Anisur Rahman and Abdus Sattar},
	title = {State of the Art Research for Bangla Text to Speech on Android Platform},
	journal = {International Journal of Computer Applications},
	issue_date = {July 2017},
	volume = {170},
	number = {1},
	month = {Jul},
	year = {2017},
	issn = {0975-8887},
	pages = {19-23},
	numpages = {5},
	url = {},
	doi = {10.5120/ijca2017914650},
	publisher = {Foundation of Computer Science (FCS), NY, USA},
	address = {New York, USA}


Various TTS (Text to Speech) systems are already available for personal computers and web applications. On smartphones, however, few TTS systems are available for the Bangla language. Android is currently a popular smartphone platform. The few Bangla TTS systems that exist use different mechanisms, techniques, and tools. Here we present these mechanisms together and provide a summary of the existing systems.




TTS, Speech Synthesis, Bangla.