An Intelligent Text to Speech System for Windows based Systems and Mobile Devices

International Journal of Computer Applications
© 2014 by IJCA Journal
Volume 90 - Number 16
Year of Publication: 2014
Abhishek Srivastava
Akshay Sharma
Neelu Jain

Abhishek Srivastava, Akshay Sharma and Neelu Jain. Article: An Intelligent Text to Speech System for Windows based Systems and Mobile Devices. International Journal of Computer Applications 90(16):1-5, March 2014. Full text available.

BibTeX

	author = {Abhishek Srivastava and Akshay Sharma and Neelu Jain},
	title = {Article: An Intelligent Text to Speech System for Windows based Systems and Mobile Devices},
	journal = {International Journal of Computer Applications},
	year = {2014},
	volume = {90},
	number = {16},
	pages = {1-5},
	month = {March},
	note = {Full text available}


Text-to-speech (TTS) systems have come a long way and are now widely used in daily life. This paper presents a TTS system that uses concatenative synthesis and is based on the SDK (Software Development Kit) platform. The system is compatible with both computers and mobile devices. It has a user-friendly GUI (graphical user interface) for controlling various speech parameters. The speech signal it produces can be saved and listened to whenever required. The system can also perform signal analysis of the output speech. The results of this analysis, along with the stored speech signal, can be used in further applications, depending on the requirements. It is an intelligent system that can overcome various normalization problems.
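The abstract does not say how the normalization problems are handled. As an illustration only, the sketch below assumes that "normalization" refers to text normalization, the front-end step of a TTS system that rewrites digits and abbreviations as speakable words before synthesis. The abbreviation table and the function names `number_to_words` and `normalize` are hypothetical and are not taken from the paper.

```python
import re

# Hypothetical abbreviation table (an assumption, not from the paper);
# a real system would rely on a much larger lexicon.
ABBREVIATIONS = {"dr.": "doctor", "mr.": "mister", "etc.": "et cetera"}

ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven",
        "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
        "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty",
        "sixty", "seventy", "eighty", "ninety"]


def number_to_words(n: int) -> str:
    """Spell out an integer in the range 0..999."""
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] if ones == 0 else TENS[tens] + " " + ONES[ones]
    hundreds, rest = divmod(n, 100)
    words = ONES[hundreds] + " hundred"
    return words if rest == 0 else words + " " + number_to_words(rest)


def normalize(text: str) -> str:
    """Replace known abbreviations and 1-3 digit numbers with words."""
    tokens = []
    for token in text.split():
        lower = token.lower()
        if lower in ABBREVIATIONS:
            tokens.append(ABBREVIATIONS[lower])
        elif re.fullmatch(r"\d{1,3}", token):
            tokens.append(number_to_words(int(token)))
        else:
            # Anything else (ordinary words, longer numbers) passes through.
            tokens.append(token)
    return " ".join(tokens)


print(normalize("Dr. Jain has 42 papers"))
```

A sketch like this only covers whitespace-separated tokens; dates, currency, and numbers attached to punctuation would need additional rules before the text reaches the synthesizer.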

