CFP last date
20 May 2024
Call for Paper
June Edition
IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper
Know more
Reseach Article

An Intelligent Text to Speech System for Windows based Systems and Mobile Devices

by Abhishek Srivastava, Akshay Sharma, Neelu Jain
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 90 - Number 16
Year of Publication: 2014
Authors: Abhishek Srivastava, Akshay Sharma, Neelu Jain
10.5120/15801-4625

Abhishek Srivastava, Akshay Sharma, Neelu Jain . An Intelligent Text to Speech System for Windows based Systems and Mobile Devices. International Journal of Computer Applications. 90, 16 ( March 2014), 1-5. DOI=10.5120/15801-4625

@article{ 10.5120/15801-4625,
author = { Abhishek Srivastava, Akshay Sharma, Neelu Jain },
title = { An Intelligent Text to Speech System for Windows based Systems and Mobile Devices },
journal = { International Journal of Computer Applications },
issue_date = { March 2014 },
volume = { 90 },
number = { 16 },
month = { March },
year = { 2014 },
issn = { 0975-8887 },
pages = { 1-5 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume90/number16/15801-4625/ },
doi = { 10.5120/15801-4625 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:11:10.638166+05:30
%A Abhishek Srivastava
%A Akshay Sharma
%A Neelu Jain
%T An Intelligent Text to Speech System for Windows based Systems and Mobile Devices
%J International Journal of Computer Applications
%@ 0975-8887
%V 90
%N 16
%P 1-5
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

TTS (Text-to-speech) systems are used invariably as part of our daily lives and have come a long way. In this paper TTS system using Concatenative synthesis based on the SDK (Software Development Kit) platform has been presented. This system is compatible with both computer and mobile devices. It has a user friendly GUI (graphical user interface) to control various speech parameters. Speech signal produced can be saved and listened to whenever required. Signal analysis of the output speech can also be done using TTS System. The results of these signal analysis along with the stored speech signal can be used for further applications depending upon the requirements. It is an intelligent system and is able to overcome various normalization problems.

References
  1. Tokuda et al," Speech Synthesis Based on Hidden Markov Models",Proceedings of the IEEE | Vol. 101, No. 5,pp. 1234-1252 May 2013
  2. J. Hamzabegovic*, D. Kalpi? "A Proposal for Development of Software to SupportSpecific Learning Difficulties",12th International Conference on Telecommunications - ConTEL 2013,pp. 207-214,ISBN: 978-953-184-180-1, Zagreb, Croatia
  3. JuergenSchroeter AT&T Laboratories
  4. A. G. Ramakrishnan, Lakshmish N Kaushik, LaxmiNarayana. M, "Natural Language Processing for Tamil TTS", Proc. 3rd Language and Technology Conference, Poznan, Poland, October 5-7, 2007
  5. Chen, G. L. , Yue, D. J. , Zu, Y. Q. , Yu, Z. L. , "An embedded English synthesis approach based on speech concatenation and smoothing", ISCSLP2004, pp. 157-160, Hong Kong, Dec. 2004
  6. T. Dutoit, "An Introduction to Text-to-Speech Synthesis"Dordrecht/Boston/London: Kluwer Academic Publishers, 1997.
  7. T. Styger and E. Keller, Fundamentals ofSpeech Synthesis and Speech Recognition: Basic Concepts, State of the Art, and Future Challenges Formant synthesis, In Keller E. (ed. ), 109-128, Chichester: John Wiley, 1994. , 4,5
  8. 13. D. H. Klatt, ''Software for a cascade/parallel formant synthesizer,'' J. Acoust. Soc. Am. , vol. 67, no. 3,971–995, 1980.
  9. J. Allen, M. S. Hunnicutt, and D. Klatt, From Text to Speech, The MITalk System, Cambridge: CambridgeUniversity Press, 1987
  10. Moulines, E. , Charpentier, F. "Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones", Speech Communication, Vol. 9, pp. 453-468, 1990
  11. Sproat, R. , Hirschberg, J. , Yarowsky, D. , "A corpus-based synthesizer", ICSLP1992, pp. 563-566, Alberta, Canada, Oct. 1992
  12. Van Santen, J. , Sproat, R. , Olive, J. , Hirshberg, J. , editors, Progress in Speech Synthesis, Springer Verlag, New York, 1995
  13. IngmundBjørkan,Speech Generation and Modification in Concatenative Speech Synthesis Ph D Thesis,Norwegian University of Science and Technology . Faculty of Information Technology, Mathematics and Electrical Engineering, Department of Electronics and Telecommunications 2010
  14. Sproat, R. and Oliver, J. "An Approach to Text-to-Speech Synthesis". Chapter 17 in book "Speech Coding and Synthesis", Elsevier, 1995
  15. S. Nakajima and H. Hamada, "Automatic generation of Synthesis Units based on context oriented clustering", Proc. ICASSP 1988, pp. 659-662, (New York, USA), 1988].
  16. R. E. Donovan and E. M. Eide, ''The IBM trainable speech synthesis system,'' in Proc. Int. Conf. Spoken Lang. Process. , 1998, pp. 1703–1706.
  17. B. Beutnagel, A. Conkie, J. Schroeter, Y. Stylianou, and A. Syrdal, ''The AT&T Next-Gen TTS system,'' in Proc. Joint ASA/EAA/DAEA Meeting, 1999,pp. 15–19.
  18. G. Coorman, J. Fackrell, P. Rutten, and B. Coile, ''Segment selection in the L&H realspeak laboratory TTS system,'' in Proc. Int. Conf. Spoken Lang. Process. , 2000,pp. 395–398. ]
  19. http://msdn. microsoft. com/en-us/library/ms720151(v=vs. 85). aspx
  20. http://msdn. microsoft. com/library/windowsphone/develop/ff402529(v=vs. 105). aspx
  21. Zeng et a," Speech dynamic range for cochlear implants" . J. Acoust. Soc. Am. , Vol. 111, No. 1, Pt. 1, Jan. 2002.
Index Terms

Computer Science
Information Sciences

Keywords

TTS SDK Concatenative synthesis GUI