Call for Paper - September 2022 Edition
IJCA solicits original research papers for the September 2022 Edition. Last date of manuscript submission is August 22, 2022. Read More

Speech Recognition by Wavelet Analysis

International Journal of Computer Applications
© 2011 by IJCA Journal
Number 8 - Article 5
Year of Publication: 2011
Nitin Trivedi
Dr. Vikesh Kumar
Saurabh Singh
Sachin Ahuja
Raman Chadha

Nitin Trivedi, Dr. Vikesh Kumar, Saurabh Singh, Sachin Ahuja and Raman Chadha. Article: Speech Recognition by Wavelet Analysis. International Journal of Computer Applications 15(8):27–32, February 2011. Full text available. BibTeX

	author = {Nitin Trivedi and Dr. Vikesh Kumar and Saurabh Singh and Sachin Ahuja and Raman Chadha},
	title = {Article: Speech Recognition by Wavelet Analysis},
	journal = {International Journal of Computer Applications},
	year = {2011},
	volume = {15},
	number = {8},
	pages = {27--32},
	month = {February},
	note = {Full text available}


In an effort to provide a more efficient representation of the speech signal, the application of the wavelet analysis is considered. This research presents an effective and robust method for extracting features for speech processing. Based on the time‐frequency multi‐resolution property of wavelet transform, the input speech signal is decomposed into various frequency channels.

The major issues concerning the design of this Wavelet based speech recognition system are choosing optimal wavelets for speech signals, decomposition level in the DWT, selecting the feature vectors from the wavelet coefficients. More specifically automatic classification of various speech signals using the DWT is described and compared using different wavelets. Finally, wavelet based feature extraction system and its performance on an isolated word recognition problem are investigated. For the classification of the words, three layered feed forward network is used.


  • B.T. Tan, M. Fu, A. Spray, P. Dermody, “The use of wavelet transform for phoneme recognition,” Proceedings of the 4th International Conference of Spoken Language Processing Philadelphia, Vol. 4, USA, October 1996, pp.2431-2434.
  • S. G. Mallat, “A theory for multiresolution signal decomposition: the wavelet representation,” IEEE transactions on Pattern Analysis Machine Intelligence, Vol. 11 1989, pp.674-693.
  • Oliver Siohan and Chin-Hui Lee “Iterative Noise and Channel Estimation under the Stochastic Matching Algorithm Framework” IEEE Signal Processing, Processing Letters, Vol. 4, No. 11, Nov 1997.
  • M. Misiti, Y. Misiti, G. Oppenheim and J. Poggi, Matlab Wavelet Tool Box, The Math Works Inc.,2000 Page: 795.
  • George Tzanetakis, Georg Essl, Perry Cook, “Audio Analysis using the Discrete Wavelet Transform” Organized sound, Vol. 4(3), 2000.
  • L. Barbier, G. Chollet, “Robust speech parameters extraction for word recognition in noise using neural networks,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Pages: 145-148, May 1991.
  • X. Huang, “Speaker normalization for speech recognition”, IEEE International Conference on Acoustics, Speech, and Signal Processing, 1:465-468, March 1992.
  • S. Tamura, A Waibel, “Noise reduction using connectionist models.” IEEE International Conference on Acoustics, Speech, and Signal Processing, 1:553-556, April 1988.
  • S. Young, “A review of large vocabulary continues-speech recognition,” Proc. IEEE Sig. Processing. Mag. (September) (1996) 45-57.
  • N. Desmukh, A. Ganapathiraju, J. Picone, “Hierarchical search for large vocabulary conversational speech recognition – working toward a solution to the decoding problem,” IEEE Sig, Process Mag. (September) (1999) 84-107.