CFP last date
20 June 2024
Reseach Article

Design of Optimal MLP NN for Speaker Dependent Spoken Words Recognition Application

Published on February 2015 by Sneha B. Lonkar, Nadir N. Charniya
International Conference on Advances in Science and Technology
Foundation of Computer Science USA
ICAST2014 - Number 4
February 2015
Authors: Sneha B. Lonkar, Nadir N. Charniya

Sneha B. Lonkar, Nadir N. Charniya . Design of Optimal MLP NN for Speaker Dependent Spoken Words Recognition Application. International Conference on Advances in Science and Technology. ICAST2014, 4 (February 2015), 14-18.

author = { Sneha B. Lonkar, Nadir N. Charniya },
title = { Design of Optimal MLP NN for Speaker Dependent Spoken Words Recognition Application },
journal = { International Conference on Advances in Science and Technology },
issue_date = { February 2015 },
volume = { ICAST2014 },
number = { 4 },
month = { February },
year = { 2015 },
issn = 0975-8887,
pages = { 14-18 },
numpages = 5,
url = { /proceedings/icast2014/number4/19493-5043/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
%0 Proceeding Article
%1 International Conference on Advances in Science and Technology
%A Sneha B. Lonkar
%A Nadir N. Charniya
%T Design of Optimal MLP NN for Speaker Dependent Spoken Words Recognition Application
%J International Conference on Advances in Science and Technology
%@ 0975-8887
%V ICAST2014
%N 4
%P 14-18
%D 2015
%I International Journal of Computer Applications

Spoken words recognition provides applications like spoken commands recognitions in robotics command, speech based number dialing for phones and mobiles, etc. It also provides applications in railway and banking areas. This work aims at designing of optimal Multilayer Perceptron Neural Network (MLP NN) based classifiers for speaker dependent spoken digits recognition. The classifier attempted as optimal leading to less number of computations and few components requirement for its future implementation in hardware leading to a low cost speech recognition system. Isolated spoken digits were used as an input data to the neural networks based classifiers. Each spoken word was analyzed for the feature like Mel Frequency Cepstral Coefficients (MFCC). The MLP NN based classifier was designed meticulously with the condition of minimum components and attempting maximum classification accuracy.

  1. J. M. Dias Pereira, Octavian Postolache, P. M. B. Silva Girao, and Mihai Cretu, "Minimizing Temperature Drift Errors of Conditioning Circuits Using Artificial Neural Networks", IEEE transactions on instrumentation and measurement, Vol. 49, pp. 1122-1127, October 2000.
  2. Iryna V. Turchenko, "Simulation Modeling of Mullti-Parameter Sensor Signal Identification Using Neural Networks", second IEEE international conference on intelligent systems, pp. 48-53, June 2004.
  3. Ali Gulbag, Fevzullah Temurtas, Cihat Tasaltin, Zafer Ziya Ozturk, "A study on radial basis function neural network size reduction for quantitative identification of individual gas concentrations in their gas mixtures", ScienceDirect, January 2007.
  4. Henry Leung, Titus Lo and Sichun Wang, "Prediction of Noisy Chaotic Time Series Using an Optimal Radial Basis Function Neural Network", IEEE transactions on Neural Networks, Vol. 12, pp. 1163-1172, September 2001.
  5. Huien Han and Peter Felker, "Estimation of daily soil water evaporation using an artificial neural network", Journal of Arid Environments, pp. 251-260, April 1997.
  6. Pasquale Arpaia, Pasquale Daponte, Domenico Grimaldi and Linus Michaeli, "ANN-Based Error Reduction for Experimentally Modeled Sensors", IEEE transactions on instrumentation and measurement, Vol. 51, pp. 23-30, February 2002.
  7. Guang-Bin Huang, Yan-Qiu Chen and Haroon A. Babri, "Classification Ability of Single Hidden Layer Feedforward Neural Networks", IEEE transactions on Neural Networks, Vol. 11, pp. 799-801, May 2000.
  8. Maxwell Stinchcombe and Halbert White, "Universal approximation using feedforward networks with non-sigmoid hidden layer activation functions", pp. I613-I617.
  9. C. R. Chen, H. S. Ramaswamy, "A neuro-computing approach for modeling of residence time distribution (RTD) of carrot cubes in a vertical scraped surface heat exchanger (SSHE)", Food Research International 33, pp. 549-556, February 2000.
  10. . T. Lewicke, E. S. Sazonov, M. J. Corwin, S. A. C. Schuckers, "Reliable Determnination of Sleep Versus Wake from Heart Rate Variability Using Neural Networks", Proceedings of International Joint Conference on Neural Networks, Montreal, Canada, pp. 2394-2399, August 2005.
  11. Nadir N. Charniya, "Design of Near-Optimal Classifier Using Multi-Layer Perceptron Neural Networks for Intelligent Sensors", International Journal of Modeling and Optimization, Vol. 3, No. 1, pp. 56-60, February 2013.
  12. S V Dudul, "Classification of Radar Returns from the Ionosphere using RBF Neural Network", IE(I) Journal-ET, Vol. 88, pp. 26-33, July 2007.
  13. Christopher M. Bishop, "Neural Network for Pattern Recognition", Indian Edition, Year 2007.
  14. Judith Justin and Ila Vennila, "A hybrid speech recognition system with Hidden Markov Model and Radial Basis Function Neural Network", American Journal of Applied Sciences, pp. 1148-1153, Year 2013.
  15. Bachu R. G. , Kopparthi S. , Adapa B. , Barkana B. D. , "Separation of Voiced and Unvoiced using Zero crossing rate and Energy of the Speech Signal", pp. 1-7.
  16. Bishnu Prasad Das, Ranjan Parekh, " Recognition of Isolated Words using Features based on LPC, MFCC, ZCR and STE, with Neural Network Classifiers", International Journal of Modern Engineering Research (IJMER), pp. 854-858, May-June 2012.
  17. Maruti Limkar, RamaRao & VidyaSagvekar, "Isolated Digit Recognition Using MFCC AND DTW", International Journal on Advanced Electrical and Electronics Engineering, (IJAEEE), pp. 59-64, Year 2012.
  18. Md. Ali Hossain, Md. Mijanur Rahman, Uzzal Kumar Prodhan, Md. Farukuzzaman Khan, "Implementation Of Back-Propagation Neural Network For Isolated Bangla Speech Recognition", International Journal of Information Sciences and Techniques (IJIST), Vol. 3, pp. 1-9, July 2013.
  19. Geeta Nijhawan, M. K. Soni, "A Comparative Study of Two Different Neural Models For Speaker Recognition Systems", International Journal of Innovative Technology and Exploring Engineering, Vol. 1, pp. 67-72, June 2012.
  20. D. B. Hanchate Mohini Nalawade, Manoj Pawar, Vijay Pophale, Prabhat Kumar Maurya, "Vocal Digit Recognition using Artificial Neural Network", second International Conference on Computer Engineering and Technology, Vol. 6, pp. 88-91, Year 2010.
  21. Q. Ibrahim, N. Abdulghani, "Security enhancement of voice over Internet protocol using speaker recognition technique", IET Communications, Vol. 6, pp. 604-612, Year 2012.
  22. Lakshmi Kanaka Venkateswarlu Revada, Vasantha Kumari Rambatla and Koti Verra Nagayya Ande, "A Novel Approach to Speech Recognition by Using Generalized Regression Neural Networks", International Journal of Computer Science Issues, Vol. 8, pp. 484-489, March 2011.
  23. Wouter Gevaert, Georgi Tsenov, Valeri Mladenov, "Neural Networks used for Speech Recognition", Journal of Automatic Control, University of Belgrade, Vol. 20, pp. 1-7, Year 2010.
  24. Chin Luh Tan and Adznan Jantan, "Digit Recognition using Neural Networks", Malaysian Journal of Computer Science, Vol. 17, pp. 40-54, December 2004.
  25. Mondher Frikha, Ahmed Ben Hamida, "A Comparitive Survey of ANN and Hybrid HMM/ANN Architectures for Robust Speech Recognition", American Journal of Intelligent Systems, pp. 1-8, 2012.
  26. Martin T. Hagan, Howard B. Demuth, Mark H. Beal, "Neural Network Design", Campus Pub. Service, University of Colorado Bookstore, Year 2002.
  27. Howard Demuth, Mark Beale, "Neural Network Toolbox, version4", The MathWorks, Year 2012.
Index Terms

Computer Science
Information Sciences


Neural Network Multilayer Perceptron Neural Network Speech Recognition Mel Frequency Cepstral Coefficients