CFP last date
20 May 2024
Reseach Article

Feature Selection Method for Speaker Recognition using Neural Network

by Dipen Nath, Sanjib Kr. Kalita
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 101 - Number 3
Year of Publication: 2014
Authors: Dipen Nath, Sanjib Kr. Kalita
10.5120/17670-8499

Dipen Nath, Sanjib Kr. Kalita . Feature Selection Method for Speaker Recognition using Neural Network. International Journal of Computer Applications. 101, 3 ( September 2014), 38-44. DOI=10.5120/17670-8499

@article{ 10.5120/17670-8499,
author = { Dipen Nath, Sanjib Kr. Kalita },
title = { Feature Selection Method for Speaker Recognition using Neural Network },
journal = { International Journal of Computer Applications },
issue_date = { September 2014 },
volume = { 101 },
number = { 3 },
month = { September },
year = { 2014 },
issn = { 0975-8887 },
pages = { 38-44 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume101/number3/17670-8499/ },
doi = { 10.5120/17670-8499 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:30:45.960511+05:30
%A Dipen Nath
%A Sanjib Kr. Kalita
%T Feature Selection Method for Speaker Recognition using Neural Network
%J International Journal of Computer Applications
%@ 0975-8887
%V 101
%N 3
%P 38-44
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The aim of this paper is to extract and select features from speech signal that will make it possible to have acceptable speaker recognition rate in real life. A variety of combinations among formants (F1, F2, F3), Linear Predictive Coefficients (LPC), Mel Frequency Cepstral Coefficients (MFCC) and delta- Mel Frequency Cepstral Coefficients representing features are considered and their effect in speaker recognition is observed. Two similar volume data sets with differed string (words) are considered in the present study. These two data sets are prepared taking into account two differed data sampling rates. The study reveals another interesting fact that the selection of strings in speaker enrollment process is a matter of importance for accurate result. This means that the speaker will be tested for authentication with the same string with which he was enrolled earlier during the time of his first access to the system.

References
  1. Adjoudj Reda, Boukelif Aoued, "Artificial Neural Network & Mel-Frequency Cepstrum Coefficients-Based Speaker Recognition", 3rd International Conference: Sciences of Electronic, Technologies of Information and Telecommunications--TUNISIA, March 27-31, 2005
  2. Mark K. Transtrum and James P. Sethna "Improvements to the Levenberg-Marquardt algorithm for nonlinear least-squares minimization," Preprint submitted to Journal of Computational Physics, January 30, 2012.
  3. Kshamamayee Dash, Debananda Padhi, Bhoomika Panda, Prof. Sanghamitra Mohanty, "Speaker Identification using Mel Frequency Cepstral Coefficient and BPNN", International Journal of Advanced Research in Computer Science and Software Engineering, Volume 2, Issue 4, ISSN: 2277 128X, April 2012
  4. Praveen N, Tessamma Thomas, "Text dependent speaker recognition using MFCC features and BPANN", International Journal of Computer Applications (0975 – 8887), Volume 74– No. 5, July 2013
  5. Bishnu Prasad Das, Ranjan Parekh, "Recognition of Isolated Words using Features based on LPC, MFCC, ZCR and STE with Neural Network Classifiers", International Journal of Modern Engineering Research (IJMER) ,Vol. 2, Issue. 3, pp-854-858 [ISSN: 2249-6645], May-June 2012
  6. Lajish V. L , Sunil Kumar R. K and Vivek P, "Speaker identification using a nonlinear speech model and ANN", International Journal of Advanced Information Technology (IJAIT) Vol. 2, No. 5, October 2012
  7. Thiang, Suryo Wijoyo. "Speech Recognition Using Linear Predictive Coding and Artificial Neural Network for Controlling Movement of Mobile Robot". 2011 International Conference on Information and Electronics Engineering IPCSIT vol. 6 © (2011) IACSIT Press, Singapore, 2011
  8. Talukdar, P. H. , Bhattacharjee, U. , Goswami, C. K. , Barman, J. , "Cepstral Measure of Boro Vowels through LPC-Analysis", Journal of the CSI, Vol. 34 No 1, Jan – Mar, 2004
  9. Kalita S. K. , Dutta R. , and Talukdar P. H. , "A spectral analysis of Bodo and Assamese vowels", Abstracts 3rd International Conference on "Computers and Devices for Communication". CODEC – 06, Kolkata, India, pp. 41, 2006
  10. Braman, J. , Kalita, S. , Talukdar, P. H. , "Features extraction of bodo vowels through lpc-analysis", Proceedings of Frontiers of Research on Speech and Music (FRMS-2004), 2004
  11. Hasan Rashidul, Jamil Mustafa, Rabbani Golam, Rahman Saifur, "Speaker identification using mel frequency cepstral coefficients", 3rd International Conference on Electrical & Computer Engineering, Dhaka, Bangladesh, ICECE 2004, 28-30 December 2004
  12. Rabiner L. , Juang B. H. and Yegnanarayana B. – "Fundamentals of Speech Processing", Pearson Education, ISBN 978-81-775-8560-5, 2011
  13. D. Ripley, "Neural Networks and Related Methods for Classification", Journal of the Royal Statistical Society. Series B (Methodological), Vol. 56, No. 3(1994), pp. 409-456, 1994
  14. Rabiner L. and Juang B. H. – "Fundamental of Speech Processing", Prentice-Hall, 1993
  15. Bishop, C. , "Neural Networks for Pattern Recognition", Oxford University Press, Oxford, 1995
  16. Haykin, S. , "Neural Networks - A Comprehensive Foundation", 2nd ed. Prentice-Hall, Englewood Cliffs, 1998
  17. K. Levenberg. "A Method for the Solution of Certain Non-Linear Problems in Least Squares". The Quarterly of Applied Mathematics, 2: 164-168, 1994
  18. M. I. A. Lourakis. , "A brief description of the Levenberg-Marquardt algorithm" implemented by levmar, Technical Report, Institute of Computer Science, Foundation for Research and Technology, - Hellas, 2005
  19. Vibha Tiwari, "MFCC and its applications in speaker recognition", International Journal on Emerging Technologies 1(1): 19-22(2010) ISSN: 0975-8364, 2010
  20. S. Khan, Mohd Rafibul lslam, M. Faizul, D. Doll, "Speaker recognition using MFCC", presented in IJCSES, International Journal of Computer Science and Engineering System, 2008
Index Terms

Computer Science
Information Sciences

Keywords

Feature Extraction Feed Forward Neural Network Speaker Recognition