Feature Selection Method for Speaker Recognition using Neural Network

Dipen Nath; Sanjib Kr. Kalita

Call for Paper

September Edition

IJCA solicits high quality original research papers for the upcoming September edition of the journal. The last date of research paper submission is 20 August 2026

Submit your paper

Know more

The week's pick

AI-Assisted Observability in Distributed Microservice Architectures

Kyrylo Sotnykov

Random Articles

An Evaluation of Network Topologies for Enhance Networking

Jun

2023

Semantic Web Application in Learning Resource Ontology Repository

April

2016

FRANSAC: Fast RANdom Sample Consensus for 3D Plane Segmentation

Jun

2017

Recommender Systems for Software Requirements Negotiation and Prioritization

May

2015

Reseach Article

Feature Selection Method for Speaker Recognition using Neural Network

by Dipen Nath, Sanjib Kr. Kalita

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 101 - Number 3

Year of Publication: 2014

Authors: Dipen Nath, Sanjib Kr. Kalita

10.5120/17670-8499

Dipen Nath, Sanjib Kr. Kalita . Feature Selection Method for Speaker Recognition using Neural Network. International Journal of Computer Applications. 101, 3 ( September 2014), 38-44. DOI=10.5120/17670-8499

@article{ 10.5120/17670-8499,

author = { Dipen Nath, Sanjib Kr. Kalita },

title = { Feature Selection Method for Speaker Recognition using Neural Network },

journal = { International Journal of Computer Applications },

issue_date = { September 2014 },

volume = { 101 },

number = { 3 },

month = { September },

year = { 2014 },

issn = { 0975-8887 },

pages = { 38-44 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume101/number3/17670-8499/ },

doi = { 10.5120/17670-8499 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T22:30:45.960511+05:30

%A Dipen Nath

%A Sanjib Kr. Kalita

%T Feature Selection Method for Speaker Recognition using Neural Network

%J International Journal of Computer Applications

%@ 0975-8887

%V 101

%N 3

%P 38-44

%D 2014

%I Foundation of Computer Science (FCS), NY, USA

Abstract

The aim of this paper is to extract and select features from speech signal that will make it possible to have acceptable speaker recognition rate in real life. A variety of combinations among formants (F1, F2, F3), Linear Predictive Coefficients (LPC), Mel Frequency Cepstral Coefficients (MFCC) and delta- Mel Frequency Cepstral Coefficients representing features are considered and their effect in speaker recognition is observed. Two similar volume data sets with differed string (words) are considered in the present study. These two data sets are prepared taking into account two differed data sampling rates. The study reveals another interesting fact that the selection of strings in speaker enrollment process is a matter of importance for accurate result. This means that the speaker will be tested for authentication with the same string with which he was enrolled earlier during the time of his first access to the system.

References

Adjoudj Reda, Boukelif Aoued, "Artificial Neural Network & Mel-Frequency Cepstrum Coefficients-Based Speaker Recognition", 3rd International Conference: Sciences of Electronic, Technologies of Information and Telecommunications--TUNISIA, March 27-31, 2005
Mark K. Transtrum and James P. Sethna "Improvements to the Levenberg-Marquardt algorithm for nonlinear least-squares minimization," Preprint submitted to Journal of Computational Physics, January 30, 2012.
Kshamamayee Dash, Debananda Padhi, Bhoomika Panda, Prof. Sanghamitra Mohanty, "Speaker Identification using Mel Frequency Cepstral Coefficient and BPNN", International Journal of Advanced Research in Computer Science and Software Engineering, Volume 2, Issue 4, ISSN: 2277 128X, April 2012
Praveen N, Tessamma Thomas, "Text dependent speaker recognition using MFCC features and BPANN", International Journal of Computer Applications (0975 – 8887), Volume 74– No. 5, July 2013
Bishnu Prasad Das, Ranjan Parekh, "Recognition of Isolated Words using Features based on LPC, MFCC, ZCR and STE with Neural Network Classifiers", International Journal of Modern Engineering Research (IJMER) ,Vol. 2, Issue. 3, pp-854-858 [ISSN: 2249-6645], May-June 2012
Lajish V. L , Sunil Kumar R. K and Vivek P, "Speaker identification using a nonlinear speech model and ANN", International Journal of Advanced Information Technology (IJAIT) Vol. 2, No. 5, October 2012
Thiang, Suryo Wijoyo. "Speech Recognition Using Linear Predictive Coding and Artificial Neural Network for Controlling Movement of Mobile Robot". 2011 International Conference on Information and Electronics Engineering IPCSIT vol. 6 © (2011) IACSIT Press, Singapore, 2011
Talukdar, P. H. , Bhattacharjee, U. , Goswami, C. K. , Barman, J. , "Cepstral Measure of Boro Vowels through LPC-Analysis", Journal of the CSI, Vol. 34 No 1, Jan – Mar, 2004
Kalita S. K. , Dutta R. , and Talukdar P. H. , "A spectral analysis of Bodo and Assamese vowels", Abstracts 3rd International Conference on "Computers and Devices for Communication". CODEC – 06, Kolkata, India, pp. 41, 2006
Braman, J. , Kalita, S. , Talukdar, P. H. , "Features extraction of bodo vowels through lpc-analysis", Proceedings of Frontiers of Research on Speech and Music (FRMS-2004), 2004
Hasan Rashidul, Jamil Mustafa, Rabbani Golam, Rahman Saifur, "Speaker identification using mel frequency cepstral coefficients", 3rd International Conference on Electrical & Computer Engineering, Dhaka, Bangladesh, ICECE 2004, 28-30 December 2004
Rabiner L. , Juang B. H. and Yegnanarayana B. – "Fundamentals of Speech Processing", Pearson Education, ISBN 978-81-775-8560-5, 2011
D. Ripley, "Neural Networks and Related Methods for Classification", Journal of the Royal Statistical Society. Series B (Methodological), Vol. 56, No. 3(1994), pp. 409-456, 1994
Rabiner L. and Juang B. H. – "Fundamental of Speech Processing", Prentice-Hall, 1993
Bishop, C. , "Neural Networks for Pattern Recognition", Oxford University Press, Oxford, 1995
Haykin, S. , "Neural Networks - A Comprehensive Foundation", 2nd ed. Prentice-Hall, Englewood Cliffs, 1998
K. Levenberg. "A Method for the Solution of Certain Non-Linear Problems in Least Squares". The Quarterly of Applied Mathematics, 2: 164-168, 1994
M. I. A. Lourakis. , "A brief description of the Levenberg-Marquardt algorithm" implemented by levmar, Technical Report, Institute of Computer Science, Foundation for Research and Technology, - Hellas, 2005
Vibha Tiwari, "MFCC and its applications in speaker recognition", International Journal on Emerging Technologies 1(1): 19-22(2010) ISSN: 0975-8364, 2010
S. Khan, Mohd Rafibul lslam, M. Faizul, D. Doll, "Speaker recognition using MFCC", presented in IJCSES, International Journal of Computer Science and Engineering System, 2008

Index Terms

Computer Science

Information Sciences

Keywords

Feature Extraction Feed Forward Neural Network Speaker Recognition