MFCC VQ based Speaker Recognition and Its Accuracy Affecting Factors

Satyanand Singh; Dr. E.G Rajan

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 20 July 2026

Submit your paper

Know more

The week's pick

Quantifying Label-Induced Bias in Large Language Model Self and Cross Evaluations

Muskan Saraf Sajjad Rezvani Boroujeni Justin Beaudry Hossein Abedi Tom Bush

Random Articles

An Improvement of Forgery Video Detection Technique using Error Level Analysis

February

2015

Comment on " Application of Improved - Expansion Method to Traveling Wave Solutions of Two Nonlinear Evolution Equations, Adv. Appl. Math. Mech. 4(2012) 122-130"

January

2014

Adaptive Neural Network Controller for Modeling Link Quality in WSANs

January

2013

Effect of Reaction Rate on Dispersion of Atmospheric Aerosols in the Presence of Electric Field

April

2012

Reseach Article

MFCC VQ based Speaker Recognition and Its Accuracy Affecting Factors

by Satyanand Singh, Dr. E.G Rajan

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 21 - Number 6

Year of Publication: 2011

Authors: Satyanand Singh, Dr. E.G Rajan

10.5120/2519-3423

Satyanand Singh, Dr. E.G Rajan . MFCC VQ based Speaker Recognition and Its Accuracy Affecting Factors. International Journal of Computer Applications. 21, 6 ( May 2011), 1-6. DOI=10.5120/2519-3423

@article{ 10.5120/2519-3423,

author = { Satyanand Singh, Dr. E.G Rajan },

title = { MFCC VQ based Speaker Recognition and Its Accuracy Affecting Factors },

journal = { International Journal of Computer Applications },

issue_date = { May 2011 },

volume = { 21 },

number = { 6 },

month = { May },

year = { 2011 },

issn = { 0975-8887 },

pages = { 1-6 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume21/number6/2519-3423/ },

doi = { 10.5120/2519-3423 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T20:07:45.732227+05:30

%A Satyanand Singh

%A Dr. E.G Rajan

%T MFCC VQ based Speaker Recognition and Its Accuracy Affecting Factors

%J International Journal of Computer Applications

%@ 0975-8887

%V 21

%N 6

%P 1-6

%D 2011

%I Foundation of Computer Science (FCS), NY, USA

Abstract

The present study was conducted to evaluate the accuracy affecting factors of a Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) based speaker recognition system. This investigation analyses the factors that affecting recognition accuracy using speech signal from day to day life in surrounding environments. It was studied the mismatch affects of text-dependency, voice sample length, speaking language, speaking style, mimicry, the quality of microphone, utterance sample quality and surrounding noise. The corpuses of 10 people of 20 utterance subjects were collected which were indicate that any mismatch degrades recognition accuracy. It was found that most dominating factors that degrades the accuracy of speaker recognition systems were surrounding noise, quality of microphone by which voice sample were collected, disguise, and degrading of the sample rate and quality. Speech-related factors and sample length were less critical.

References

Gatica-Perez, G. Lathoud, J.-M. Odobez and I. Mc Cowan. 2007 Audiovisual probabilistic tracking of multiple speakers in meetings, IEEE Transactions on Speech and Audio Processing, 15(2), pp. 601–616.
J. P. Cambell, Jr. 1997 Speaker Recognition A Tutorial Proceedings of the IEEE, 85(9), pp. 1437-1462.
Faundez-Zanuy M. and Monte-Moreno E. 2005 State-of-the-art in speaker recognition , Aerospace and Electronic Systems Magazine, IEEE, 20(5), pp. 7-12.
K. Saeed and M. K. Nammous. 2005 Heuristic method of Arabic speech recognition, in Proc. IEEE 7th Int. Conf. DSPA, Moscow, Russia, pp. 528–530.
Lamel, L.F. and Gauvain, J.L., 2000. Speaker Verification over the Telephone, Speech Communication, pp. 141–154.
Ortega-Garcia, J., Gonz´alez-Rodriguez, J., et al., May 1998 AHUMADA: A large speech corpus in Spanish for speaker identification and verification, IEEE Intl. Conf. on Acoust. Speech and Signal Proc, pp. 773–776.
Singh Satyanand, Dr. E.G Rajan. March 2011 Vector Quantization Approach for Speaker Recognition Using MFCC and Inverted MFCC, International Journal of Computer Applications,17(1), pp. 1-7 .
Yegnanarayana B., Prasanna S.R.M., Zachariah J.M. and Gupta C. S. 2005 Combining evidence from source suprasegmental and spectral features for a fixed-text speaker verification system , IEEE Trans. Speech and Audio Processing, 13(4), pp. 575-582.
J. Kittler, M. Hatef, R. Duin, J. Mataz. 1998 On combining classifiers, IEEE Trans, Pattern Anal. Mach. Intell, 20(3), pp. 226-239.
He, J., Liu, L., Palm, G. 1999 A Discriminative Training Algorithm for VQ-based Speaker Identification , IEEE Transactions on Speech and Audio Processing, 7(3), pp. 353-356.
Laurent Besacier and Jean-Francois Bonastre. 2000 Subband architecture for automatic speaker recognition, Signal Processing, 80, pp. 1245-1259.
Ganchev, T., Fakotakis, N., and Kokkinakis, G. 2005 Comparative Evaluation of Various MFCC Implementations on the Speaker Verification Task, Proc. of SPECOM Patras, Greece, pp. 1191-194.
Zheng F., Zhang, G. and Song, Z. 2001 Comparison of different implementations of MFCC, J. Computer Science & Technology 16(6), pp. 582-589.

Index Terms

Computer Science

Information Sciences

Keywords

GF Triangular Filter Subbands Correlation MFCC inverted MFCC Vector Quantization