|
10.5120/806-1146 |
Jagdev Kaur, Reetkamal Kaur and Manjot Kaur Gill. Article: Vector Quantization based Speaker Identification. International Journal of Computer Applications 4(2):1–4, July 2010. Published By Foundation of Computer Science. BibTeX
@article{key:article,
author = {Jagdev Kaur and Reetkamal Kaur and Manjot Kaur Gill},
title = {Article: Vector Quantization based Speaker Identification},
journal = {International Journal of Computer Applications},
year = {2010},
volume = {4},
number = {2},
pages = {1--4},
month = {July},
note = {Published By Foundation of Computer Science}
}
Abstract
The automatic speaker recognition technologies have developed into more and more important modern technologies required by many speech-aided applications. The main challenge for automatic speaker recognition is to deal with the variability of the environments and channels from where the speech was obtained. Speaker recognition system is a system which recognizes the speaker as opposed to what is being said by the speaker as in case of speech recognition. Speaker recognition technology makes it possible to the speaker's voice to control access to restricted services, for example, phone access to banking, database services, shopping or voice mail, and access to secure equipments. The main aim of this paper is speaker identification, which consists of comparing a speech signal from an unknown speaker to a database of known speakers. The methodology followed in this paper for Speaker identification is using Feature Extraction process and then Vector Quantization of extracted features is done using k-means algorithm. At last, the speaker is identified by comparing the data from a tested speaker to the database of each speaker and then measuring the difference.
Reference
-
E. Karpov, “Real-Time Speaker Identification”, Master's thesis, University of Joensuu Department of Computer Science, 2003.
J. R. Deller, J. G. Proakis and J. H. L. Hansen, “Discrete-time Processing of Speech Signals”, Prentice Hall, New Jersey, 1993.
Jain, A., and Zongker, D., “Feature selection: evaluation, application, and small sample performance”. IEEE Trans. on Pattern Analysis and Machine Intelligence 19(1997), 153–158.
K. Sayood, “Introduction to Data Compression”, Second Edition, Morgan Kaufmann Publishers, San Francisco, California, 2000.
Mike N., Wei W. (2004), “Speaker Recognition”, http://cslu.cse.ogi.edu/HLTsurvey/ch1node47.html
Picone, J. (1993), “Signal modeling techniques in Speech Recognition”, IEEE ASSP Magazine, Vol. 81, Issue 9, pp. 1215 – 1247.
Soong, F., A.E., A. R., Juang, B.-H., and Rabiner, L., “A vector quantization approach to speaker recognition”. AT & T Technical Journal 66 (1987), 14–26.
T. Kinnunen and P. Franti, “Speaker Discriminative Weighting Method for VQ-Based Speaker Identification”, Proc. 3rd International Conference on audio and video-based biometric person authentication (AVBPA), Halmstad, Sweden, 2001.
T. Kinnunen and P. Franti, “Spectral Features for Automatic Text-Independent Speaker Recognition” Licentiate’s Thesis. http://www.cs.joensuu.fi/pages/pums/public_results/2004_PhLic_Kinnunen_Tomi.pdf.
http://www.lsv.univsarland.de/dsp_ss05_chap9.pdf
http://en.wikipedia.org/wiki/Euclidean_distance
http://en.wikipedia.org/wiki/Mel_frequency_cepstral_coefficient
UNITED STATES




