CFP last date
20 May 2024
Reseach Article

A Comparative Analysis of Speaker Identification on English and Hindi Database

by Anjali Jain, O. P. Sharma
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 84 - Number 12
Year of Publication: 2013
Authors: Anjali Jain, O. P. Sharma
10.5120/14632-3000

Anjali Jain, O. P. Sharma . A Comparative Analysis of Speaker Identification on English and Hindi Database. International Journal of Computer Applications. 84, 12 ( December 2013), 53-56. DOI=10.5120/14632-3000

@article{ 10.5120/14632-3000,
author = { Anjali Jain, O. P. Sharma },
title = { A Comparative Analysis of Speaker Identification on English and Hindi Database },
journal = { International Journal of Computer Applications },
issue_date = { December 2013 },
volume = { 84 },
number = { 12 },
month = { December },
year = { 2013 },
issn = { 0975-8887 },
pages = { 53-56 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume84/number12/14632-3000/ },
doi = { 10.5120/14632-3000 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:00:46.819775+05:30
%A Anjali Jain
%A O. P. Sharma
%T A Comparative Analysis of Speaker Identification on English and Hindi Database
%J International Journal of Computer Applications
%@ 0975-8887
%V 84
%N 12
%P 53-56
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In this paper a text-dependent speaker recognition method is presented by combining Mel frequency cepstrum coefficients (MFCC) and Euclidean distance. The robustness of this speaker identification method for different speaking language is analyzed in this paper. The speaker identification algorithm using English and Hindi Indian voice database (IVD) which contains sentences of data spoken is accomplished. An improvement in recognition rate is observed by using different windows and increasing the number of training voice samples. Accuracy upto 100% can be obtained for text-dependent speaker identification for different windows by using a short training and testing utterance about 4 seconds.

References
  1. P. Kartik, S. R. H. Prasanna and R. V. S. S. V. Prasad, "Multimodal Biometric Person Authentication System using Speech and Signature Feature", Technical Conference of IEEE Region 10 TENCON, Hyderabad, India, pp. 1-6, 2008.
  2. S Furui, "50 years of progress in speech and speaker recognition research", ECTI Transactions on Computer and Information Technology, Vol. 1, No. 2, November 2005.
  3. D. A. Reynolds, "An overview of automatic speaker recognition technology", Proceeding IEEE International Conference on Acoustics, Speech, Signal Process, pp. IV-4072-IV-4075, 2002.
  4. Joseph P. Campbell, "Speaker Recognition: A Tutorial", Proceedings of the IEEE, Vol. 85, No. 9, pp. 1437-1462, September 1997.
  5. H. Gish and M. Schmidt, "Text Independent Speaker Identification", IEEE Signal Processing Magazine, Vol. 11, No. 4, pp. 18-32, 1994.
  6. K. R. Farrell, R. J. Mammone and K. T, Assaleh, "Speaker Recognition Using Neural Network and Conventional Classifiers", IEEE Transaction speech and Audio processing, Vol. 2, No. 1, PART II, Jan. , 1994.
  7. A. E. Rosenberg, "Automatic speaker verification: A review," Proceedings IEEE, Vol. 64, No. 4, pp. 475–487, Apr. 1976.
  8. D. A. Reynolds, "Speaker identification and verification using Gaussian mixture speaker models," Speech Commun. , Vol. 17, No. 1–2, pp. 91–108, 1995.
  9. S. Nakagawa, W. Zhang, and M. Takahashi, "Text-independent speaker recognition by combining speaker specific GMM with speaker adapted syllable-based HMM" in Proceeding ICASSP, Vol. 1, pp. 81–84, 2004.
  10. F. Bimbot et al. , "A tutorial on text-independent speaker verification", EURASIP Journal of Appl. Signal Processing, pp. 430–451, 2004.
  11. T. B. Alderman, "Forensic Speaker Identification, A Likelihood Ratio- Based Approach Using Vowel Formants", ser. Lincom Studies in Phonetics. Munich, Germany: LINCOM, 2005.
  12. Anjali jain, O. P. Sharma, "A Vector Quantization Approach for Voice Recognition Using Mel Frequency Cepstral Coefficient (MFCC): A Review", International Journal of Electronics & Communication Technology (IJECT) Vol. 4, Issue Spl - 4, pp. 26-29, April - June 2013.
Index Terms

Computer Science
Information Sciences

Keywords

Speaker Identification MFCC Euclidean distance classifier Feature extraction and database.