CFP last date
20 May 2024
Reseach Article

Automatic Speaker Recognition using Circular Sectorization of the DFT Complex Plane

Published on None 2011 by H. B. Kekre, Vaishali Kulkarni
journal_cover_thumbnail
International Conference and Workshop on Emerging Trends in Technology
Foundation of Computer Science USA
ICWET - Number 5
None 2011
Authors: H. B. Kekre, Vaishali Kulkarni
61428acd-2497-4db6-baae-130af5ad79ab

H. B. Kekre, Vaishali Kulkarni . Automatic Speaker Recognition using Circular Sectorization of the DFT Complex Plane. International Conference and Workshop on Emerging Trends in Technology. ICWET, 5 (None 2011), 35-41.

@article{
author = { H. B. Kekre, Vaishali Kulkarni },
title = { Automatic Speaker Recognition using Circular Sectorization of the DFT Complex Plane },
journal = { International Conference and Workshop on Emerging Trends in Technology },
issue_date = { None 2011 },
volume = { ICWET },
number = { 5 },
month = { None },
year = { 2011 },
issn = 0975-8887,
pages = { 35-41 },
numpages = 7,
url = { /proceedings/icwet/number5/2098-bm112/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference and Workshop on Emerging Trends in Technology
%A H. B. Kekre
%A Vaishali Kulkarni
%T Automatic Speaker Recognition using Circular Sectorization of the DFT Complex Plane
%J International Conference and Workshop on Emerging Trends in Technology
%@ 0975-8887
%V ICWET
%N 5
%P 35-41
%D 2011
%I International Journal of Computer Applications
Abstract

In this paper, we propose automatic speaker recognition using circular DFT (Discrete Fourier Transform) sectors. In the first method, the DFT of the speech signal is taken and feature vectors are extracted by dividing the complex DFT spectrum into circular sectors and then taking the weighted density count of the number of points in each of these sectors. In the second method, the speech signal is first divided into frames and these frames are arranged as columns of a matrix. DFT is applied to this matrix and then the complex DFT plane is similarly divided into circular sectors and feature vectors are extracted as in the first method. The results show that this approach gives fairly good speaker recognition (70 % -80%) for the Ist method. Also the results improve as the circular sectors are further divided into quadrants. For the IInd method the accuracy does not improve as the circular sectors are further divided. The best accuracy is obtained with 7 circular sectors.

References
  1. Lawrence Rabiner, Biing-Hwang Juang and B.Yegnanarayana, “Fundamental of Speech Recognition”, Prentice-Hall, Englewood Cliffs, 2009.
  2. S Furui, “50 years of progress in speech and speaker recognition research”, ECTI Transactions on Computer and Information Technology, Vol. 1, No.2, November 2005.
  3. D. A. Reynolds, “An overview of automatic speaker recognition technology”, Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP’02), 2002, pp. IV-4072–IV-4075.
  4. Joseph P. Campbell, Jr., Senior Member, IEEE, “Speaker Recognition: A Tutorial”, Proceedings of the IEEE, vol. 85, no. 9, pp. 1437-1462, September 1997.
  5. F. Bimbot, J.-F. Bonastre, C. Fredouille, G. Gravier, I. Magrin-Chagnolleau, S. Meignier, T. Merlin, J. Ortega-García, D.Petrovska-Delacrétaz, and D. A. Reynolds, “A tutorial on text-independent speaker verification,” EURASIP J. Appl. Signal Process., vol. 2004, no. 1, pp. 430–451, 2004.
  6. D. A. Reynolds, “Experimental evaluation of features for robust speaker identification,” IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 639–643, Oct. 1994.
  7. Tomi Kinnunen, Evgeny Karpov, and Pasi Fr¨anti, “Realtime Speaker Identification”, ICSLP2004.
  8. Marco Grimaldi and Fred Cummins, “Speaker Identification using Instantaneous Frequencies”, IEEE Transactions on Audio, Speech, and Language Processing, vol., 16, no. 6, August 2008.
  9. Zhong-Xuan, Yuan & Bo-Ling, Xu & Chong-Zhi, Yu. (1999). “Binary Quantization of Feature Vectors for Robust Text-Independent Speaker Identification” in IEEE Transactions on Speech and Audio Processing, Vol. 7, No. 1, January 1999. IEEE, New York, NY, U.S.A.
  10. Dr. H B Kekre, Vaishali Kulkarni,”Speaker Identification using Power Distribution in Frequency Spectrum”, Technopath, Journal of Science, Engineering & Technology Management, Vol. 02, No.1, January 2010.
  11. Dr. H B Kekre, Vaishali Kulkarni, “Speaker Identification by using Power Distribution in Frequency Spectrum”, ThinkQuest - 2010 International Conference on Contours of Computing Technology”, BGIT, Mumbai,13th -14th March 2010.
  12. H B Kekre, Vaishali Kulkarni, “Speaker Identification by using Vector Quantization”, International Journal of Engineering Science and Technology, May 2010.
  13. H B Kekre, Vaishali Kulkarni, “Performance Comparison of Speaker Recognition using Vector Quantization by LBG and KFCG”, International Journal of Computer Applications, vol. 3, July 2010.
  14. H B Kekre, Vaishali Kulkarni, “Performance Comparison of Automatic Speaker Recognition using Vector Quantization by LBG KFCG and KMCG”, International Journal of Computer Science and Security, Vol: 4 Issue: 4, 2010.
  15. H B Kekre, Vaishali Kulkarni, “Comparative Analysis of Automatic Speaker Recognition using Kekre’s Fast Codebook Generation Algorithm in Time Domain and Transform Domain”, International Journal of Computer Applications, Volume 7 No.1. September 2010.
  16. H B Kekre, Dhirendra Mishra, “Performance Comparison of Density Distribution and Sector mean of sal and cal functions in Walsh Transform Sectors as Feature Vectors for Image Retrieval”, International Journal of Image Processing ,Volume :4, Issue:3, 2010.
  17. H B Kekre, Dhirendra Mishra, “CBIR using Upper Six FFT Sectors of Color Images for Feature Vector Generationl”, International Journal of Engineering and Technology,Volume :2(2) ”, 2010.
  18. H B Kekre, Dhirendra Mishra, “Performance Comparison of Four, Eight & Twelve Walsh Transform Sectors Feature Vectors for Image Retrieval from Image Databases”, International Journal of Engineering Science and Technology”, Volume :2(5) , 2010.
  19. H B Kekre, Dhirendra Mishra, “Four Walsh Transform Sectors Feature Vectors for Image Retrieval from Image Databases ”, International Journal of Computer Science and Information Technologies”, Volume :1(2) , 2010.
  20. H B Kekre, Dhirendra Mishra, “Digital Image Search & Retrieval using FFT Sectors of Color Images”, International Journal of Computer Science and Engineering”, Volume :2 , No.2, 2010.
Index Terms

Computer Science
Information Sciences

Keywords

Discrete Fourier Transform (DFT) circular sectors Euclidean distance