Call for Paper - May 2023 Edition
IJCA solicits original research papers for the May 2023 Edition. Last date of manuscript submission is April 20, 2023. Read More

Text Dependent Speaker Recognition using MFCC features and BPANN

International Journal of Computer Applications
© 2013 by IJCA Journal
Volume 74 - Number 5
Year of Publication: 2013
Praveen N
Tessamma Thomas

Praveen N and Tessamma Thomas. Article: Text Dependent Speaker Recognition using MFCC features and BPANN. International Journal of Computer Applications 74(5):31-39, July 2013. Full text available. BibTeX

	author = {Praveen N and Tessamma Thomas},
	title = {Article: Text Dependent Speaker Recognition using MFCC features and BPANN},
	journal = {International Journal of Computer Applications},
	year = {2013},
	volume = {74},
	number = {5},
	pages = {31-39},
	month = {July},
	note = {Full text available}


Mel-Frequency Cepstral Coefficients are spectral feature which are widely used for speaker recognition and text dependent speaker recognition systems are the most accurate in voice based authentication systems. In this paper, a text dependent speaker recognition method is developed. MFCCs are computed for a selected sentence. The first 13 MFCCs are considered for each frames of duration 26ms and each coefficient is clustered to a 5 element cluster centres and finally to a form a 65 element speech code vector for the entire speech. The speech code is trained using a multi-layer perceptron backpropagation gradient descent network and the network is tested for various test patterns. The performance is measured using FAR, FRR and EER parameters. The recognition rate achieved is 96. 18% for a cluster size of 5 in each coefficient.


  • Thomas F. Quatieri, Discrete Time Signal Processing Principles and Practice, Pearson Education Inc. India.
  • J. P. Campbell, Speaker recognition: A tutorial, Proc. IEEE, vol. 85, pp. 1437–1462, 1997.
  • Tomi H. Kinnunen, Optimizing Spectral Feature Based Text-Independent Speaker Recognition, Academic Dissertation, University of Joensuu, 2005.
  • P. Melmerstein and S. B. Davis, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Trans Acoustic, Speech, and Signal Processing, vol 28, no. 4, pp. 357-366,1980.
  • Hassoun, M. H, Fundamentals of Artificial Neural Networks. MIT Press, Cambridge, MA.
  • Y. Linde, A. Buzo, and R. M. Gray, An algorithm for vector quantizer design, IEEE Trans. on Communications, vol. COM_28 (1), pp. 84-96, Jan. 1980.
  • I. A. Basheer, M. Hajmeer, Artificial neural networks: fundamentals, computing, design and application, Journal of microbiological methods 43 (2000) 3–31.
  • Masters, T. , 1994. Practical Neural Network Recipes in C11. Academic Press, Boston, MA.
  • Haykin S, Neural Networks: A Comprehensive Foundation, Macmillan, New York, 2004