CFP last date
22 April 2024
Call for Paper
May Edition
IJCA solicits high quality original research papers for the upcoming May edition of the journal. The last date of research paper submission is 22 April 2024

Submit your paper
Know more
Reseach Article

Multi Lingual Speaker Identification on Foreign Languages using Artificial Neural Network

by Prateek Agrawal, Harjeet Kaur, Gurpreet Kaur
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 57 - Number 13
Year of Publication: 2012
Authors: Prateek Agrawal, Harjeet Kaur, Gurpreet Kaur
10.5120/9177-3584

Prateek Agrawal, Harjeet Kaur, Gurpreet Kaur . Multi Lingual Speaker Identification on Foreign Languages using Artificial Neural Network. International Journal of Computer Applications. 57, 13 ( November 2012), 36-42. DOI=10.5120/9177-3584

@article{ 10.5120/9177-3584,
author = { Prateek Agrawal, Harjeet Kaur, Gurpreet Kaur },
title = { Multi Lingual Speaker Identification on Foreign Languages using Artificial Neural Network },
journal = { International Journal of Computer Applications },
issue_date = { November 2012 },
volume = { 57 },
number = { 13 },
month = { November },
year = { 2012 },
issn = { 0975-8887 },
pages = { 36-42 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume57/number13/9177-3584/ },
doi = { 10.5120/9177-3584 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:01:21.323837+05:30
%A Prateek Agrawal
%A Harjeet Kaur
%A Gurpreet Kaur
%T Multi Lingual Speaker Identification on Foreign Languages using Artificial Neural Network
%J International Journal of Computer Applications
%@ 0975-8887
%V 57
%N 13
%P 36-42
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Based on the Back Propagation Algorithm, this paper portrait a method for speaker identification in multiple foreign languages. In order to identify speaker, the complete process goes through recording of the speech utterances of different speakers in multiple foreign languages, features extraction, data clustering and system training. In order to realize the purpose, a database has been prepared which contains one sentence in 8 different international languages i. e. Catalan, French, Finnish, Italian, Portuguese, Indonesian, Hindi, English spoken by 19 distinct speakers, both male and female, in each language. With total size of 760 speech utterances, the average performance of the system is 95. 657%. Application of developed system is mainly used in speaker authentication in telephony security oriented applications where the normal conversations are of short durations and the tendency of the spokesperson is to switch language from one to another

References
  1. Love,C. and Kinsne,W. , A Speech Recognition System Using A Neural Network Model For Vocal Shaping, Department of Electrical and Computer Engineering, University of Manitoba Winnipeg, Manitoba, Canada R3T- 2N2, March 1991, 198 pp.
  2. Chougule,S. and Rege,P. , Language Independent Speaker Identification, IEEE 3rd international conference, ieeexplore pp 364-368, May 2006.
  3. Mak,M. W. ; Allen,W. G. and Sexton,G. G. , Speaker Identification Using Radial Basis Functions University of Northumbria at Newcastle, U. K. , IEEE 3rd International Conference, ieeexplore pp. 138-142, 1993
  4. Bum,D. , Experiments On Neural Net Recognition Of spoken And Written, Text IEEE Transaction on Acoust. . Speech and Signal Proc. , vol. ASSP-36. no. 7. pp. 1162-1168, 1988.
  5. Deiri,Z. and Botros ,N. , LPC-Based Neural Network For Automatic Speech Recognition, Proc. IEEE Engineering in Medicine and Biology Soc. , IEEE Cat. no. 90 CH2936-3. pp. 1429-1430, 1990
  6. Philip P. Wassermann,, Neural Computing: Theory and Practice , VNR, New York, 1989
  7. Richard P. Lipmann, Review of Neural Networks for Speech Recognition, Neural Computation 1, pp. 1-38, Massachusets Institute of Technology, 1989.
  8. Zebdum, RS; Guedes, K; Vellasco, M. ; Pacheco, M. A,. Short Term Load Forecasting Using Neural Nets , Proceedings of the International Workshop on Artificial Neural Networks, LNCS No. 930, Springer Verlag, Torremohos, Spain, JuneJ995
  9. Zurada ,J. M. , Introduction to W c i a l Neural Systems , West Publishing Company, 1992.
  10. K. Rahul, S. Anupam, T Ritu, Fuzzy Neuro Systems for Machine Learning for Large Data Sets , Proceedings of the IEEE International Advance Computing Conference, ieeexplore, pp 223-227, March 2009, Patiala, India
  11. K. Santhosh, C. Mohandas ,V. P. and H. Li, Multilingual Speech Recognition: A Unified Approach , Proceedings of Interspeech 2005, (Eurospeech - 9th European Conference on Speech Communication and Technology), Lisboa, Sept. 2005.
  12. S. C. Kumar; L. Haizhou, Language identification System for Multilingual Speech Recognition Systems , Proceedings of the 9th International Conference Speech and Computer (SPECOM 2004), St. Petersburg, Russia, Sept. 2004.
  13. S. Hanwu, M. Bin, L. Haizhou, An Efficient Feature Selection Method for Speaker Recognition , International Symposium on Chinese Spoken Language Processing, December 16-19, 2008, China.
  14. M. Bin, G. Cuntai, L. Haizhou, L. Chin-Hu, Multilingual Speech Recognition with Language Identification , International Conference on Spoken Language Processing (ICSLP), DENVER-COLORADO, Sept. 16-20, 2002.
  15. N. Mohini, P. Manoj, P. Vijay, M. K. Prabhat, Vocal digit recognition using Artificial Neural Networks ,Dept. of Computer Engineering, VPCOE, Baramati in (2010 2nd International Conference on Computer Engineering and Technology) Page(s): V6-88 - V6-91
  16. A. Prateek, S. Anupam and T. Ritu, Multi Lingual Speaker Recognition Using Artificial Neural Networks , Advances in Computational Intelligence, Springer VerLlog,, 2009 , pages 1-9.
  17. http://www. sp4comm. org/webversion. html
  18. http://www. mathworks. in/help/toolbox/dsp/ref/lpctofromcepstralcoefficients. html
Index Terms

Computer Science
Information Sciences

Keywords

Artificial Neural Network (ANN) Back Propagation Algorithm (BPA) Cepstral Analysis Multilingual Speaker Recognition Power Spectral Density (PSD)