CFP last date
20 May 2024
Reseach Article

Text Dependent Speaker Identification using Hidden Markchov Model and Mel Frequency Cepstrum Coefficient

by Mohd. Manjur Alam, Md. Salah Uddin Chowdury, Niaz Uddin Mahmud
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 104 - Number 14
Year of Publication: 2014
Authors: Mohd. Manjur Alam, Md. Salah Uddin Chowdury, Niaz Uddin Mahmud
10.5120/18272-9360

Mohd. Manjur Alam, Md. Salah Uddin Chowdury, Niaz Uddin Mahmud . Text Dependent Speaker Identification using Hidden Markchov Model and Mel Frequency Cepstrum Coefficient. International Journal of Computer Applications. 104, 14 ( October 2014), 33-37. DOI=10.5120/18272-9360

@article{ 10.5120/18272-9360,
author = { Mohd. Manjur Alam, Md. Salah Uddin Chowdury, Niaz Uddin Mahmud },
title = { Text Dependent Speaker Identification using Hidden Markchov Model and Mel Frequency Cepstrum Coefficient },
journal = { International Journal of Computer Applications },
issue_date = { October 2014 },
volume = { 104 },
number = { 14 },
month = { October },
year = { 2014 },
issn = { 0975-8887 },
pages = { 33-37 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume104/number14/18272-9360/ },
doi = { 10.5120/18272-9360 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:36:10.607146+05:30
%A Mohd. Manjur Alam
%A Md. Salah Uddin Chowdury
%A Niaz Uddin Mahmud
%T Text Dependent Speaker Identification using Hidden Markchov Model and Mel Frequency Cepstrum Coefficient
%J International Journal of Computer Applications
%@ 0975-8887
%V 104
%N 14
%P 33-37
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Speaker identification is a biometric process. The objective of speaker identification is to extract, characterize and recognize the information about speaker identity. Speaker Recognition technology has recently been used in a vast number of commercial areas successfully such as in voice based biometrics; voice controlled appliances, security control for confidential information, remote access to computers and many more interesting areas. A speaker identification system has two phases which are the training phase and the testing phase. Feature extraction is the first step for each phase in speaker recognition. Many algorithms are used for feature extraction. In this work, the Mel Frequency Cepstrum Coefficient (MFCC) feature has been used for designing a text dependent speaker identification system. In the identification phase, the existing reference templates are compared with the unknown voice input. In this thesis, Hidden Markov Model (HMM) method is used as the training/recognition algorithm which makes the final decision about the specification of the speaker by comparing unknown features to all models in the database and selecting the best matching model. i, e. the highest scored model. The speaker who obtains the highest score is selected as the target speaker.

References
  1. Speaker Verification using Vector Quantization and Hidden Markov Model. Mohd Zaizu Ilyas, Member, IEEE, Salina Abdul Samad, Senior Member, IEEE, Aini Hussain, Member , IEEE and Khairul Anuar Ishak, Member, IEEE, "The 5th Student Conference on Research and Development –SCOReD 2007 11-12 December 2007, Malaysia.
  2. MFCC and its applications in speaker recognition, Vibha Tiwari. Deptt. of Electronics Engg. , Gyan Ganga Institute of Technology and Management, Bhopal, (MP) INDIA (Received 5 Nov. , 2009, Accepted 10 Feb. , 2010), "International Journal on Emerging Technologies 1(1): 19-22(2010) ISSN : 0975-8364.
  3. Text-Independent Speaker Identification Using Hidden Markov Model Sayed Jaafer Abdallah, Izzeldin Mohamed Osman, Mohamed Elhafiz Mustafa, College of Computer Science and Information Technology Sudan University of Science and Technology, "World of Computer Science and Information Technology Journal (WCSIT) ISSN: 2221-0741 Vol. 2, No. 6, 203-208, 2012 Khartoum, Sudan.
  4. SPEAKER IDENTIFICATION USING MEL FREQUENCY CEPSTRAL COEFFICIENTS, Md. Rashidul Hasan, Mustafa Jamil, Md. Golam Rabbani Md. Saifur Rahman, Electrical and Electronic Engineering, Bangladesh University of Engineering and Technology, "Dhaka-1000, 3rd International Conference on Electrical & Computer Engineering ICECE 2004, 28-30 December 2004, Dhaka, Bangladesh.
  5. Speaker Recognition Using MFCC Front End Analysis and VQ Modeling Technique for Hindi Words using MATLAB, Nitisha M. Tech. (pursuing) Hindu College of engineering Sonipat, Haryana India, Ahu Bansal Assistant professor Hindu College of engineering Sonipat, Haryana, India. "International Journal of Computer Applications (0975 – 8887) Volume 45– No. 24, May 2012.
  6. HMM Speaker Identification Using Linear and Non-linear Merging Techniques Unathi Mahola, Fulufhelo V. Nelwamondo, Tshilidzi Marwala School of Electrical and information Engineering University of the Witwatersrand, Johannesburg, South Africa.
  7. J. M. Naik, "Speaker Verification: A Tutorial", IEEE Communication Magazine, January 1990, pp. 42-48.
  8. J. P. Campbell, "Speaker Recognition: A tutorial ", Proc. of the IEEE, Vol. 85, No. 9, September 1997, pp. 1437 – 1462.
  9. L. R. Rabiner, "A Tutorial on Hidden Markov Models and Selected Application in Speech Recognition", Proceeding of The IEEE, Vol. 77, No. 2, February 1989.
  10. L. R. Rabiner and B. H. Juang, Fundamental of Speech Recognition, Prentice Hall, New Jersey, 1993.
  11. National Science and Technology Council (NTSC), "Speaker Recognition", 2006. [Online]. Available: http://www. biometricscatalog. org/NSTCSubcommittee/Documents/Speaker%20Recognition. pdf [7 August 2006].
Index Terms

Computer Science
Information Sciences

Keywords

Mel Frequency Cepstrum Coefficient (MFCC) Hidden Markchov Model (HMM) Speaker Identification (SI) Fast Fourier Transform (FFT).