Comparison of VQ and GMM for Text Independent Speaker Identification System for The Bengali Language

Md Mahadi Hasan Nahid; Md Ashraful Islam; Md Saiful Islam

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 20 July 2026

Submit your paper

Know more

The week's pick

Quantifying Label-Induced Bias in Large Language Model Self and Cross Evaluations

Muskan Saraf Sajjad Rezvani Boroujeni Justin Beaudry Hossein Abedi Tom Bush

Random Articles

Video Steganography using Zero Order Hold Method for Secured Data Transmission

Oct

2017

Image Compression using Orthogonal Wavelets Viewed from Peak Signal to Noise Ratio and Computation Time

June

2012

Building a Web-based IDE from Web 2.0 perspective

June

2014

Performing Transactions Simultaneously in Multiple Heterogeneous Database Instances using Vocal Commands with One Time Password Authenticator as an Extended Security Feature

January

2011

Reseach Article

Comparison of VQ and GMM for Text Independent Speaker Identification System for The Bengali Language

by Md Mahadi Hasan Nahid, Md Ashraful Islam, Md Saiful Islam

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 178 - Number 47

Year of Publication: 2019

Authors: Md Mahadi Hasan Nahid, Md Ashraful Islam, Md Saiful Islam

10.5120/ijca2019919354

Md Mahadi Hasan Nahid, Md Ashraful Islam, Md Saiful Islam . Comparison of VQ and GMM for Text Independent Speaker Identification System for The Bengali Language. International Journal of Computer Applications. 178, 47 ( Sep 2019), 18-21. DOI=10.5120/ijca2019919354

@article{ 10.5120/ijca2019919354,

author = { Md Mahadi Hasan Nahid, Md Ashraful Islam, Md Saiful Islam },

title = { Comparison of VQ and GMM for Text Independent Speaker Identification System for The Bengali Language },

journal = { International Journal of Computer Applications },

issue_date = { Sep 2019 },

volume = { 178 },

number = { 47 },

month = { Sep },

year = { 2019 },

issn = { 0975-8887 },

pages = { 18-21 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume178/number47/30866-2019919354/ },

doi = { 10.5120/ijca2019919354 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-07T00:53:21.594527+05:30

%A Md Mahadi Hasan Nahid

%A Md Ashraful Islam

%A Md Saiful Islam

%T Comparison of VQ and GMM for Text Independent Speaker Identification System for The Bengali Language

%J International Journal of Computer Applications

%@ 0975-8887

%V 178

%N 47

%P 18-21

%D 2019

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Speaker identification (SI) is the system to identify the person by the signal pattern of their voices. In recent years, many speaker identification models are proposed, but till now speaker identification technology do not reach their full potential. This paper presents a comprehensive comparative study of VQ and GMM to identify the speaker who speaks in Bengali accent. We consider the problem of text-independent speaker identification. We compare the performance/accuracy of VQ and GMM based Speaker Identification System (SIS). We use Mel Frequency Cepstral Coefficients (MFCC) and Liner Predictive Coding Coefficients (LPCC) for feature extraction.

References

Ling Feng, "Speaker Recognition", IMM-THESIS: ISSN 1601-233X, Kgs. Lyngby 2004
G. Saha, Sandipan Chakroborty, Suman Senapati, “A New Silence Removal and Endpoint Detection Algorithm for Speech and Speaker Recognition Applications” Department of Electronics and Electrical Communication Engineering Indian Institute of Technology, Khragpur, Kharagpur-721 302, India
Yuan Yujin, Zhao Peihua, Zhou Qun,, “Research of speaker recognition based on combination of LPCC and MFCC”, Intelligent Computing and Intelligent Systems (ICIS), IEEE International Conference , vol.3, 29-31 Oct. 2010, pp.765-767. Reynolds, A.D., and Rose, C.R.: “Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models”. IEEE Transactions on Speech and Audio Processing, 3(1): 72-83, 1995.
Ningping Fan, Justinian Rosca, "Enhanced VQ-based Algorithms for Speech Independent Speaker Identification", Siemens Corporate Research Inc., 755 College Road East, Princeton, New Jersey 08540
Douglas Reynolds, "Gaussian Mixture Models" MIT Lincoln Laboratory, 244 Wood St., Lexington, MA 02140, USA.
M.Campbell, D. E. Sturim, D. A. Reynolds: “Support Vector Machines using GMM Super vectors for Speaker Verification”, MIT Lincoln Laboratory.
Tomi Kinnunen, Teemu Kilpeläinen And Pasi Fränti "Comparison Of Clustering Algorithms In Speaker Identification", Department Of Computer Science, University Of Joensuu, P.O.Box 111, 80101 Joensuu, Finland.
Lindasalwa Muda, Mumtaj Begam and I. Elamvazuthi, "Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques", JOURNAL OF COMPUTING, VOLUME 2, ISSUE 3, MARCH 2010, ISSN 2151-9617
Evgeny Karpov, "Real-Time Speaker Identification”, Master’s Thesis, Department of Computer Science, University of Joensuu, Finland, 2003
Lindasalwa Muda, Mumtaj Begam and I. Elamvazuthi, "Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques", JOURNAL OF COMPUTING, VOLUME 2, ISSUE 3, MARCH 2010, ISSN 2151-9617
Kim, Taesun, and Chulhun Seo. "A novel photonic bandgap structure for low-pass filter of wide stopband." IEEE Microwave and Guided Wave Letters 10.1 (2000): 13-15.
Han, W., Chan, C. F., Choy, C. S., & Pun, K. P. (2006, May). An efficient MFCC extraction method in speech recognition. In 2006 IEEE international symposium on circuits and systems (pp. 4-pp). IEEE.
MacLean, K. Voxforge. Ken MacLean. [Online]. Available: http://www. voxforge. org/home. [Acedido em 2016].
Nahid, Md Mahadi Hasan, et al. "Comprehending Real Numbers: Development of Bengali Real Number Speech Corpus." arXiv preprint arXiv:1803.10136 (2018).

Index Terms

Computer Science

Information Sciences

Keywords

Bengali Speaker Identification SI Voice Recognition MFCC LPCC VQ GMM.