Glottal Excitation Feature based Gender Identification System using Ergodic HMM

R. Rajeshwara Rao; A. Prasad

Research Article

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 17 - Number 3

Year of Publication: 2011

Authors: R. Rajeshwara Rao, A. Prasad

10.5120/2200-2794

R. Rajeshwara Rao, A. Prasad. Glottal Excitation Feature based Gender Identification System using Ergodic HMM. International Journal of Computer Applications. 17, 3 (March 2011), 31-36. DOI=10.5120/2200-2794

@article{ 10.5120/2200-2794,

author = { R. Rajeshwara Rao and A. Prasad },

title = { Glottal Excitation Feature based Gender Identification System using Ergodic HMM },

journal = { International Journal of Computer Applications },

issue_date = { March 2011 },

volume = { 17 },

number = { 3 },

month = { March },

year = { 2011 },

issn = { 0975-8887 },

pages = { 31-36 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume17/number3/2200-2794/ },

doi = { 10.5120/2200-2794 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T20:04:40.769101+05:30

%A R. Rajeshwara Rao

%A A. Prasad

%T Glottal Excitation Feature based Gender Identification System using Ergodic HMM

%J International Journal of Computer Applications

%@ 0975-8887

%V 17

%N 3

%P 31-36

%D 2011

%I Foundation of Computer Science (FCS), NY, USA

Abstract

This paper demonstrates, through several experimental studies, that the time-varying glottal excitation component of speech can be exploited for text-independent gender recognition. The linear prediction (LP) residual is used to represent the excitation information in speech. Gender-specific information in the excitation of voiced speech is captured using hidden Markov models (HMMs). The decrease in error during training, together with recognition accuracy close to 100% during testing, demonstrates that the excitation component of speech contains gender-specific information and that a continuous ergodic HMM captures it effectively. Gender recognition performance is evaluated for different numbers of HMM states, numbers of mixture components, and sizes of testing data. The studies are carried out on the TIMIT database.
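As an illustration of the excitation representation mentioned in the abstract, the following is a minimal sketch of computing an LP residual for one frame using the autocorrelation method and the Levinson-Durbin recursion. The abstract does not state the LP order, frame length, or any preprocessing, so the order of 2, the 400-sample frame, and the synthetic impulse-train signal are assumptions chosen only to keep the example small; all function names are hypothetical. The HMM modelling stage is not shown.

```python
def autocorrelation(frame, max_lag):
    """Return autocorrelation values r[0..max_lag] of one frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i - lag] for i in range(lag, n))
            for lag in range(max_lag + 1)]


def lp_coefficients(frame, order):
    """Levinson-Durbin recursion (autocorrelation method).

    Returns a[1..order] such that s_hat[n] = sum_k a[k] * s[n - k].
    """
    r = autocorrelation(frame, order)
    a = [0.0] * (order + 1)  # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # silent frame: nothing to predict
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        prev = a[:]
        a[i] = k
        for j in range(1, i):
            a[j] = prev[j] - k * prev[i - j]
        err *= 1.0 - k * k
    return a[1:]


def lp_residual(frame, coeffs):
    """Residual e[n] = s[n] - sum_k a[k] * s[n - k]; samples before the frame are taken as zero."""
    residual = []
    for n, sample in enumerate(frame):
        predicted = sum(c * frame[n - k]
                        for k, c in enumerate(coeffs, start=1) if n - k >= 0)
        residual.append(sample - predicted)
    return residual


def synthetic_voiced_frame(length=400, pitch_period=40):
    """Assumed test signal: an impulse train passed through a fixed two-pole filter."""
    frame = []
    for n in range(length):
        excitation = 1.0 if n % pitch_period == 0 else 0.0
        s1 = frame[n - 1] if n >= 1 else 0.0
        s2 = frame[n - 2] if n >= 2 else 0.0
        frame.append(1.3 * s1 - 0.4 * s2 + excitation)
    return frame


frame = synthetic_voiced_frame()
coeffs = lp_coefficients(frame, order=2)
residual = lp_residual(frame, coeffs)
signal_energy = sum(s * s for s in frame)
residual_energy = sum(e * e for e in residual)
print("LP coefficients:", [round(c, 3) for c in coeffs])
print("residual/signal energy ratio:", round(residual_energy / signal_energy, 3))
```

On this synthetic frame the residual is dominated by the periodic impulses, which is the kind of excitation structure the paper models with HMMs. Real speech would typically call for a higher LP order and short overlapping frames.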

Index Terms

Computer Science

Information Sciences

Keywords

Gender, Hidden Markov Model (HMM), LPC, MFCC