CFP last date
20 May 2024
Reseach Article

A Comparative Study of Phoneme Recognition using GMM-HMM and ANN based Acoustic Modeling

by Farheen Fauziya, Geeta Nijhawan
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 98 - Number 6
Year of Publication: 2014
Authors: Farheen Fauziya, Geeta Nijhawan
10.5120/17186-7366

Farheen Fauziya, Geeta Nijhawan . A Comparative Study of Phoneme Recognition using GMM-HMM and ANN based Acoustic Modeling. International Journal of Computer Applications. 98, 6 ( July 2014), 12-16. DOI=10.5120/17186-7366

@article{ 10.5120/17186-7366,
author = { Farheen Fauziya, Geeta Nijhawan },
title = { A Comparative Study of Phoneme Recognition using GMM-HMM and ANN based Acoustic Modeling },
journal = { International Journal of Computer Applications },
issue_date = { July 2014 },
volume = { 98 },
number = { 6 },
month = { July },
year = { 2014 },
issn = { 0975-8887 },
pages = { 12-16 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume98/number6/17186-7366/ },
doi = { 10.5120/17186-7366 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:25:29.756890+05:30
%A Farheen Fauziya
%A Geeta Nijhawan
%T A Comparative Study of Phoneme Recognition using GMM-HMM and ANN based Acoustic Modeling
%J International Journal of Computer Applications
%@ 0975-8887
%V 98
%N 6
%P 12-16
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Phoneme is the smallest analogous unit of sound employed to form meaningful contrast between utterances. Hidden Markov Model (HMM), Gaussian Mixture model (GMM) and Artificial Neural Network (ANN) have been used in this paper to measure the accuracy and performance of recognition system using toolkits HTK, Sphinx3 and Quicknet, which are freely available for academic works. In this paper the performance of an ASR System based on Accuracy has been compared with TIMIT database.

References
  1. S. B. Davis and P Mermelstein. "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Transactions Acoustics Speech and Signal Processing", 28:357–366, 1980.
  2. Deller J. R. , Hansen J. H. L. & Proakis J. G. : Discrete-Time Processing of Speech Signals. IEEE Press, 2000
  3. H. Bourlard and C. J. Wellekens, "Links between Markov models and multilayer Perceptron" IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 12, no. 12, pp. 1167– 1178,1990. Forman, G. 2003.
  4. L. Rabiner. A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of IEEE, 1989
  5. Gold, B. , Morgan, N. : Speech and Audio Signal Processing: Processing and Perception of Speech and Music. John Wiley, New York (2000)
  6. Steven,S. Volkmann,J. and Newmann,E. , "A scale for Measurement of Psychological Magnitude Pitch. " Journal of the Acoustical Society of America 8:185-190,1935
  7. Utpal Bhattacharjee, "A comparative study of LPCC and MFCC feature for the recognition of Assamese phonemes"IJERT,ISSN:2278-0181,Vol. 2 issue 1,January-2013
  8. Anant G. Veeravalli,W. D. pan,Reza Adhami,paul G Cox,"Phoneme recognition using Hidden Markov Model,Huntsville Simulation conference,October 2004
  9. International computer science Institute, Speech Group, http://www. icsi. berkeley. edu/groups/speech,
  10. An introduction to Hidden Makov Model,"L. R. Rabiner & B. H. Juang"IEEE ASSP MAGAZINE JANUARY 1986
  11. S. Young, G. Evermann, T. Hain, D. Kershaw, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, P. Woodland. The HTK Book. Revised for HTK Version 3. 2 Dec. 2002. http://htk. eng. cam. ac. uk/
  12. Phil Blunsom, "Hidden Markov Models", August 19, 2004
  13. P Zolfaghari and A. J. Robinson. Formant analysis using mixtures of Gaussians. In ProceeSLP, pages 904-907,1996
  14. Anjan Basu and Torbjgrn Svendsen ,"A Time-Frequency Segmental Neural Networks for Phoneme Recognition "Acoustic Speech and Signal processing, IEEE 1993.
  15. Zaihu PANG et al. Discriminative training of GMM-HMM acoustic model by RPCL learning, Front. Electr. Electron. Eng. China 2011, 6(2): 283–290
Index Terms

Computer Science
Information Sciences

Keywords

Automatic Speech Recognition MFCC Hidden Markov Model