Auditory Scale Analysis and Evaluation of Phonemes in MISING Language

International Journal of Computer Applications
© 2015 by IJCA Journal
Volume 113 - Number 15
Year of Publication: 2015
Authors:
Rizwan Rehman
Gopal Chandra Hazarika
D. Kardong
DOI: 10.5120/19899-2001

Rizwan Rehman, Gopal Chandra Hazarika and D. Kardong. Auditory Scale Analysis and Evaluation of Phonemes in MISING Language. International Journal of Computer Applications 113(15):1-5, March 2015. Full text available. BibTeX

@article{key:article,
	author = {Rizwan Rehman and Gopal Chandra Hazarika and D. Kardong},
	title = {Auditory Scale Analysis and Evaluation of Phonemes in MISING Language},
	journal = {International Journal of Computer Applications},
	year = {2015},
	volume = {113},
	number = {15},
	pages = {1-5},
	month = {March},
	note = {Full text available}
}

Abstract

Frequency analysis is one of the important functions of the peripheral auditory system. Psychoacoustically, this gives rise to the concept of the critical band, which represents the frequency resolution of the auditory system. Mel-scale warping is one of the common techniques used for analysis in speech recognition. The Bark and ERB (Equivalent Rectangular Bandwidth) rate scales are two other auditory scales whose performance is comparable to that of the Mel scale. In this paper, the acoustic features generated using filter banks with the Mel, Bark and ERB scales are investigated and analyzed with respect to the phonemes of the MISING language.
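
As an illustration of the three auditory scales named above, the following minimal Python sketch converts between frequency in Hz and each scale, and places filter-bank center frequencies at equal steps on the warped axis. The abstract does not state which formulas the paper uses, so the choices here are assumptions: the common 2595 log10(1 + f/700) form for Mel, Traunmüller's 1990 approximation for Bark, and the Glasberg and Moore 1990 expression for the ERB rate. The function names, the `SCALES` table, and the `center_frequencies` helper are hypothetical and are not taken from the paper.

```python
import math

# Assumed formula: widely used Mel approximation (not confirmed by the paper).
def hz_to_mel(f_hz):
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

# Assumed formula: Traunmüller (1990) approximation of the Bark scale.
def hz_to_bark(f_hz):
    return 26.81 * f_hz / (1960.0 + f_hz) - 0.53

def bark_to_hz(z):
    # Algebraic inverse of hz_to_bark; valid for z < 26.28.
    return 1960.0 * (z + 0.53) / (26.28 - z)

# Assumed formula: Glasberg and Moore (1990) ERB-rate scale.
def hz_to_erb_rate(f_hz):
    return 21.4 * math.log10(1.0 + 0.00437 * f_hz)

def erb_rate_to_hz(e):
    return (10.0 ** (e / 21.4) - 1.0) / 0.00437

# Hypothetical lookup table: scale name -> (forward, inverse) conversion.
SCALES = {
    "mel": (hz_to_mel, mel_to_hz),
    "bark": (hz_to_bark, bark_to_hz),
    "erb": (hz_to_erb_rate, erb_rate_to_hz),
}

def center_frequencies(scale, n_filters, f_min_hz, f_max_hz):
    """Return n_filters center frequencies (Hz), equally spaced on the
    chosen auditory scale strictly between f_min_hz and f_max_hz."""
    forward, inverse = SCALES[scale]
    lo, hi = forward(f_min_hz), forward(f_max_hz)
    step = (hi - lo) / (n_filters + 1)
    return [inverse(lo + step * (i + 1)) for i in range(n_filters)]

if __name__ == "__main__":
    # Example values only: 10 filters between 100 Hz and 4000 Hz.
    for name in SCALES:
        centers = center_frequencies(name, 10, 100.0, 4000.0)
        print(name, [round(c, 1) for c in centers])
```

Spacing the centers uniformly on the warped axis, rather than in Hz, is what gives each filter bank finer resolution at low frequencies; the three scales differ only in how strongly they compress the upper part of the spectrum.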
