Research Article

Search Key Identification in a Spoken Query using Isolated Keyword Recognition

by Utpal Bhattacharjee
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 5 - Number 8
Year of Publication: 2010
Authors: Utpal Bhattacharjee
10.5120/933-1310

Utpal Bhattacharjee. Search Key Identification in a Spoken Query using Isolated Keyword Recognition. International Journal of Computer Applications. 5, 8 (August 2010), 14-21. DOI=10.5120/933-1310

@article{ 10.5120/933-1310,
author = { Utpal Bhattacharjee },
title = { Search Key Identification in a Spoken Query using Isolated Keyword Recognition },
journal = { International Journal of Computer Applications },
issue_date = { August 2010 },
volume = { 5 },
number = { 8 },
month = { August },
year = { 2010 },
issn = { 0975-8887 },
pages = { 14-21 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume5/number8/933-1310/ },
doi = { 10.5120/933-1310 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T19:53:42.643606+05:30
%A Utpal Bhattacharjee
%T Search Key Identification in a Spoken Query using Isolated Keyword Recognition
%J International Journal of Computer Applications
%@ 0975-8887
%V 5
%N 8
%P 14-21
%D 2010
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This article presents a novel technique for recognizing isolated keywords in spoken search queries, which may be considered the first step towards a speech-operated, keyword-based search technique. A database of 300 spoken search queries in Assamese, a major Indian language spoken mostly by the people of north-east India, was created, and the system developed during the study was tested and evaluated on this database. Mel Frequency Cepstral Coefficients (MFCCs) are used as the feature vector, and a Multilayer Perceptron (MLP) is used both to identify the phoneme boundaries and to recognize the phonemes. A Viterbi search then identifies the keywords from the sequence of phonemes generated by the phoneme recognizer. The study achieved a recognition accuracy of 74.67%.
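The last stage of the pipeline described in the abstract, picking a keyword from phoneme-level evidence with a Viterbi search, can be illustrated with a minimal sketch. It is not the author's implementation: the left-to-right alignment model, the function names `viterbi_keyword_score` and `recognize_keyword`, the two-word lexicon, and the per-frame probabilities (standing in for MLP outputs) are all assumptions made up for illustration.

```python
import math

def viterbi_keyword_score(frame_logp, phonemes):
    """Best log-score of aligning frames to a left-to-right phoneme sequence.

    frame_logp: list of dicts, one per frame, mapping phoneme -> log-probability
                (a stand-in for phoneme recognizer outputs; assumed format).
    phonemes:   phoneme sequence of one keyword (hypothetical lexicon entry).
    At each frame the path either stays in the current phoneme or advances to
    the next one; it must start in the first phoneme and end in the last.
    """
    neg_inf = float("-inf")
    if not frame_logp or not phonemes:
        return neg_inf
    n = len(phonemes)
    # score[j] = best log-score of any path that is in phoneme j at this frame
    score = [neg_inf] * n
    score[0] = frame_logp[0].get(phonemes[0], neg_inf)
    for frame in frame_logp[1:]:
        new_score = [neg_inf] * n
        for j, ph in enumerate(phonemes):
            stay = score[j]
            advance = score[j - 1] if j > 0 else neg_inf
            best_prev = max(stay, advance)
            if best_prev > neg_inf:
                new_score[j] = best_prev + frame.get(ph, neg_inf)
        score = new_score
    return score[-1]

def recognize_keyword(frame_logp, lexicon):
    """Return the lexicon word with the highest Viterbi score, plus all scores."""
    scores = {word: viterbi_keyword_score(frame_logp, seq)
              for word, seq in lexicon.items()}
    return max(scores, key=scores.get), scores

# Toy data, invented for illustration only (not taken from the paper).
lexicon = {"kam": ["k", "a", "m"], "mak": ["m", "a", "k"]}
frame_probs = [
    {"k": 0.8, "a": 0.1, "m": 0.1},
    {"k": 0.6, "a": 0.3, "m": 0.1},
    {"k": 0.1, "a": 0.8, "m": 0.1},
    {"k": 0.1, "a": 0.2, "m": 0.7},
    {"k": 0.1, "a": 0.1, "m": 0.8},
]
frames = [{ph: math.log(p) for ph, p in f.items()} for f in frame_probs]

best, scores = recognize_keyword(frames, lexicon)
print(best)
```

Working in log-probabilities keeps the scores numerically stable for long utterances. A real system would also need a way to reject utterances that match no keyword, which this sketch omits.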

Index Terms

Computer Science
Information Sciences

Keywords

Query Identification
Phoneme Segmentation
Multilayer Perceptron
Viterbi Search