CFP last date
20 June 2024
Reseach Article

Speech Recognition System for Windows Commands

Published on May 2013 by Sumit Patel, Amit Bramhecha, Santosh Mahale, Anant Maind, Mahesh Sanghavi
International Conference on Recent Trends in Engineering and Technology 2013
Foundation of Computer Science USA
ICRTET - Number 5
May 2013
Authors: Sumit Patel, Amit Bramhecha, Santosh Mahale, Anant Maind, Mahesh Sanghavi
5fca76ad-c0b1-48b3-8624-da0d749b6fa8

Sumit Patel, Amit Bramhecha, Santosh Mahale, Anant Maind, Mahesh Sanghavi . Speech Recognition System for Windows Commands. International Conference on Recent Trends in Engineering and Technology 2013. ICRTET, 5 (May 2013), 26-30.

@article{
author = { Sumit Patel, Amit Bramhecha, Santosh Mahale, Anant Maind, Mahesh Sanghavi },
title = { Speech Recognition System for Windows Commands },
journal = { International Conference on Recent Trends in Engineering and Technology 2013 },
issue_date = { May 2013 },
volume = { ICRTET },
number = { 5 },
month = { May },
year = { 2013 },
issn = 0975-8887,
pages = { 26-30 },
numpages = 5,
url = { /proceedings/icrtet/number5/11795-1360/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference on Recent Trends in Engineering and Technology 2013
%A Sumit Patel
%A Amit Bramhecha
%A Santosh Mahale
%A Anant Maind
%A Mahesh Sanghavi
%T Speech Recognition System for Windows Commands
%J International Conference on Recent Trends in Engineering and Technology 2013
%@ 0975-8887
%V ICRTET
%N 5
%P 26-30
%D 2013
%I International Journal of Computer Applications
Abstract

To develop a system to recognize system commands through voice and convert it into equivalent text, the system accepts voice commands from user and displays its equivalent text. The system accepts voice commands, performs processing on it to recognize the actual command before displaying the corresponding output. For this particular system processing being done are noise removal, feature extraction and pattern matching. Various features are available. These are totally application dependent i. e. for a particular application particular feature is being extracted. Hence performing this various processing, text format of equivalent voice command is being displayed. To accept the voice commands User Must use a good quality microphone. The voice commands is being recorded and saved as a . wav file. Wav file is being used because it stores the data in the digital form. Initially the features of each command would be saved in a file. Once the 'init' is recognized the system will then wait for the users commands. On getting a command the system will save the input as a . wav file. The features of this command are then matched against the predefined command features. If the match is found the command is a valid one. It then displays its text form. If the command is not valid it simply discards it.

References
  1. Markku Turunen and Jaakko Hakulinen, Design and Development of Speech Interfaces Course Material http://www. cs. uta. fi/hci/spi/ddsi/
  2. Pinker, S. , (1994), the Language Instinct, Harper Collins, New York City, New York, USA.
  3. Deshmukh, N. , Ganapathiraju, A, Picone J. , (1999), Hierarchical Search for Large Vocabulary Conversational Speech Recognition. IEEE Signal Processing Magazine, 1(5):84-107.
  4. Zue, V. , Cole, R. , Ward, W. (1996). Speech Recognition. Survey of the State Of the Art in Human Language Technology. Kauii, Hawaii, USA.
  5. Dix, A. J. , Finlay. Abowd, G. , Beale, R. (1998). Human-Computer Interaction, 2nd edition, Prentice Hall, Englewood Cliffs, NJ, USA.
  6. Rudnicky, A. I. , Lee, K. F. , and Hauptmann, A. G. (1992) Survey of current Speech Technology. Communications of the ACM, 37(3):52-57.
  7. Picheny, M. , (2002). Large vocabulary speech Recognition, 3 5(4):42-50.
  8. Rabiner, L. , R. , and Wilpon, J. G. , (1979). Considerations In applying clustering Techniques to speaker-independent word recognition. Journal of Acoustic Society of America. 66(3):663-673.
  9. Kumar, M. Rajput, N. Verma, A . (2006) IBM Journal of Research and Development, 0018-8646,10. 1147/rd. 485. 0703,Sponsored by: IBM
  10. De Mori, Renato, Lam, Lily, Gilloux, Michel. (1987) Pattern Issue, 0162- 8828, 10. 1109/TPAMI. 1987. 4767902, IEEE Computer Society
  11. Bahl, Lalit R, Jelinek, Frederick, Mercer, Robert L, (2000), IBM T. J. Watson Research Center, Yorktown Heights, NY 10598. PAMI-5 Issue: 2 , IEEE Computer Society
  12. Liu, Y. Jones, H. Vaidya, S. Perrone, (2009). http://research. microsoft. com/pubs/80528/SPM-MINDS-I. pdf
  13. M. Tydlitat, B. Nanda, A. K. (2010), IBM Journal of Research and Development, Issue: 5, 0018-8646, 1147/rd. 515. 0583.
  14. Mengjie, Z. , (2001) Overview of speech Recognition and related machine Learning techniques, Technical report. Retrieved December 10, 2004 from http://www. mcs. vuw. ac. nz/comp/Publications/archive/CS-TR-01/CS-TR- 01-15. Pdf
  15. "Research Developments and Directions in Speech Recognition and Understanding, Part 1" , (2009). http://research. microsoft. com/pubs/80528/SPM- MINDS-I. pdf
  16. Speech Recognition Technologies, (John Kirriemuir, 2003 ). http://www. ceangal. com/
  17. Speech Recognition – Wikipedia http://en. wikipedia. org/wiki/Speech_recognition
  18. Voice Recognition Technology http://cobweb. ecn. purdue. edu/~tanchoco/MHE/ADC-is/Voice/main. shtml
  19. http://www. opendl. net/solutions/recognition. aspx
  20. Casey Brains http://www. scribd. com/doc/6901516/ugSpeechSpeech
  21. Wolfgang Wahlster, Verbmobil: Foundations of Speech-To-SpeechTranslation http://books. google. com/books?hl=en&lr=&id=RiT0aAzeudkC&oi=fnd&pg=PR5&dq=Verbmobil:+Foundations+of+Speech-ToSpeech+Translation&ots=jBhMwQ0HnT&sig=zx2EWMK4n-lYhG9k5gKU2zGieE#PPP1,M1
  22. Roni Rosenfeld, Alexander Rudnicky, Stefanie Tomko, Thomas Harris, Universal Speech Interface project http://www. cs. cmu. edu/~usi/
  23. Wikipedia the Free Encyclopedia – Talkman http://en. wikipedia. org/wiki/Talkman
  24. Talking Windows http://msdn. microsoft. com/da-dk/magazine/cc163663(en-us ,printer). aspx
  25. IBM Research, IBM Text-to-Speech Research http://www. research. ibm. com/tts/
  26. Microsoft Corporation, Windows Speech Recognition http://www. microsoft. com/windows/products/windowsvista/features/details/speechrecognition. mspx
  27. Nuance Communications, Inc. , Nuance – Open Speech Recognizer http://www. nuance. com/recognizer/openspeechrecognizer/
  28. Carnegie Mellon University, Sphinx-4 A Speech Recognizer Written http://cmusphinx. sourceforge. net/sphinx4/
Index Terms

Computer Science
Information Sciences

Keywords

Recognize Feature Extraction Pattern Matching Noise Removal