CFP last date
22 April 2024
Reseach Article

Mediscript Mobile Cloud Collabrative Speech Recognition Framework

Published on June 2013 by T.senthil Kumar, Vishnu Gajendran, Rajamanuri Harshad, Sneha Aswani, Deepti Sankara Narayanan
International Conference on Innovation in Communication, Information and Computing 2013
Foundation of Computer Science USA
ICICIC2013 - Number 3
June 2013
Authors: T.senthil Kumar, Vishnu Gajendran, Rajamanuri Harshad, Sneha Aswani, Deepti Sankara Narayanan
d04338fb-c209-417a-ab23-6e2421a2f3ef

T.senthil Kumar, Vishnu Gajendran, Rajamanuri Harshad, Sneha Aswani, Deepti Sankara Narayanan . Mediscript Mobile Cloud Collabrative Speech Recognition Framework. International Conference on Innovation in Communication, Information and Computing 2013. ICICIC2013, 3 (June 2013), 36-45.

@article{
author = { T.senthil Kumar, Vishnu Gajendran, Rajamanuri Harshad, Sneha Aswani, Deepti Sankara Narayanan },
title = { Mediscript Mobile Cloud Collabrative Speech Recognition Framework },
journal = { International Conference on Innovation in Communication, Information and Computing 2013 },
issue_date = { June 2013 },
volume = { ICICIC2013 },
number = { 3 },
month = { June },
year = { 2013 },
issn = 0975-8887,
pages = { 36-45 },
numpages = 10,
url = { /proceedings/icicic2013/number3/12278-0161/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference on Innovation in Communication, Information and Computing 2013
%A T.senthil Kumar
%A Vishnu Gajendran
%A Rajamanuri Harshad
%A Sneha Aswani
%A Deepti Sankara Narayanan
%T Mediscript Mobile Cloud Collabrative Speech Recognition Framework
%J International Conference on Innovation in Communication, Information and Computing 2013
%@ 0975-8887
%V ICICIC2013
%N 3
%P 36-45
%D 2013
%I International Journal of Computer Applications
Abstract

Speech recognition is a vital part in medical transciription. The existing speech recognition systems, that run as standalone desktop applications, fall short in many cases due to low accuracy rates and high processing time. The bottleneck in these systems, is the lack of computation power (in terms of processing power and memory) made accessible to them. This paper proposes a mobile-cloud collaborative approach for the automation of speech to text conversion. The model proposed leverages the power of cloud computing and the ubiquitous nature of mobile computing. Computing resthisces can be scaled up/down in the cloud (Elastic Computing) depending on the usage of the system. This kind of speech recognition framework has many real time applications such as IVR systems, Medical Transcription systems, Railway Enquiries, Jthisnalism, Interactive User Interfaces, etc. A generic framework is advantageous, because the speech models in the Automatic Speech Recognizer (ASR) could be trained according to the specific domain required, allowing wide usability. The proposed speech framework is used for medical transcription process. Medical transcription process involves a medical transcriptionist who listens to the recorded speech of a doctor and manually types a transcript file. This process is automated by using the proposed speech framework. With this system, the work of the medical transcriptionist is reduced to error checking in the auto generated transcript file. The entire model is developed for a mobile cloud environment considering the characteristics of cloud delivery models.

References
  1. Kwang Mong Sim," Agent-Based Cloud Computing" IEEE Transactions on Services Computing, Vol. 5,pp. 564 - 577,2012.
  2. David Chiu,Gagan N. Agrawal,"Evaluating caching and storage options on the Amazon Web Services Cloud" ,IEEE/ACM International Conference on Grid Computing, pp. 17 - 24,2010.
  3. V. "Juggy" Jagannathan. , "The Careflow Architecture-A Case Study in Medical Transcription",IEEE Internet Computing,pp. 59-64,2001.
  4. Bernd Grobauer,Tobias Walloschek,Elmar Stöcker, "Understanding Cloud Computing Vulnerabilities", IEEE Security & Privacy,Vol. 9,pp. 50 - 57,2011.
  5. Alexandru Iosup,Simon Ostermann,Nezih Nezih Yigitbasi,Radu Prodan,Thomas Fahringer,Dick H J Epema,"Performance Analysis of Cloud Computing Services for Many-Tasks Scientific Computing",IEEE Transactions on Parallel and Distributed Systems, Vol. 22,pp. 931 - 945,2011.
  6. . A. Alshuwaier,A. A. Alshwaier,A. M. Areshey,"Applications of cloud computing in education",8th International Conference on Computing and Networking Technology (ICCNT),pp. 26 - 33,2012.
  7. Arshdeep Bahga,Vijay K. Madisetti, "Analyzing Massive Machine Maintenance Data in a Computing Cloud",IEEE Transactions on Parallel and Distributed Systems, Vol. 23,pp. 1831 - 1843,2012.
  8. N. R. R. Mohan,E. B. Raj,"Resthisce Allocation Techniques in Cloud Computing -- Research Challenges for Applications", Fthisth International Conference on Computational Intelligence and Communication Networks (CICN),pp. 556 - 560,2012.
  9. Khaled M. Khan,Qutaibah M. Malluhi,"Establishing Trust in Cloud Computing",IEEE,Vol. 12, pp. 20 - 27,2010
  10. Jorge Martins,João Pereira,Sergio M. Fernandes,João Cachopo, "Towards a Simple Programming Model in Cloud Computing Platforms",First International Symposium on Network Cloud Computing and Applications (NCCA),pp. 83 - 90,2011.
  11. Lori M. Kaufman,"Data Security in the World of Cloud Computing",IEEE Security & Privacy, Vol. 7 ,pp. 61 - 64,2009.
  12. . Abu S. Asaduzzaman,Abilash Rao Joseph,Fadi N. Sibai,Nader Mohamed, " Cloud computing: A cloudy future?",International Conference on Innovations in Information Technology (IIT), pp. 78 - 82,2012
  13. Qian Wang,Cong Wang, Kui Ren,Wenjing Lou,Jin Li, "Enabling Public Auditability and Data Dynamics for Storage Security in Cloud Computing",IEEE Transactions on Parallel and Distributed Systems, Vol. 22,pp. 847 - 859,2011
  14. Mohamed Hamdi,"Security of cloud computing, storage, and networking",International Conference on Collaboration Technologies and Systems (CTS), pp. 1-5,2012
  15. H. Gilbert Miller,John Veiga,"Cloud Computing: Will Commodity Services Benefit Users Long Term?",IT Professional,Vol. 11,pp. 57-59,2009
  16. Sven Anderson,Diane Kewley-Port, "Evaluation of speech recognizers for speech training applications",IEEE Transactions on Speech and Audio Processing, Vol. 3,pp. 229 - 241,1995.
  17. Liang Gu,Yuqing Gao,Fu-Hua Liu,Michael A. Picheny, "Concept-based speech-to-speech translation using maximum entropy models for statistical natural concept generation",IEEE Transactions on Audio, Speech, and Language Processing, Vol. 14,pp. 377 - 392,2006.
  18. Tomohiro Nakatani, ;Büng-Hwang Hwang Fred Juang,Takuya Yoshioka,Keisuke Kinoshita,Marc Delcroix,Masato Miyoshi, "Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Sthisce Model",IEEE Transactions on Audio, Speech, and Language Processing, Vol. 16 ,pp. 1512 - 1527,2008
  19. Ji Ming,Ramji Srinivasan,Danny Crookes, "A Corpus-Based Approach to Speech Enhancement From Nonstationary Noise",IEEE Transactions on Audio, Speech, and Language Processing, Vol. 19,pp. 822 - 836,2011
  20. Jont B. Allen,"How do humans process and recognize speech?",IEEE Transactions on Speech and Audio Processing,Vol. 2 , pp. 567 - 577,1994
  21. John H L Hansen,Brian David Womack, "Feature analysis and neural network based classification of speech under stress",IEEE Transactions on Speech and Audio Processing, Vol. 4,pp. 307 - 313,1996.
  22. John H L Hansen,Vaishnevi Varadarajan, "Analysis and Compensation of Lombard Speech Across Noise Type and Levels With Application to In-Set/Out-of-Set Speaker Recognition",IEEE Transactions on Audio,Speech and Language Processing,Volume: 17,pp. 366 - 378,2009.
  23. Jiaohua Hua Tao,Yongguo Kang,Ai-Jun Li,"Prosody conversion from neutral speech to emotional speech",IEEE Transactions on Audio, Speech, and Language Processing, Vol. 14,pp. 1145 - 1154,2006.
  24. Ki-Seung Lee,Richard V. Cox,"A very low bit rate speech coder based on a recognition/synthesis paradigm",IEEE Transactions on Speech and Audio Processing,Vol. 9,pp. 482 - 491,2011.
  25. Sadaoki Furui,Tomonori Kikuchi,Yousuke Shinnaka,Chiori Hori, "Speech-to-text and speech-to-speech summarization of spontaneous speech",IEEE Transactions on Speech and Audio Processing, Vol. 12,pp. 401 - 408,2004.
  26. E. Bryan George,Mark J T Smith, "Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model",IEEE Transactions on Speech and Audio Processing,Vol. 5,pp. 389 - 406,1997.
  27. Sorin Dusan, James L. Flanagan,Amod Karve,Mridul Balaraman,"Speech Compression by Polynomial Approximation",IEEE Transactions on Audio, Speech, and Language Processing, Vol. 15 ,pp. 387 - 395,2007.
  28. M. S. Hawley,S. P. Cunningham,P. D. Green, P. Enderby,R. Palmer,S. Sehgal , P. O'Neill,"A Voice-Input Voice-Output Communication Aid for People With Severe Speech Impairment",IEEE Transactions on Neural Systems and Rehabilitation Engineering, Vol. 21,pp. 23 - 31,2013.
  29. Jiaohua Hua Tao,Yongguo Kang,Ai-Jun Li, "Prosody conversion from neutral speech to emotional speech",IEEE Transactions on Audio, Speech, and Language Processing, Vol. 14 ,pp. 1145 - 1154,2006.
  30. . BrianHayes,"CloudComputing",Communications of the ACM - Web science Magazine Vol. 51,pp. 9-11,2008 .
Index Terms

Computer Science
Information Sciences

Keywords

Framework Cloud Buckets Model Speech