Speech Recognition System to Leverage the Accuracy of Training Sample using Optimized Matching Window

Gunjan Thakur; Anudeep Goraya

Call for Paper

June Edition

IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper

Know more

The week's pick

Enhancing Privacy Preservation: Multi-Attribute Protection with P-Sensitive K-Anonymity

Twinkle Patel Kiran Amin

Random Articles

An Efficient Hybrid Parallel Prefix Adders for Reverse Converters using QCA Technology

Nov

2016

Computerized Preventive Maintenance Management System (CPMMS) for Haematology Department Equipments

January

2015

Security Enhancement in Cloud Storage using ARIA and Elgamal Algorithms

Aug

2017

EARRA: Enhanced Adaptive Rate Response Adjustment Technique for Congestion Control in Networks

Jun

2017

Reseach Article

Speech Recognition System to Leverage the Accuracy of Training Sample using Optimized Matching Window

by Gunjan Thakur, Anudeep Goraya

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 124 - Number 15

Year of Publication: 2015

Authors: Gunjan Thakur, Anudeep Goraya

10.5120/ijca2015905670

Gunjan Thakur, Anudeep Goraya . Speech Recognition System to Leverage the Accuracy of Training Sample using Optimized Matching Window. International Journal of Computer Applications. 124, 15 ( August 2015), 23-28. DOI=10.5120/ijca2015905670

@article{ 10.5120/ijca2015905670,

author = { Gunjan Thakur, Anudeep Goraya },

title = { Speech Recognition System to Leverage the Accuracy of Training Sample using Optimized Matching Window },

journal = { International Journal of Computer Applications },

issue_date = { August 2015 },

volume = { 124 },

number = { 15 },

month = { August },

year = { 2015 },

issn = { 0975-8887 },

pages = { 23-28 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume124/number15/22181-2015905670/ },

doi = { 10.5120/ijca2015905670 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T23:14:30.596728+05:30

%A Gunjan Thakur

%A Anudeep Goraya

%T Speech Recognition System to Leverage the Accuracy of Training Sample using Optimized Matching Window

%J International Journal of Computer Applications

%@ 0975-8887

%V 124

%N 15

%P 23-28

%D 2015

%I Foundation of Computer Science (FCS), NY, USA

Abstract

In this voice recognition system is to recognize the voice samples spoken by human and recognize over the system. In this select the most commanly used features of the voice samples with the help of MFCC(Mel frequency cofficient ceptrum) and that feature match with the real time voice sample features using DTW(Dynamic time wrapping) and it is accepted by the system.

References

Yogesh Kumar Sen, R. K. Chaurasiya. IEEE International Conference on voice rcoginition-june2014,24:58-95.
Daubechies, I. The wavelet transform, time-frequency localization andsignal analysis. IEEETransformation and Information Theory.2014,36: 961-1005.
Hasan Serhan Yavuz, Hakan Çevikalp .A wavelet Tour of Signal ProcessingIEEE International Conference on signal processing june 2014,34:19-445.
Tiecheng Yu. The Development State of the Voice Identification. The Development communication world.2005,2:56-59.
Dian RetnoAnggraini .The development of a voice recognition system based on Principal Component Analysis(PCA) and unsupervised learning algorithm.2012,4:35-58.
Jiqing Han, Lei Zhang, Tieran Zheng. Voice Signals Processing[M].Beijing: Tsinghua University Press 2004,3:67-94.
Remzi Serdar Kurcan, “Isolated word recognition from in-ear microphone data using hidden markov models (hmm)”, Master’s Thesis, 2006.
Nikolai Shokhirev ,”Hidden Markov Models “, 2010.
L.R. Rabiner, “A tutorial on Hidden Markov Models and selected applications in Speech Recognition”, Proceedings of the IEEE Journal, Feb 1989, Vol 77, Issue: 2.
Suma Swamy, Manasa S, Mani Sharma, Nithya A.S, Roopa K.S and K.V Ramakrishnan, “An Improved Speech Recognition System”, LNICST Springer Journal, 2013.
Lindasalwa Muda, Mumtaj Begam and Elamvazuthi.,”Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and DTW Techniques “,Journal of Computing, Volume 2, Issue 3, March 2010.
Mahdi Shaneh and Azizollah Taheri ,”Voice Command Recognition System based on MFCC and VQAlgorithms”, World Academy of Science, Engineering and Technology Journal , 2009.
Remzi Serdar Kurcan, “Isolated word recognition from in-ear microphone data using hidden markovmodels (hmm)”, Master’s Thesis, 2006.
Nikolai Shokhirev ,”Hidden Markov Models “, 2010.
L.R. Rabiner, “A tutorial on Hidden Markov Models and selected applications in Speech Recognition”, Proceedings of the IEEE Journal, Feb 1989, Vol 77, Issue: 2.
Suma Swamy, Manasa S, Mani Sharma, Nithya A.S, Roopa K.S and K.V Ramakrishnan, “An Improved Speech Recognition System”, LNICST Springer Journal, 2013.
M. L. Shire and B. Y. Chen, “Data-driven RASTA filters in reverberation,”in Proc. ICASSP’00, 2000, vol. 3, pp. 1627–1630.
T. Takiguchi and Y. Ariki, “Robust feature extraction using kernel PCA,” in Proc. ICASSP’06, 2006, pp. 509–512.
S. Furui, “Speaker-independent isolated word recognition using dynamic features of speech spectrum,” IEEE Trans. Acoust., Speech,Signal Process., vol. ASSP-34, no. 1, pp. 52–59, Feb. 1986.
O. Ichikawa, T. Fukuda, R. Tachibana, and M. Nishimura, “Dynamic features in the linear domain for robust automatic speech recognition in a reverberant environment,” in Proc. Interspeech’09, 2009, pp. 44–47.
M. Nakayama et al., “CENSREC-4: Development of evaluation framework for distant-talking speech recognition under reverberant environments,” in Proc. Interspeech’08, 2008, pp. 968–971.
T. Nishiura et al., “Evaluation framework for distant-talking speech recognition under reverberant environments—Newest part of the CENSREC series-,” in Proc. LREC ’08, 2008.

Index Terms

Computer Science

Information Sciences

Keywords

Dynamic Time Wrapping (DTW) Mel Frequency Cepstral Coefficient (MFCC) Voice recognition.