Analysis of Various Features using Different Temporal Derivatives from Speech Signals

Muskan; Naveen Aggarwal

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 20 July 2026

Submit your paper

Know more

The week's pick

ReLeaf: A MobileNetV2-Based Mobile Application for Real-Time Waste Classification with LLM-Assisted Recycling Guidance

Fatimah H. Alyami Nadeen N. Abduljabbar Ghadi T. Alzahrani Dana B. Alakeel Amal S. Almirsal Atheer S. Algherairy

Random Articles

Transmit Power Minimization using Fuzzy Rule based System in Relay Assisted Cognitive Radio Networks

November

2015

An Optimized Classifier Frame Work based on Rough Set and Random Tree

Feb

2017

An Intelligent approach to enhance the help messages for a compiler - An expert system

February

2010

Advanced Algorithm for Detection and Prevention of Cooperative Black and Gray Hole Attacks in Mobile Ad Hoc Networks

February

2010

Reseach Article

Analysis of Various Features using Different Temporal Derivatives from Speech Signals

by Muskan, Naveen Aggarwal

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 118 - Number 8

Year of Publication: 2015

Authors: Muskan, Naveen Aggarwal

10.5120/20762-3191

Muskan, Naveen Aggarwal . Analysis of Various Features using Different Temporal Derivatives from Speech Signals. International Journal of Computer Applications. 118, 8 ( May 2015), 1-9. DOI=10.5120/20762-3191

@article{ 10.5120/20762-3191,

author = { Muskan, Naveen Aggarwal },

title = { Analysis of Various Features using Different Temporal Derivatives from Speech Signals },

journal = { International Journal of Computer Applications },

issue_date = { May 2015 },

volume = { 118 },

number = { 8 },

month = { May },

year = { 2015 },

issn = { 0975-8887 },

pages = { 1-9 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume118/number8/20762-3191/ },

doi = { 10.5120/20762-3191 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T23:01:06.255976+05:30

%A Muskan

%A Naveen Aggarwal

%T Analysis of Various Features using Different Temporal Derivatives from Speech Signals

%J International Journal of Computer Applications

%@ 0975-8887

%V 118

%N 8

%P 1-9

%D 2015

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Speech recognition being an upcoming field is evaluated and research is being done for the same. Research in speech recognition for different languages is at peak. Less amount of work has been done for Indian languages particularly for Punjabi language. In this paper, Punjabi speech has been analyzed by extracting various features along with different temporal derivatives using feature extraction techniques. The dataset which has been considered for the research work is the set of Punjabi isolated digit recorded as 24 bit 44100 Hz mono PCM signal. Comparison of range and accuracy for acceptable results has been determined using HMM.

References

L. Rabiner and R. Schafer, "Introduction to digital speech processing", Foundations and Trends in Signal Processing, Journal of ACM vol. 1, no. 1-2, pp. 1–194, 2007.
L. Rabiner and B. H. Jaung, Fundamentals of Speech Recognition, Englewood Cliffs, NJ: Prentice-Hall, 1993.
X. Huang, J. Baker and R Reddy, "A Historical Perspective of Speech Recognition", Communications of the ACM, vol. 57, no. 1, January 2014.
K. H. Davis, R. Biddulph and S. Balashek, "Automatic recognition of spoken digits," J. A. S. A. , vol. 24, no. 6, pp. 637-642, 1952.
S. C. Sajjan and C. Vijaya, "Comparison of DTW and HMM for isolated word recognition", Proceedings of International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME), IEEE, pp. 466-470, 2012.
H Sakoi and S Chiba, "Dynamic Programming Algorithm Optimization for Spoken Word Recognition", IEEE Transactions on acoustics, speech and signal processing, vol. Assp- 26, no. 1, February 1978.
L R Rabiner, A E Rosenberg, S E Levinson and J G Wilpon, "Speaker-Independent Recognition of Isolated Words Using Clustering Techniques", IEEE Transactions on acoustics, speech and signal processing, vol. Assp- 27, no. 4, August 1979.
L. R. Rabinar and M. R. Sambur, "An algorithm for determining the endpoints of isolated utterances", The Bell System Technical Journal, pp. 297-315, 1975.
L R Rabiner, A E Rosenberg, L F Lamel and J G Wilpon , "An Improved Endpoint Detector for Isolated Word Recognition", IEEE Transactions on acoustics, speech and signal processing, vol. Assp- 29, no. 4, 1981.
M A Bush, G E Kopec and N Lauritzen, "Segmentation in Isolated Word Recognition Using Vector Quantization", Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '84, vol. 9, 1984
L R Rabiner and B H Jaung, "An Introduction to Hidden Markov Models", IEEE ASSP Magazine, pp. 4-16, January 1986.
L R Rabiner and B H Jaung, "Hidden Markov Models for Speech Recognition", Technometrics, vol 33, no. 3, 1991.
S B Davis and P Mermelstein, "Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences", IEEE Transactions on acoustics, speech and signal processing, vol. Assp- 28, no. 4, 1980.
Wei Han, Cheong-Fat Chan, Chiu-Sing Choy and Kong-Pang Pun, "An Efficient MFCC Extraction Method in Speech Recognition", Proceedings of IEEE, ISCAS 2006, 2006
H A Patil and T K Basu, "Development of speech corpora for speaker recognition research and evaluation in Indian languages", IJST 2008, Springer, 2008
S Ranjan, "A Discrete Wavelet Transform Based Approach to Hindi Speech Recognition", Proceedings of International Conference on Signal Acquisition and Processing, IEEE, pp, 345-348, 2010.
K S Rao, "Application of prosody models for developing speech systems in Indian languages", IJST 2011, Springer, 2011.
I Bhardwaj and N D Londhe, "Hidden Markov Model Based Isolated Hindi Word Recognition", Proceedings of 2nd International Conference on Power, Control and Embedded Systems, IEEE, 2012.
T Pruthi, S Saksena and P K Das, "Swaranjali: Isolated Word Recognition for Hindi Language using VQ and HMM", International Conference on Multimedia Processing and Systems (ICMPS), IIT Madras.
S Tripathy, N Baranwal and G C Nandi, "A MFCC based Hindi Speech Recognition Technique using HTK Toolkit", Proceedings of the 2013 IEEE Second International Conference on Image Information Processing (ICIIP-2013), pp. 539-544, 2013.
A Sharma and A Kaur, "Automatic Segmentation of Punjabi Speech into Syllable-Like Units using Group Delay: A Review", Proceedings of International Journal of Computer Science & Engineering Technology (IJCSET), vol 4, no 6, 2013.
R. Kumar, "Comparison of HMM and DTW for isolated word recognition system for punjabi language", Proceedings of IJSC, vol 5, no. 3, pp. 88-92, 2010
Gurpreet Kaur, Parminder Singh and Amandeep Kaur, "Syllable Boundary Detection System for Punjabi Language", Proceedings of International Journal of Applied Research in Computing, vol. 1, no. 2, July 2013.
Ramana Rao G. V. and Srichand J, "Word boundary detection using pitch variations", Proceedings of Fourth International Conference on Spoken Language, 1996. ICSLP 96, pp. 813-816, May 1996.
Wiqas Ghai and Navdeep Singh, "Continuous Speech recognition for Punjabi Language", International Journals of Computer Application, vol. 72, no. 14, May 2013.
J. Psutka, L. Muller and J. V. Psutka, "Comparison of MFCC and PLP Parametrizations in Speaker Independent continuous speech recognition task", Eurospeech 2001, Scandanavia.
A. M. Toh, R. Togneri and S. Nordholm, "Investigating robust features for speech recognition in hostile environment", Asia Pacific Conference on Communication IEEE, October 2005.
H. Manabe and Z. Zhang, "Multi-stream HMM for EMG-Based Speech Recognition", Multimedia Laboratories, NTT Docomo, Kanagawa, Japan.
Muskan and Naveen Aggarwal, "Punjabi Speech Recognition: A Survey", Proceedings of ICAET, May 2014.
A. N. Mishra, M Chandra, A Biswas, S. N. Sharan, "Robust features for connected Hindi digits recognition", International Journal of Signal Processing, Image Processing and Pattern Recognition, Vol. 4, No. 2, June, 2011
K. M Krishna, M V Lakshmi and S. Sathiya Lakshmi, "Feature Extraction and Dimensionality Reduction using IPS for Isolated Tamil Words Speech Recognizer", International Journal of Advanced Research in Computer and Communication Engineering, Vol. 3, Issue 3, March 2014.
M Alsulaiman, G Muhammad and Z Ali, "Comparison of Voice Features for Arabic Speech Recognition", IEEE, 2011.
V Tiwari, "MFCC and its applications in speaker recognition", Proceedings of IJET, 2010.
Bassam A. Q. Al-Qatab and Raja N. Ainon, "Arabic speech recognition using Hidden Markov Model toolkit (HTK)", IEEE, 2010.
M Yanzhou and Y Mianzhu, "Russian Speech Recognition System Design Based on HMM", Proceedings of LEMCS, 2014.
J Kaur, Nidhi, R Kaur, "Issues involved in speech-to-text conversion", International Journal Of Computational Engineering Research, Vol. 2, Issue No. 2, Page No. 512-515, Mar-Apr 2012
S R Mankala, S R Bojja, V S Ramaiah & R. Rajeswara Rao, "Automatic speech processing using HTK for Telghu language", International Journal of Advances in Engineering & Technology, Jan. 2014
A Kumar, M Dua and T Chaudhary, "Continuous Hindi Speech Recognition using Monophone based Acoustic Modeling", International Journal of Computer Applications, 2014.
K. Murali Krishna, M. Vanitha Lakshmi and S. Sathiya Lakshmi, "Feature Extraction and Dimensionality Reduction using IPS for Isolated Tamil Words Speech Recognizer", International Journal of Advanced Research in Computer and Communication Engineering, Vol. 3, Issue 3, March 2014.
S Young, "The HTK Book", Cambridge University Engineering Department.
E Vozarikova, J Juhar and A Cizmar, "Dual Shots Detection", Information and Communication technologies and services, Vol. 10, Issue. 4, 2012.
N H Quang, T V Loan, LE The Dat, "Automatic Speech Recognition for Vietnamese using HTK System", IEEE, 2010.

Index Terms

Computer Science

Information Sciences

Keywords

Speech Recognition MFCC PLP LPC FBank Melspec