Feature Extraction Techniques in Speech Processing: A Survey

Rekha Hibare; Anup Vibhute

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 21 July 2025

Submit your paper

Know more

The week's pick

Navigating the Future of Cybersecurity: A Strategic Approach to Crypto Agility for Modern Enterprises

Aditya Gupta

Random Articles

Automatic Speaker Age Estimation and Gender Dependent Emotion Recognition

May

2015

A Hybrid Data Model to Share Medical Images

Mar

2017

A RFID based Inventory Control System for Nigerian Supermarkets

April

2015

Article:Analyzing EAP TLS & ERP Protocol with varying processor speed

October

2010

Reseach Article

Feature Extraction Techniques in Speech Processing: A Survey

by Rekha Hibare, Anup Vibhute

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 107 - Number 5

Year of Publication: 2014

Authors: Rekha Hibare, Anup Vibhute

10.5120/18744-9997

Rekha Hibare, Anup Vibhute . Feature Extraction Techniques in Speech Processing: A Survey. International Journal of Computer Applications. 107, 5 ( December 2014), 1-8. DOI=10.5120/18744-9997

@article{ 10.5120/18744-9997,

author = { Rekha Hibare, Anup Vibhute },

title = { Feature Extraction Techniques in Speech Processing: A Survey },

journal = { International Journal of Computer Applications },

issue_date = { December 2014 },

volume = { 107 },

number = { 5 },

month = { December },

year = { 2014 },

issn = { 0975-8887 },

pages = { 1-8 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume107/number5/18744-9997/ },

doi = { 10.5120/18744-9997 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T22:40:14.439917+05:30

%A Rekha Hibare

%A Anup Vibhute

%T Feature Extraction Techniques in Speech Processing: A Survey

%J International Journal of Computer Applications

%@ 0975-8887

%V 107

%N 5

%P 1-8

%D 2014

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Speech processing includes the various techniques such as speech coding, speech synthesis, speech recognition and speaker recognition. In the area of digital signal processing, speech processing has versatile applications so it is still an intensive field of research. Speech processing mostly performs two fundamental operations such as Feature Extraction and Classification. The main criterion for the good speech processing system is the selection of feature extraction technique which plays an important role in the system accuracy. This paper intends to focus on the survey of various feature extraction techniques in speech processing such as Fast Fourier Transforms, Linear Predictive Coding, Mel Frequency Cepstral Coefficients, Discrete Wavelet Transforms, Wavelet Packet Transforms, Hybrid Algorithm DWPD and their applications in speech processing.

References

Dr. Yousra F. , Al-Irhaim Enaam Ghanem Saeed, "Arabic word recognition using wavelet neural network", Scientific Conference in Information Technology, November 2010.
Sonia Sunny, David Peter S, K Poulose Jacob, "Design of a Novel Hybrid Algorithm for Improved Speech Recognition with Support vector Machines Classifier", International Journal of Emerging Technology and Advanced Engineering, vol. 3, pp. 249-254, June 2013.
Tingxiao Yang, "The Algorithms of Speech Recognition, Programming and Simulating in MATLAB", University of Gavale, pp. 1-49, January 2012.
Yoshua Bengio, Renato De Mori, Regis Cardin, "Speaker Independent Speech Recognition with Neural Networks and Speech Knowledge", Department of Computer Science McGill University, pp. 218-225.
Bhiksha Raj, Lorenzo Turicchia, Bent Schmidt-Nielsen, and Rahul Sarpeshkar, "An FFT-Based Companding Front End for Noise-Robust Automatic Speech Recognition", EURASIP Journal on Audio, Speech, and Music Processing, vol. 2007,pp. 1-13,2007.
Greg Hopper, Reza Adhami, "An FFT-based speech recognition system", Journal of Franklin Institute, vol. 329, no. 3, pp. 555-565, May 1992.
Adam Glowacz, Witold Glowacz, Andrzej Glowacz, "Sound Recognition of Musical Instruments with Application of FFT and K-NN classifier with Cosine Distance ",AGH university of Science and Technology, Work supported by European Regional Development Fund INSIGMA Project No. POIG. 01. 01. 02-00-062/09, 2010.
Gil Lopes, Fernando Ribeiro, Paulo Carvalho, "Whistle Sound Recognition in Noisy Environment "Universidade do Minho, Departamento de Electrónica Industrial, Guimarães, Portugal.
Shing-Tai Pan, Chih-Chin Lai and Bo-Yu Tsai, "The Implementation of Speech Recognition Systems on FPGA-Based Embedded Systems with SOC Architecture", International Journal of Innovative Computing, Information and Control,vol. 7, no. 11,pp. 6161-6175, November 2011.
Hemant Tyagi, Rajesh M. Hegde, Hema A. Murthy and Anil Prabhakar, "Automatic Identification of Bird calls using Spectral Ensemble Average Voice Prints", Indian Institute of Technology Madras.
Dwijen Rudrapal, Smita Das, S. Debbarma, N. Kar, N. Debbarma, "Voice Recognition and Authentication as a Proficient Biometric Tool and its Application in Online Exam for P. H People", International Journal of Computer Applications (0975 – 8887), vol. 39,no. 12, pp. 7-12, February 2012.
Umesh Kumar Gupta,Dr. R. K. Prasad, "Frequency Analisys of Speech Signals for Devanagari Script and Numerals Using FFT", International Journal of Advanced Research in Computer Science and Software Engineering, vol. 3. no. 5, pp. 471-477, May 2013.
Asm Sayem, "Speech Analysis for Alphabets in Bangla Language:Automatic Speech Recognition", International Journal of Engineering Research, vol. 3, no. 2, pp. 88-93, February 2014.
Shady Y. EL-Mashed ,Mohammed I. Sharway , Hala H. Zayed, "Speaker Independent Arabic Speech Recognition using Support Vector Machin", Department of Electrical Engineering, Shoubra Faculty of Engineering, Benha University, Cairo, Egypt.
Shumaila Iqbal, Tahira Mahboob and Malik Sikandar Hayat Khiyal, "Voice Recognition using HMM with MFCC for Secure ATM", IJCSI International Journal of Computer Science Issues, vol. 8, No 3, pp. 297-303, November 2011,
Vaishali M. Chavan, V. V. Gohokar, "Speech Emotion Recognition by using SVM-Classifier", International Journal of Engineering and Advanced Technology (IJEAT), vol. 1, pp. 11-15, June 2012.
Siddheshwar S. Gangonda, Dr. Prachi Mukherji, "Speech Processing for Marathi Numeral Recognition using MFCC and DTW Features", International Journal of Engineering Research and Applications (IJERA), pp. 218-222, March 2012.
John H. L. Hansen, Ruhi Sarikaya, Umit Yapanel, Bryan Pellom, "Robust Speech Recognition in Noise: An Evaluation using the SPINE Corpus", CSLR: Center for Spoken Language Research; Robust Speech Processing Laboratory, 2001.
David Wagner, "A Speech Recognition Project", http://www. cs. dartmouth. edu/~dwagn/aiproj/speech. html
Kumar Rakesh, Subhangi Dutta and Kumara Shama, "Gender Recognition using Speech Processing Techniques in Lab View", International Journal of Advances in Engineering & Technology, vol. 1, pp. 51-63, May 2011,
Wahyu Kusuma R. , Prince Brave Guhyapati V. , "Simulation Voice Recognition System for controlling Robotic Applications", Journal of Theoretical and Applied Information Technology,vol. 39, no. 2,pp. 188-196, May 2012.
Thiang and Suryo Wijoyo, "Speech Recognition Using Linear Predictive Coding and Artificial Neural Network for Controlling Movement of Mobile Robot", International Conference on Information and Electronics Engineering, vol. 6, pp. 179-183, 2011.
John G. Ackenhusen, L. R. Rabiner, "Microprocessor Implementation of an LPC-Based Isolated Word Recognizer", Proc. IEEE, pp. 746-749, 1981.
J. E. Munoz-Exposito, S. Garcia-Galan, N. Ruiz-Reyes, P. Vera-Candeas and F. Rivas-Pena, "Speech/Music Discrimination using a single Warped LPC-Based Feature", Queen Mary University of Londan, pp. 614-617, 2005.
Bishnu Prasad Das, Ranjan Parekh, "Recognition of Isolated Words using Features based on LPC, MFCC, ZCR and STE, with Neural Network Classifiers", International Journal of Modern Engineering Research, vol. 2, pp. 854-858, May-June 2012.
Tobias Bengtsson, "Speech recognition using multilayer perceptron artificial neural network", Department of Computer Science Lund University.
Omesh Wadhwani, Amit Kolhe, Sanjay Dekate, "Recognition of Vernacular Language Speech for Discrete Words using Linear Predictive Coding Technique", International Journal of Soft Computing and Engineering, vol. 1, pp. 188-192, November 2011.
Paul A. K. , Das D. , Kamal M. M. , "Bangla Speech Recognition System Using LPC and ANN", Proc. IEEE, pp. 171-174, 2001.
Javier Harnando, Climent Nadeu, "Speech Recognition in noisy car environment based on OSALPC representation and robust similarity measuring techniques", Proc. IEEE, 1994.
Kadam V. K, Dr. R. C. Thool, "Performance Analysis of Optimization Tool for Speech Recognition Using LPC & DSK TMS3206711/13 Using Simulink & Matlab", International Journal Of Computational Engineering Research, vol. 2, pp. 1243-1248, September 2012.
Gatt E. , Grech I, Casha O. , "Discrete wavelet transforms with multiclass SVM for phoneme recognition", Proc. IEEE, pp. 1695-1700, 2013.
Firoz Shah. A, Raji Sukumar. A and Babu Anto. P, "Discrete Wavelet Transforms and Artificial Neural Networks for Speech Emotion Recognition", International Journal of Computer Theory and Engineering, vol. 2, no. 3, pp. 319-322, June 2010.
Jeih-Weih Hung, Hao-Teng Fan , and Syu-Siang Wang, "Several New DWT-Based Methods for Noise-Robust Speech Recognition", International Journal of Innovation, Management and Technology, vol. 3, no. 5, pp. 547-551, October 2012.
Jagannath H Nirmal, Mukesh A Zaveri, Suprava Patnaik1 and Pramod H Kachare, "A novel voice conversion approach using admissible wavelet packet decomposition", EURASIP Journal on Audio, Speech, and Music Processing, 2013.
T. B. Adam, M. S. Salam, T. S. Gunawan, "Wavelet Cesptral Coefficients for Isolated Speech Recognition", International Islamic University Malaysia, vol. 11, no. 5, pp. 2731-2738, May 2013.
Mariusz Zio?ko, Jakub Ga?ka, Bartosz Zio?ko, Tomasz Jadczyk, Dawid Skurzok, Jan Wicijowski, "Automatic Speech Recognition System Based on Wavelet Analysis", IEEE Fourth International Conference on Semantic Computing, pp. 450-451, 2010.
Adriano de Andrade Bresolin, Adriao Duarte Doria Neto e Pablo Javier Alsina, "A New Hierarchical Structure for Speech Recognition by units smaller than words, using Wavelet Packet and SVM", UTFPR Brazil, UFRN Brazil.
Sozan Mahmood and Mihran Abdulrahim, "Hybrid Speech Recognition System based on Wavelet 9/7 and Mel-Frequency Cepstral Coefficient", International Conference on Emerging Trends in Computer and Electronics Engineering, pp. 19-22, March 2012.
S. Datta and Farooq O. "Wavelet-based denoising for robust feature extraction for speech recognition", Proc. IEEE, vol. 39, pp. 163-165, January 2003.
Bartosz Ziolko, Wojciech Koz lowski, Mariusz Ziolko, Rafa Samborski, David Sierra, Jakub Ga lka, "Hybrid Wavelet-Fourier-HMM Speaker Recognition", AGH University of Science and Technology Krakow, Poland, July 2011.
Sanja Grubesa, Tomislav Grubesa, Hrvoje Domitrovic, "Speaker Recognition Method combining FFT, Wavelet Functions and Neural Networks", Faculty of Electrical Engineering and Computing, University of Zagreb, Croatia.
Mohamed Cherif Amara Korba, Djemil Messadeg, Rafik Djemili, Hocine Bourouba, "Robust Speech Recognition Using Perceptual Wavelet Denoising and Mel-frequency Product Spectrum Cepstral Coefficient Features", Informatica 32, pp. 283-288, 2008.
Aniruddha Adiga, Mathew Magimai, Chandra Sekhar Seelamantula,"Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition".
Sonia Sunny, David Peter S, K Poulose Jacob, "Development of a Speech Recognition System for Speaker Independent Isolated Malayalam Words", International Journal of Computer Science & Engineering Technology, vol. 3, no. 4, pp. 69-75, April 2012.
Sonia Sunny, David Peter S, K Poulose Jacob, "Recognition of Speech Signals: An Experimental Comparison of Linear Predictive Coding and Discrete Wavelet Transforms", International Journal of Engineering Science and Technology, vol. 4, no. 4, pp. 1594-1601, April 2012.
Mohammed Anwer and Rezwan-Al-Islam Khan, "Voice identification Using a Composite Haar Wavelets and Proper Orthogonal Decomposition", International Journal of Innovation and Applied Studies, vol. 4, no. 2, pp. 353-358, October 2013.
Tariq Abu Hilal , Hasan Abu Hilal, RiyadQqQ El Shalabi and Khalid Daqrouq, "Speaker Verification System Using Discrete Wavelet Transform And Formants Extraction Based On The Correlation Coefficient", International Multi Conference of Engineers and Computer Scientists,vol. 2, March 2011.
Marco Jeub, Dorothea Kolossa, Ramon F. Astudillo, Reinhold Orglmeister, "Performance Analysis of Wavelet-based Voice Activity Detection", NAG/DAGA-Rotterdam, 2009.
Beng T Tan, Robert lang, Hieko Schroder, Andrew Spray, Phillip Dermody, "Applying Wavelet Analysis to Speech Segmentation and Classification", Department of Computer Science.
Bartosz Zioko, Suresh Manandhar, Richard C. Wilson and Mariusz Zioko, "Wavelet Method of Speech Segmentation", University of York Heslington, YO10 5DD, York, UK.
Akhilesh Tiwari, Dr. A. S. Zadgaoankar, "Speech Signal Analysis through Wavelets and Finding Similar Patterns in Signals of Regional Dialects of Large Demographic Region", International Journal of Advanced Research in Computer Science and Software Engineering, vol. 3,pp. 420-423, July 2013.
Mohamed El-wakdy, Ehab El-sehely, Mostafa El-tokhy, Adel El-hennawy I. M. , "Speech Recognition using a Wavelet Transform to establish Fuzzy Inference System through Subtractive Clustering and Neural Network (ANFIS)", 12th WSEAS International Conference on SYSTEMS, Heraklion, Greece, pp. 381-386, July 2008.

Index Terms

Computer Science

Information Sciences

Keywords

Feature Extraction Fast Fourier Transform Mel Frequency Cepstral Coefficients Linear Predictive Coding Discrete Wavelet Transforms Wavelet Packet Transform Hybrid Algorithm DWPD.