Music Genre Classification using MFCC, SVM and BPNN

Gursimran Kour; Neha Mehan

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 20 July 2026

Submit your paper

Know more

The week's pick

Quantifying Label-Induced Bias in Large Language Model Self and Cross Evaluations

Muskan Saraf Sajjad Rezvani Boroujeni Justin Beaudry Hossein Abedi Tom Bush

Random Articles

Key-Aggregate Cryptosystem based on Elliptic Curve Cryptography for Data Sharing in Cloud Storage with Result and Analysis

Jun

2018

Real time MATLAB Interface for speed control of Induction motor drive using dsPIC 30F4011

February

2010

A Resource Pool Management Model using Fuzzy Logic Decision Making

September

2011

Kinematic Model of a Four Mecanum Wheeled Mobile Robot

March

2015

Reseach Article

Music Genre Classification using MFCC, SVM and BPNN

by Gursimran Kour, Neha Mehan

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 112 - Number 6

Year of Publication: 2015

Authors: Gursimran Kour, Neha Mehan

10.5120/19669-1119

Gursimran Kour, Neha Mehan . Music Genre Classification using MFCC, SVM and BPNN. International Journal of Computer Applications. 112, 6 ( February 2015), 12-14. DOI=10.5120/19669-1119

@article{ 10.5120/19669-1119,

author = { Gursimran Kour, Neha Mehan },

title = { Music Genre Classification using MFCC, SVM and BPNN },

journal = { International Journal of Computer Applications },

issue_date = { February 2015 },

volume = { 112 },

number = { 6 },

month = { February },

year = { 2015 },

issn = { 0975-8887 },

pages = { 12-14 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume112/number6/19669-1119/ },

doi = { 10.5120/19669-1119 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T22:48:43.386410+05:30

%A Gursimran Kour

%A Neha Mehan

%T Music Genre Classification using MFCC, SVM and BPNN

%J International Journal of Computer Applications

%@ 0975-8887

%V 112

%N 6

%P 12-14

%D 2015

%I Foundation of Computer Science (FCS), NY, USA

Abstract

In the field of musical information retrieval, genre categorization is a complicated mission. MFCC is one of the feature extraction method use in classification of musical genre that is based on short speech signals. Searching and organizing are the main characteristics of the music genre classification system these days. This paper describes a new technique that uses support vector machines to classify songs based on features using MFCC, BPNN and SVM classifier does not classify songs based on the short signals. So these categories a number of acoustic features that include Mel-frequency Cepstral coefficients are extracted to characterize the audio content. Support vector machines and BPNN classifies audio into their respective classes by learning from training data. The simulation is taken place in MATLAB by making experiments on different genres . The results obtained by this proposed technique are promising.

References

Abu-El-Quran, A. R. , Goubran, R. A. , & Chan, A. D. C. (2006). Security monitoring using microphone arrays and audio classification. IEEE Transactions on Instrumentation and Measurement, 55(4), 1025–1032.
Ajmera, J. , McCowan, I. , & Bourlard, H. (2003). Speech/music segmentation using entropy and dynamism features in a HMM classification framework. Speech Communication, 40(3), 351–363.
Duda, R. O. , Hart, P. E. , & Stork, D. G. (2001). Pattern classification. New York: John Wiley-Interscience. Eronen, A. J. , Peltonen, V. T. , Tuomi, J. T. , Klapuri, A. P. , Fagerlund, S.
Sorsa, T. , et al. (2006). Audio-based context recognition. IEEE Transactions on Audio, Speech and Language Processing, 14(1), 321–329.
Esmaili, S. , Krishnan, S. , & Raahemifar, K. (2004). Content based audio classification and retrieval using joint time–frequency analysis. In IEEE international conference on acoustics, speech and signal processing (pp. 665–668).
Guo, G. , & Li, S. Z. (2003). Content-based audio classification and retrieval by support vector machines. IEEE Transactions on Neural Networks, 14(1), 308–315.
Haykin, S. (2001). Neural networks a comprehensive foundation. Asia: Pearson Education.
Huang, R. , & Hansen, J. H. L. (2006). Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora. IEEE Transactions on Audio, Speech and Language Processing, 14(3), 907–919.
Jiang, H. , Bai, J. , Zhang, S. , & Xu, B. (2005). SVM-based audio scene classification. Proceeding of the IEEE, 131–136.
Kiranyaz, S. , Qureshi, A. F. , & Gabbouj, M. (2006). A generic audio classification and segmentation approach for multimedia indexing and retrieval. IEEE Transactions on Audio, Speech and Language Processing, 14(3), 1062–1081.
Li, S. Z. (2000). Content-based audio classification and retrieval using the nearest feature line method. IEEE Transactions on Speech and Audio Processing, 8(5), 619–625.
Lin, C. -C. , Chen, S. -H. , Truong, T. -K. , & Chang, Y. (2005). Audio classification and categorization based on wavelets and support vector machine. IEEE Transactions on Speech and Audio Processing, 13(5), 644–651.
Li, D. , Sethi, I. K. , Dimitrova, N. , & McGee, T. (2001). Classification of general audio data for content-based retrieval. Pattern Recognition Letters, 22, 533–544.
Lu, L. , Zhang, H. -J. , & Li, S. Z. (2003). Content-based audio classification and segmentation by using support vector machines. Multimedia Systems, 8, 482–492.
McConaghy, T. , Leung, H. , Boss, E. , & Varadan, V. (2003). Classification of audio radar signals using radial basis function neural networks. IEEE Transactions on Instrumentation and Measurement, 52(6), 1771–1779.
Mubarak, O. M. , Ambikairajah, E. , & Epps, J. (2005). Analysis of an MFCC-based audio indexing system for efficient coding of multimedia sources. In IEEE international conference on acoustics, speech and signal processing (pp. 619–622).
Palanivel, S. (2004). Person authentication using speech, face and visual speech. Ph. D thesis, Madras: IIT.
Panagiotakis, C. , & Tziritas, G. (2005). A speech/music discriminator based on rms and zero-crossings. IEEE Transactions on Multimedia, 7(1), 155–156.
Rabiner, L. , & Juang, B. (2003). Fundamentals of speech recognition. Singapore: Pearson Education.

Index Terms

Computer Science

Information Sciences

Keywords

SVM MFCC BPNN training classification feature extraction.