Text Dependent Speaker Identification using Row Mean Vector of Variable Sized Spectrogram

Call for Paper

July Edition

IJCA solicits high quality original research papers for the upcoming July edition of the journal. The last date of research paper submission is 22 June 2026

Submit your paper

Know more

The week's pick

Multi-Band RLS Estimation with Rank Two Updates: Application to Short-Term Temperature Forecast

Alexander Stotsky

Random Articles

FPGA Implementation of Interrupt Controller (8259) by using Verilog HDL

June

2012

Tracking Line Segment without Knowledge of Camera Motion

July

2013

Analysis of Boneh-Shaw Finger Printing Codes under Majority Value Collusion Attacks

Jun

2017

AMHCD: A Database for Amazigh Handwritten Character Recognition Research

August

2011

Reseach Article

Text Dependent Speaker Identification using Row Mean Vector of Variable Sized Spectrogram

Published on None 2011 by Dr. H. B. Kekre, Archana Athawale, Mrunali Desai

International Conference and Workshop on Emerging Trends in Technology

Foundation of Computer Science USA

ICWET - Number 1

None 2011

Authors: Dr. H. B. Kekre, Archana Athawale, Mrunali Desai

Dr. H. B. Kekre, Archana Athawale, Mrunali Desai . Text Dependent Speaker Identification using Row Mean Vector of Variable Sized Spectrogram. International Conference and Workshop on Emerging Trends in Technology. ICWET, 1 (None 2011), 43-47.

@article{

author = { Dr. H. B. Kekre, Archana Athawale, Mrunali Desai },

title = { Text Dependent Speaker Identification using Row Mean Vector of Variable Sized Spectrogram },

journal = { International Conference and Workshop on Emerging Trends in Technology },

issue_date = { None 2011 },

volume = { ICWET },

number = { 1 },

month = { None },

year = { 2011 },

issn = 0975-8887,

pages = { 43-47 },

numpages = 5,

url = { /proceedings/icwet/number1/2065-aca171/ },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Proceeding Article

%1 International Conference and Workshop on Emerging Trends in Technology

%A Dr. H. B. Kekre

%A Archana Athawale

%A Mrunali Desai

%T Text Dependent Speaker Identification using Row Mean Vector of Variable Sized Spectrogram

%J International Conference and Workshop on Emerging Trends in Technology

%@ 0975-8887

%V ICWET

%N 1

%P 43-47

%D 2011

%I International Journal of Computer Applications

Abstract

In this paper a simple approach to text dependent speaker identification using spectrograms and row mean is presented. This, mainly, revolves around trapping the complex patterns of variation in frequency and amplitude with time while an individual utters a given word through histogram equalized spectrogram. These histogram equalized spectrograms are used as a database to successfully identify the unknown individual from his/her voice. The features used for identifying, rely on optimal spectrogram segmentation and the Euclidean distance of the distributional features of the spectrograms of the unknown voice with that of a given known speaker in the database. Performance of this novel approach on a sample collected as two separate databases from 12 speakers and 28 speakers show that this methodology can be effectively used to produce a desirable success rate.

References

Abdul Manan Ahmad, Loh Mun Yee “Vector Quantization Decision Function for Gaussian Mixture Model Based Speaker Identification”, 2008 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS2008) Swissôtel Le Concorde,Bangkok,Thailand
Ali Zulfiqar, A. Muhammad, A. Enriquez, A.M., “A Speaker Identification System using MFCC Features with VQ Technique”, 2009 Third International Symposium on Intelligent Information Technology Application, Vol 3, pp 115-118.
Bojan Imperl, “Speaker recognition techniques”, Laboratory for Digital Signal Processing, Faculty of Electrical Engineering and Comp. Sci., Smetanova 17, 2000 Maribor, Slovenia.
Dr. H. B. Kekre, S D Thepade, A Athawale, A Shah, P Verlekar, S Shirke, “Image Retrieval using DCT on Row Mean, Column Mean and Both with Image Fragmentation”, International Conference and Workshop on Emerging Trends in Technology (ICWET 2010) – TCET, Mumbai, India, February 26-27, 2010.
Dr. H. B. Kekre, Dr. Tanuja K. Sarode, Shachi J. Natu, Prachi J. Natu, “Speaker Identification Using 2-D DCT, Walsh And Haar On Full And Block Spectrogram”, (IJCSE) International Journal on Computer Science and Engineering Vol. 02, No. 05, 2010, 1733-1740.
Tridibesh Dutta, “Text dependent speaker identification based on spectrograms”, Proceedings of Image and vision computing, pp. 238-243, New Zealand 2007.
Y. Linde, A. Buzo, R. M. Gray, “An algorithm for Vector Quantizer Design”, IEEE Transaction on Communications, 28: 1980, pp 84-95.

Index Terms

Computer Science

Information Sciences

Keywords

Speaker Identification Speaker Recognition Histogram Spectrograms Row Mean