CFP last date
22 April 2024
Reseach Article

Methodology for Gender Identification, Classification and Recognition of Human Age

Published on December 2015 by Shivaji J Chaudhari, Ramesh M. Kagalkar
National Conference on Advances in Computing
Foundation of Computer Science USA
NCAC2015 - Number 2
December 2015
Authors: Shivaji J Chaudhari, Ramesh M. Kagalkar
a655d505-5eb6-4d51-a7ad-c1c77f876ab5

Shivaji J Chaudhari, Ramesh M. Kagalkar . Methodology for Gender Identification, Classification and Recognition of Human Age. National Conference on Advances in Computing. NCAC2015, 2 (December 2015), 5-10.

@article{
author = { Shivaji J Chaudhari, Ramesh M. Kagalkar },
title = { Methodology for Gender Identification, Classification and Recognition of Human Age },
journal = { National Conference on Advances in Computing },
issue_date = { December 2015 },
volume = { NCAC2015 },
number = { 2 },
month = { December },
year = { 2015 },
issn = 0975-8887,
pages = { 5-10 },
numpages = 6,
url = { /proceedings/ncac2015/number2/23362-5023/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 National Conference on Advances in Computing
%A Shivaji J Chaudhari
%A Ramesh M. Kagalkar
%T Methodology for Gender Identification, Classification and Recognition of Human Age
%J National Conference on Advances in Computing
%@ 0975-8887
%V NCAC2015
%N 2
%P 5-10
%D 2015
%I International Journal of Computer Applications
Abstract

The human voice is comprised of sound made by a human being using the vocal cord for talking,singing, laughing, crying and shouting. It is particularly a piece of human sound creation inwhich the vocal cord is the essential sound source, which play an important role in the conversation. The applications of speech or voice processing technology play a crucial role in humancomputer interaction. The system improves gender identification, age group classification, ageand emotion recognition performance. The research work uses new and efficient methods forfeature extraction of speech or voice and classification of standard method on the various audiodatasets. Mel Frequency Cepstral Coefficients feature extraction and selection is performed tofind a more suitable feature set for building speaker models. The proposed system uses GaussianMixture Model is a supervector for system feature selection and feature modelling. SupportVector Machine classification and feature matching technique is used to classify the featurefor different age groups like child, teenage, young, adult and senior to increase the resultantperformance and accuracy. The database is created using the audio files for each age group ofspeaker and for each emotion as an input, performs feature extraction and identifies the gender,classify age group, recognize age and emotion.

References
  1. Gil Dobry, Ron M. Hecht, Mireille Avigal and Yaniv Z, SEPTEMBER, 2011. Supervector Dimension Reduction for Efficient Speaker Age Estimation Based on the Acoustic Speech Signal,IEEE transaction V. 19, NO. 7.
  2. Hugo Meinedo1 and Isabel Trancoso, 2008Age and Gender Classification using Fusion of Acoustic and Prosodic Features,Spoken Language Systems Lab, INESC-ID Lisboa, Portugal, Instituto Superior Tecnico, Lisboa, Portugal.
  3. Ismail Mohd Adnan Shahin, 2013Gender-dependent emotion recognition based on HMMs and SPHMMs,Int J Speech Technol, Springer 16:133141.
  4. Mohamad Hasan Bahari and Hugo Van h, ITN2008 Speaker Age Estimation and Gender Detection Based on Supervised NonNegative Matrix Factorization, Centre for Processing Speech and Images Belgium.
  5. Shivaji J Chaudhari and Ramesh M Kagalkar, May 2015 Automatic Speaker Age Estimation and Gender Dependent Emotion Recognition, International Journal of Computer Applications(IJCA) (0975 - 8887),Volume 117 No. 17.
  6. Shivaji J. Chaudhari and Ramesh M. Kagalkar, July 2015 A Methodology for Efficient Gender Dependent Speaker Age and Emotion Identification System,International Journal of Advanced Research in Computer and Communication Engineering(IJARCCE) ISSN 2319- 5940,Volume 4, Issue 7.
  7. Chul Min Lee and Shrikanth S. Narayanan, 2005 Toward Detecting Emotions in Spoken Dialogs, IEEE transaction 1063-6676.
  8. Tetsuya Takiguchi and Yasuo Ariki, 2006 Robust feature extraction using kernel PCA,Department of Computer and System Engg Kobe University, Japan, ICASSP 1-4244-0469.
  9. Michael Feld, Felix Burkhardt and Christian Muller, 2010 Automatic Speaker Age and Gender Recognition in the Car for Tailoring Dialog and Mobile Services,German Research Center for Artificial Intelligence, INTERSPEECH.
  10. M A. Hossan, Sheeraz Memon and Mark A Gregory, A Novel Approach for MFCC Feature extraction, RMIT university, Melbourne, Australia, IEEE, 2010.
  11. Ruben Solera-Ure, 2008 Real-time Robust Automatic Speech Recognition Using Compact Support Vector Machines,TEC 2008-06382 and TEC 2008-02473.
  12. Wei HAN and Cheong fat CHAN, 2006 An Efficient MFCC Extraction Method in Speech Recognition,Department of Electronic Engineering, The Chinese University of Hong Kong Hong Kong, 7803-9390-06/IEEE ISCAS.
  13. AU Khan and L. P. Bhaiya, 2008 Text Dependent Method for Person Identification through Voice Segment,ISSN- 2277-1956 IJECSE.
  14. Felix Burkhardt, Martin Eckert, Wiebke Johannsen and Joachim Stegmann, 2010A Database of Age and Gender Annotated Telephone Speech, Deutsche Telekom AG Laboratories, Ernst-Reuter-Platz 7, 10587 Berlin, Germany.
  15. Lingli Yu and Kaijun Zhou, March 2014, A Comparative Study on Support Vector Machines classifiers for Emotional Speech Recognition, Immune Computation (IC) Volume2, Number:1.
  16. Rui Martins, Isabel Trancoso, Alberto Abad and Hugo Meinedo, 2009, Detection of Childrens Voices, Intituto Superior Tecnico, Lisboa, Portugal INESC-ID Lisboa, Portugal.
  17. Chao Gao, Guruprasad Saikumar, Amit Srivastava and Premkumar Natarajan, 2011, Open set Speaker Identification in Broadcast News, IEEE 978-1-4577-0539.
  18. Shivaji J Chaudhari and RameshMKagalkar, 2014, A Review of Automatic Speaker Age Classification, Recognition and Identifying Speaker Emotion Using Voice Signal, International Journal of Science and Research (IJSR 2014), ISSN(Online): 2319-7064,Volume 3.
  19. M Ferras, C CLeung, C Barras and Jean Luc Gauvain, 2010, Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition,IEEE Transaction 1558-7916.
  20. Chao Gao, Guruprasad Saikumar, Amit Srivastava and Premkumar Natarajan, 2011, Open-SetSpeaker Identification in Broadcast News, IEEE 978-1-4577-0539.
  21. ChaoWang, Ruifei Zhu, Hongguang Jia, QunWei, Huhai Jiang, Tianyi Zhang and LinyaoYu, 2013, Design of Speech Recognition System, IEEE 978-1-4673-2764-0/13.
  22. Manan Vyas, 2013"Gaussian Mixture Model Based Speech Recognition System Using Matlab",Signal and Image Proc An International Journal (SIPIJ) Vol. 4, No. 4.
Index Terms

Computer Science
Information Sciences

Keywords

Mel Frequency Cepstral Coefficient (mfcc) Gaussian Mixture Model (gmm) support Vector Machine (svm) Expectation-maximization (em) Maximum A Posteriori (map) Hidden Markov Models (hmms) Suprasegmental Hidden Markov Models (sphmms) Interactive Voice Response System (ivrs).