Designing and Recording Emotional Speech Databases

Research Article

Published in March 2012 by Swati D. Bhutekar, M. B. Chandak

2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013)

Foundation of Computer Science USA

NCIPET - Number 14

March 2012

Authors: Swati D. Bhutekar, M. B. Chandak

Swati D. Bhutekar, M. B. Chandak. Designing and Recording Emotional Speech Databases. 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013). NCIPET, 14 (March 2012), 4-10.

@article{
author = { Swati D. Bhutekar, M. B. Chandak },
title = { Designing and Recording Emotional Speech Databases },
journal = { 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013) },
issue_date = { March 2012 },
volume = { NCIPET },
number = { 14 },
month = { March },
year = { 2012 },
issn = { 0975-8887 },
pages = { 4-10 },
numpages = { 7 },
url = { /proceedings/ncipet/number14/5294-1106/ },
publisher = { Foundation of Computer Science (FCS), NY, USA },
address = { New York, USA }
}

%0 Proceeding Article
%1 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013)
%A Swati D. Bhutekar
%A M. B. Chandak
%T Designing and Recording Emotional Speech Databases
%J 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013)
%@ 0975-8887
%V NCIPET
%N 14
%P 4-10
%D 2012
%I International Journal of Computer Applications

Abstract

This paper describes the factors to consider when designing and recording large speech databases for applications that require speech synthesis. Given the growing demand for customized and domain-specific voices in corpus-based synthesis systems, good practices should be established for creating these databases, which are a key factor in the quality of the resulting speech synthesizer. The paper focuses on the factors that affect the design of the recording prompts, the speaker selection procedure, the recording setup, and the quality control of the resulting database. One way to detect emotion in speech is as follows: once the user's speech has been recorded, it is converted into text; at the same time, the stressed word in the speech is identified, and the frequency of that word is then determined in order to record the corresponding emotion.
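The last step of the abstract (locate the stressed word, then measure its frequency) can be illustrated with a minimal sketch. The abstract does not say how stress or frequency is measured, so the following rests on assumptions: the stressed word is approximated by the frame with the highest short-time energy, and "frequency" is read as fundamental frequency, estimated from the autocorrelation peak. The function names, frame handling, and the 75-400 Hz search range are hypothetical and not taken from the paper. The speech-to-text step and the mapping from frequency to an emotion label are left out, because the abstract gives no details for them.

```python
import math


def short_time_energy(samples, frame_len):
    """Return the energy of each non-overlapping frame of frame_len samples."""
    return [
        sum(s * s for s in samples[i:i + frame_len])
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def estimate_f0(frame, sample_rate, f_min=75.0, f_max=400.0):
    """Estimate fundamental frequency in Hz from the autocorrelation peak.

    Assumption: the search range f_min..f_max is a typical range for
    speech, not a value given in the paper.
    """
    lag_min = max(1, int(sample_rate / f_max))
    lag_max = min(int(sample_rate / f_min), len(frame) - 1)
    best_lag, best_score = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Unnormalized autocorrelation of the frame at this lag.
        score = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag if best_lag is not None else None


def stressed_frame_pitch(samples, sample_rate, frame_len):
    """Pick the highest-energy frame (a stand-in for the stressed word)
    and return its index together with its estimated frequency."""
    energies = short_time_energy(samples, frame_len)
    idx = max(range(len(energies)), key=energies.__getitem__)
    frame = samples[idx * frame_len:(idx + 1) * frame_len]
    return idx, estimate_f0(frame, sample_rate)
```

In practice the frame boundaries would come from word alignments produced by the speech-to-text step rather than from fixed-length frames; fixed frames are used here only to keep the sketch self-contained.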

Index Terms

Computer Science

Information Sciences

Keywords

Extraction of Emotion in Speech Database Recording