Call for Paper - July 2022 Edition
IJCA solicits original research papers for the July 2022 Edition. Last date of manuscript submission is June 20, 2022. Read More

Neutral Speech to Target Speech Conversion by Prosodic Modification

Print
PDF
IJCA Proceedings on National Conference on Electronics, Signals and Communication
© 2018 by IJCA Journal
NCESC 2017 - Number 1
Year of Publication: 2018
Authors:
Shreegowri A J
D. J. Ravi

Shreegowri A J and D J Ravi. Article: Neutral Speech to Target Speech Conversion by Prosodic Modification. IJCA Proceedings on National Conference on Electronics, Signals and Communication NCESC 2017(1):4-6, July 2018. Full text available. BibTeX

@article{key:article,
	author = {Shreegowri A J and D. J. Ravi},
	title = {Article: Neutral Speech to Target Speech Conversion by Prosodic Modification},
	journal = {IJCA Proceedings on National Conference on Electronics, Signals and Communication},
	year = {2018},
	volume = {NCESC 2017},
	number = {1},
	pages = {4-6},
	month = {July},
	note = {Full text available}
}

Abstract

The dynamics of prosodic features are utilized for speech emotion conversion. In particular, emotion conversion of neutral speech to sad, fear, anger and happy speech is accomplished. The prosodic features considered for the study are pitch contour and duration. Subjective listening test results show that the effectiveness of perception of emotion is better in the case of pitch contour and duration for the whole utterance. The results show that the converted sad, fear, angry speech are perceived very close to natural sad, fear, anger and happy emotions.

References

  • Amrita, Bageshree Pathak "Emotion Conversion Of Speech Signal Using Neural Network"; IJEEDC-2017
  • "GMM-Based Emotional Voice Conversion Using Spectrum and Prosody Features"; American Journal of Signal Processing;p-ISSN: 2012.
  • Vroomen, J. , Collier, R. , Mozziconacci, S. : Duration and intonation in emotional speech. Eurospeech 1, 577–580 (1993)
  • Tao, J. , Kang, Y. , Li, A. : Prosody conversion from neutral speech to emo- tional speech. IEEE Transactions on Audio, Speech, and Language Processing 14, 1145–1154 (2006)
  • Rao, K. S. , Yegnanarayana, B. : Prosody modi?cation using instants of signi?cant ex- citation. IEEE Transactions on Audio, Speech and Language Processing 14, 972–980 (2006)
  • Paeschke, A. , Sendlmeier, W. F. : Prosodic characteristics of emotional speech: measurements of fundamental frequency movements. In: Speech Emotion, pp. (2000)
  • Koolagudi, S. G. , Maity, S. , Kumar, V. A. , Chakrabarti, S. , Sreenivasa Rao, K. : IITKGP-SESC: Speech database for emotion analysis. In: Ranka, S. , et al. (eds. ) IC3 2009. CCIS, vol. 40, pp. 485–492. Springer, Heidelberg (2009)
  • Yegnanarayana, B. , Murty, K. S. R. : Event-based instantaneous fundamental fre- quency estimation from speech signals. IEEE Transactions on Audio, Speech and Language Process 17(4), 614–625 (2009)