Analysis of a Modern Voice Morphing Approach using Gaussian Mixture Models for Laryngectomees

Aman Chadha; Bharatraaj Savardekar; Jay Padhya

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 21 July 2025

Submit your paper

Know more

The week's pick

FORENSIC ANALYSIS FRAMEWORKS FOR ENCRYPTED CLOUD STORAGE INVESTIGATIONS

Joy Awoleye Sarah Mavire Allan Munyira Kelvin Magora

Random Articles

Design of Instruction Service Quality System in Accordance with the Information and Communication Technology Frameworks

March

2016

Novel Notch Detection Algorithm for Detection of Dicrotic Notch in PPG Signals

January

2014

Design and Simulation of OTA using DTMOS Technique in 180 nm CMOS Process

April

2016

A Survey on FM-UWB Transceivers

January

2013

Reseach Article

Analysis of a Modern Voice Morphing Approach using Gaussian Mixture Models for Laryngectomees

by Aman Chadha, Bharatraaj Savardekar, Jay Padhya

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 49 - Number 21

Year of Publication: 2012

Authors: Aman Chadha, Bharatraaj Savardekar, Jay Padhya

10.5120/7896-1235

Aman Chadha, Bharatraaj Savardekar, Jay Padhya . Analysis of a Modern Voice Morphing Approach using Gaussian Mixture Models for Laryngectomees. International Journal of Computer Applications. 49, 21 ( July 2012), 25-30. DOI=10.5120/7896-1235

@article{ 10.5120/7896-1235,

author = { Aman Chadha, Bharatraaj Savardekar, Jay Padhya },

title = { Analysis of a Modern Voice Morphing Approach using Gaussian Mixture Models for Laryngectomees },

journal = { International Journal of Computer Applications },

issue_date = { July 2012 },

volume = { 49 },

number = { 21 },

month = { July },

year = { 2012 },

issn = { 0975-8887 },

pages = { 25-30 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume49/number21/7896-1235/ },

doi = { 10.5120/7896-1235 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T20:47:01.205356+05:30

%A Aman Chadha

%A Bharatraaj Savardekar

%A Jay Padhya

%T Analysis of a Modern Voice Morphing Approach using Gaussian Mixture Models for Laryngectomees

%J International Journal of Computer Applications

%@ 0975-8887

%V 49

%N 21

%P 25-30

%D 2012

%I Foundation of Computer Science (FCS), NY, USA

Abstract

This paper proposes a voice morphing system for people suffering from Laryngectomy, which is the surgical removal of all or part of the larynx or the voice box, particularly performed in cases of laryngeal cancer. A primitive method of achieving voice morphing is by extracting the source's vocal coefficients and then converting them into the target speaker's vocal parameters. In this paper, we deploy Gaussian Mixture Models (GMM) for mapping the coefficients from source to destination. However, the use of the traditional/conventional GMM-based mapping approach results in the problem of over-smoothening of the converted voice. Thus, we hereby propose a unique method to perform efficient voice morphing and conversion based on GMM, which overcomes the traditional-method effects of over-smoothening. It uses a technique of glottal waveform separation and prediction of excitations and hence the result shows that not only over-smoothening is eliminated but also the transformed vocal tract parameters match with the target. Moreover, the synthesized speech thus obtained is found to be of a sufficiently high quality. Thus, voice morphing based on a unique GMM approach has been proposed and also critically evaluated based on various subjective and objective evaluation parameters. Further, an application of voice morphing for Laryngectomees which deploys this unique approach has been recommended by this paper.

References

Abe M. , Nakamura S. , Shikano K. and Kuwabara H. , " Voice conversion through vector quantization" International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 1988, 655-658.
Baudoin G. and Stylianou Y. , " On the transformation of the speech spectrum for voice conversion" International Conference on Spoken Language (ICSLP), Philadephia, October 1996, Vol. 3, 1405-1408.
Kain A. and Macon M. , "Spectral voice conversion for text to speech synthesis " Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1998, Vol. 1, 285-288.
Stylianou Y. and Cappe O. , "A system for voice conversion based on probabilistic classification and a harmonic plus noise model " International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 1998, Seattle, 281-284.
Ye H. and Young S. , "High quality voice morphing ", International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2004, Montreal, Vol. 1, 9-12.
Upperman, G. , "Linear Predictive Coding In Voice Conversion ", December 21, 2004.
Bradbury J. , "Linear Predictive Coding " December 5, 2000.
Gundersen, T. , "Voice Transformation based on Gaussian mixture models, " Master of Science in Communication Technology Thesis, Norwegian University of Science and Technology, Department of Electronics and Telecommunications, 2010, 55.
Cliff M. , "GMM and MINZ Program Libraries for Matlab, " Krannert Graduate School of Management, Purdue University, March 2, 2003.
Scherrer B. , "Gaussian Mixture Model Classifiers, " February 5, 2007.
Resch B. , "Mixtures of Gaussians-A Tutorial for the Course Computational Intelligence," Signal Processing and Speech Communication Laboratory Inffeldgasse 16c, http://www. igi. tugraz. at/lehre/CI, June 2012.
XuN. and Yang Z. , "A Precise Estimation of Vocal Tract Parameters for High Quality Voice Morphing", 9th International Conference on Signal Processing (ICSP), October 2008, 684-687.
Nakamura K. , Toda T. , Nakajima Y. , Saruwatari H. and Shikano K. , "Evaluation of Speaking-Aid System with Voice Conversion for Laryngectomees Toward Its Use in Practical Environments," Interspeech (ISCA), 2008, Brisbane.
Huang X. , Acero A.and Hon H. "Spoken Language Processing: A Guide to Theory, Algorithm and System Development", Prentice Hall, 2001.
Reynolds D. , "Gaussian Mixture Models," Encyclopedia of Biometrics, 2009, 659-663.
Mesbahi L. , Barreaud V, and Boeffard O. ,"GMM-Based Speech Transformation Systems under Data Reduction," Sixth ISCA Workshop on Speech Synthesis, 2007, 119-124.
CMU_ARCTIC Speech Synthesis Databases, Carnegie Mellon University, http://festvox.org/cmu_arctic, March 2012.
Russell M. , "Towards Speech Recognition using Palato-Lingual Contact Patterns for Voice Restoration," PhD Thesis, Faculty of Engineering, University of the Witwatersrand, June 2011.
Dae-Hyeong K. et al. , "Epidermal Electronics," Science, Vol. 333, No. 6044, 12 August 2011, 838-843.

Index Terms

Computer Science

Information Sciences

Keywords

Voice Morphing Laryngectomy Gaussian Mixture Models