Cepstrum Based Voice Transformation Using ANN

IJCA Proceedings on International Conference in Computational Intelligence (ICCIA2012)
© 2012 by IJCA Journal
iccia - Number 2
Year of Publication: 2012
J.H.Nirmal
Suparva Patnaik
Mukesh Zaveri

J.H.Nirmal, Suparva Patnaik and Mukesh Zaveri. Article: Cepstrum Based Voice Transformation using ANN. IJCA Proceedings on International Conference in Computational Intelligence (ICCIA 2012) ICCIA(2):13-16, March 2012. Full text available. BibTeX

	author = {J.H.Nirmal and Suparva Patnaik and Mukesh Zaveri},
	title = {Article: Cepstrum Based Voice Transformation using ANN},
	journal = {IJCA Proceedings on International Conference in Computational Intelligence (ICCIA 2012)},
	year = {2012},
	volume = {ICCIA},
	number = {2},
	pages = {13-16},
	month = {March},
	note = {Full text available}


The basic goal of a voice conversion system is to mimic the characteristics of the target speaker's voice while keeping the linguistic and paralinguistic information intact. Speaker characteristics are reflected in speech at different levels, such as the vocal tract, excitation, and prosodic parameters. The proposed work is based on the cepstrum, which represents the vocal tract and excitation parameters of speech. This paper proposes decomposing the cepstrum with wavelets and mapping the source cepstrum features to the target cepstrum features using a radial basis function neural network. The results are evaluated using subjective and objective measures based on a voice quality method, and the listening tests show that the proposed algorithm converts speaker individuality while maintaining high speech quality.
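
The abstract does not give implementation details, so the following is only a minimal sketch of the standard real cepstrum (the inverse DFT of the log magnitude spectrum) and of a simple lifter that separates the low-quefrency part, usually associated with the vocal tract envelope, from the high-quefrency part, usually associated with the excitation. The function names `dft`, `real_cepstrum`, and `lifter`, the spectral floor, and the cutoff value are assumptions made for illustration and are not taken from the paper. The wavelet decomposition and the radial basis function network are not covered here.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform; adequate for short frames."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def real_cepstrum(frame, floor=1e-12):
    """Real cepstrum: inverse DFT of the log magnitude spectrum.

    `floor` (an assumed value) guards against log(0) for empty spectral bins.
    """
    spectrum = dft(frame)
    log_mag = [math.log(max(abs(s), floor)) for s in spectrum]
    return [c.real for c in dft(log_mag, inverse=True)]


def lifter(cepstrum, cutoff):
    """Split a cepstrum into low- and high-quefrency parts.

    The low part keeps quefrencies below `cutoff` and their mirror images,
    because the real cepstrum of a real frame is symmetric.
    """
    n = len(cepstrum)
    low = [c if (q < cutoff or q > n - cutoff) else 0.0
           for q, c in enumerate(cepstrum)]
    high = [c - l for c, l in zip(cepstrum, low)]
    return low, high
```

For a single analysis frame, `real_cepstrum(frame)` returns one coefficient per sample, and `lifter(cepstrum, 13)` (the cutoff of 13 is an arbitrary illustrative choice) returns the two parts, which sum back to the original cepstrum. A practical system would use an FFT and windowed frames instead of the naive transform shown here.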

