CFP last date
20 May 2024
Reseach Article

Duration Modeling in Hindi

by Somnath Roy, Nishant Sinha
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 97 - Number 6
Year of Publication: 2014
Authors: Somnath Roy, Nishant Sinha
10.5120/17015-7296

Somnath Roy, Nishant Sinha . Duration Modeling in Hindi. International Journal of Computer Applications. 97, 6 ( July 2014), 42-46. DOI=10.5120/17015-7296

@article{ 10.5120/17015-7296,
author = { Somnath Roy, Nishant Sinha },
title = { Duration Modeling in Hindi },
journal = { International Journal of Computer Applications },
issue_date = { July 2014 },
volume = { 97 },
number = { 6 },
month = { July },
year = { 2014 },
issn = { 0975-8887 },
pages = { 42-46 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume97/number6/17015-7296/ },
doi = { 10.5120/17015-7296 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:23:26.521712+05:30
%A Somnath Roy
%A Nishant Sinha
%T Duration Modeling in Hindi
%J International Journal of Computer Applications
%@ 0975-8887
%V 97
%N 6
%P 42-46
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Duration is one of the important cues for finding the prosodic variation in human speech and other cues are pitch(F0), amplitude(Intensity) and pauses. Two important concept of duration modeling is in play nowadays, 1-segmental duration modeling, 2- syllable duration modeling. Since speech is a complex continuous signal, hence finding the boundary of segments and syllables is a manual intensive work and prone to error. As per our observation, it is relatively more error prone to find the segment boundary than the syllable boundary. This paper presents the effect of duration both at the level of syllable and segment and its distinctive role in finding the prosodic variation at the sentence level. The findings could be adopted as a model for implementing the prosodic variation in text to speech synthesis(TTS) and automatic speech recognition(ASR). The approach for finding the the model is purely based on the statistical inference derived from the duration values with respect to other cues extracted from the recorded speech data. .

References
  1. Roy, Somnath (2014). Conference Proceedings of Computational Science and Computational Intelligence, Las Vegas, USA, 10 March-13 March 2014, IEEE. .
  2. Dennis H Klaat, (1979). Synthesis by rule of segmental durations in English sentences, In B. Lindblom and S. Ohman, Editors, Frontiers of Speech Communication Research, pp 287-300, American Press, New York.
  3. Anderson, M. , Pierrehumbert, J. & Liberman, M. Y. (1984). Synthesis by rule of English intonation patterns. IEEE Congress on Acoustics, Speech, and Signal Processing: pp 77-80.
  4. Ashwin Bellur, K Badri Narayan, Raghava Krishnan K, Hema A Murthy. Prosody Modeling for Syllable-Based Concatenative Speech Synthesis of Hindi and Tamil. IEEE 2011.
  5. Jonathan Allen, M. S. Hunnicut, D. H. Klatt (1987). From Text to Speech: The MIT Talk System. Cambridge University Press,Cambridge.
  6. Bagshaw Christopher Paul. (1994). Automatic prosodic analysis for computer aided pronunciation teaching, Ph. d thesis.
  7. Black & Kominek. (2009). Optimizing Segment Label Boundaries for Statistical Speech Synthesis. IEEE: pp. 3785-
  8. Black W Alan, Hunt J Andrew. (1996). Unit Selection Synthesis in a Concatenative Speech Synthesis Using a Large Speech Database. IEEE: pp. 373-376.
  9. Roy Somnath (2014), A Technical Guide to Concatenative Speech Synthesis for Hindi using Festival. International Journal of Computer Applications, Vol. 86, pp-30-34.
  10. Pandey Pramod, Roy Somnath, D. Kumar , M. Mahesh. Inconsistencies in the Pronunciation of Hindi for a Pronunciation Lexicon, unpublished.
  11. Chomsky & Halle (1968). The sound pattern of English. New York: Harper & Row Publishers.
  12. Cutler, A. , Dahan D. & Donselaar. (1997),Prosody in the comprehension of spoken language: A literature review, Language and Speech,141-201.
  13. Singh Rajendra, Agnihotri R. K (1997). Hindi Morphology: A Word based Description. Motilal Banarsidas Publishers, New Delhi.
  14. Kachru Yamuna (1987). Hindi . John Benjamin Publishing Company. London. Vol-12. ISBN: 1382-3485.
  15. Roy Somnath(2013). Statistical Approach to Prosodic Modeling in Speech Synthesis, Phd. Synopsis. Jawaharlal Nehru University, New Delhi. Unpublished.
  16. Pandey Pramod (2007). Orthography-Phonology Interface in Devnagri for Hindi. Written Language & Processing. ISSN:1387-6732. 227-236, 1989.
Index Terms

Computer Science
Information Sciences

Keywords

Duration Modeling Prosody Hindi Hindi speech synthesis