CFP last date
22 April 2024
Reseach Article

Visual Speech Analysis, Application to Arabic Phonemes

Published on September 2012 by Fatma Zohra Chelali, Khadidja Sadeddine, Amar Djeradi
Software Engineering, Databases and Expert Systems
Foundation of Computer Science USA
SEDEX - Number 2
September 2012
Authors: Fatma Zohra Chelali, Khadidja Sadeddine, Amar Djeradi
4814582c-1b19-4f53-87be-9f90c08e99b5

Fatma Zohra Chelali, Khadidja Sadeddine, Amar Djeradi . Visual Speech Analysis, Application to Arabic Phonemes. Software Engineering, Databases and Expert Systems. SEDEX, 2 (September 2012), 29-34.

@article{
author = { Fatma Zohra Chelali, Khadidja Sadeddine, Amar Djeradi },
title = { Visual Speech Analysis, Application to Arabic Phonemes },
journal = { Software Engineering, Databases and Expert Systems },
issue_date = { September 2012 },
volume = { SEDEX },
number = { 2 },
month = { September },
year = { 2012 },
issn = 0975-8887,
pages = { 29-34 },
numpages = 6,
url = { /specialissues/sedex/number2/8364-1015/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Special Issue Article
%1 Software Engineering, Databases and Expert Systems
%A Fatma Zohra Chelali
%A Khadidja Sadeddine
%A Amar Djeradi
%T Visual Speech Analysis, Application to Arabic Phonemes
%J Software Engineering, Databases and Expert Systems
%@ 0975-8887
%V SEDEX
%N 2
%P 29-34
%D 2012
%I International Journal of Computer Applications
Abstract

The aim of this work is to introduce a primary research on Arabic audiovisual analysis. Each language has multiple phonemes and visemes and each viseme can have multiple phonemes. The first part focuses on how to classify Arabic visemes from still images, whereas the second part shows the variation of Pitch for each viseme. We haven't taken co-articulation of visemes in context. To evaluate the performance of the proposed method, we collected a large number of speech visual signal of ten Algerian speakers male and female at different moments pronouncing 28 Arabic syllabuses. In our work, we demonstrate 11 final visemes representing the 28 consonantal Arabic phonemes.

References
  1. Waters,keith and Levergood, Thomas M, DECface: An automatic lip-synchronization algorithm for synthetic faces, In Technical report series, CRL 93/4, Digital Equipment Corporation, Cambridge Research Lab, September 23, 1993 .
  2. Breen, Dr. A. P, Bowers Ms. E. and Welsh Dr. W, An Investigation into the generation of mouth shapes for a talking head.
  3. Möttönen, Riikka Olivés Jean-Luc, Kulja Janne and Sams, Mikko, Parameterized visual speech synthesis and its evaluation.
  4. Tiddeman, Bernard and Perret, David, Prototyping and transforming visemes for animated speech,
  5. Abdul Rafay Abbasi and Naveed Ahmad, Urdu Viseme Identification", pp68-71.
  6. Tony Ezzat and Tomaso Poggio, Visual Speech Synthesis by Morphing Visemes, In A. I. Memo No. 1658 C. B. C, L, Paper No. 173, Artificial Intelligence ,Laboratory, M. I. T, May 1999.
  7. Michael M. Cohen and Dominic W Massaro, Modeling Coarticulation in synthetic visual speech.
  8. Riikka Möttönen, Jean-Luc Olivés, Janne Kulju, and Mikko Sams, Parameterized visual speech synthesis and its evaluation, Eusipco 2000 X, European Signal Processing Conference,September 4-8, 2000Tampere, Finland.
  9. Hassan Satori, Hussein Hiyassat, Mostafa Harti, and Noureddine Chenfour, Investigation Arabic Speech Recognition Using CMU Sphinx System",The International Arab Journal of Information Technology, Vol. 6, No. 2, April 2009.
  10. Rabiner, L. and B. Juang , Fundamentals of Speech Recognition, Englewood Cliffs, N. J. : Prentice Hall,1993.
  11. WANG Anhong, BAO Huaiqiao, CHEN, Primary research on the viseme system in Standard Chinese,
  12. C. Benoît, T. Lallouache, T. Mohamadi and C. Abry, A set of French visemes for visual speech synthesis, In Talking Machines: Theories Models and Designs, G Bailly and C. Benoît, Editors. Elsevier B. V. p. 485-501.
  13. L. Revéret, Conception et évaluation d'un système de suivi automatique des gestes labiaux en parole, docteur de l'institut national polytechnique de Grenoble,thése préparée au sein de l'institut de la communication parlée.
  14. Salah Werda, Walid Mahdi and Abdelmajid Ben Hamadou, Lip Localization and Viseme Classification for Visual Speech Recognition, International Journal of Computing & Information Sciences Vol. 5, No. 1, April 2007.
  15. Gerasimos Potamianos, Chalapathy Neti, "Audio-Visual Automatic Speech Recognition": An Overview, Chapter to appear in: Issues in Visual and Audio-Visual Speech Processing, G. Bailly, E. Vatikiotis-Bateson, and P. Perrier, Eds. , MIT Press, 2004.
  16. Fatma zohra CHELALI , Amar DJERADI, " Primary research on Arabic visemes, Analysis in space and frequency domain", International Journal of Mobile Computing and Multimedia Communication (IJMCMC), published by IGI Global, USA,pp 1-19, DOI: 10. 4018/IJMCMC, ISSN: 1937-9412, EISSN: 1937-9404, vol. 3 , N°4,2011.
  17. Sergios Theodoridis and Konstantinos koutroumbas. (2003). book pattern recognition, second edition, Elsevier (USA).
  18. Herve Abdi,"Neural networks",Program in Cognition and neurosciences,MS:Gr. 4. 1,The university of Texas at Dallas.
  19. McGurck et J. Mcdonald. "Hearing lips and seeing voice". Nature, 264 : 746-748, Decb 1976.
  20. Naotoshi Seo sonots. (2008). Pitch Detection. (report ENEE632 Project4 Part I). March 24, 2008.
Index Terms

Computer Science
Information Sciences

Keywords

Arabic Visemes Speech Recognition Audiovisual Analysis Pitch