
Visual Speech Analysis, Application to Arabic Phonemes

IJCA Special Issue on Software Engineering, Databases and Expert Systems
© 2012 by IJCA Journal
SEDEX - Number 2
Year of Publication: 2012
Fatma Zohra Chelali
Khadidja Sadeddine
Amar Djeradi

Fatma Zohra Chelali, Khadidja Sadeddine and Amar Djeradi. Visual Speech Analysis, Application to Arabic Phonemes. IJCA Special Issue on Software Engineering, Databases and Expert Systems SEDEX(2):29-34, September 2012. Full text available.

BibTeX:

	author = {Fatma Zohra Chelali and Khadidja Sadeddine and Amar Djeradi},
	title = {Article: Visual Speech Analysis, Application to Arabic Phonemes},
	journal = {IJCA Special Issue on Software Engineering, Databases and Expert Systems},
	year = {2012},
	volume = {SEDEX},
	number = {2},
	pages = {29-34},
	month = {September},
	note = {Full text available}


The aim of this work is to present primary research on Arabic audiovisual analysis. Each language has multiple phonemes and visemes, and each viseme can correspond to multiple phonemes. The first part focuses on how to classify Arabic visemes from still images, whereas the second part shows the variation of pitch for each viseme. Co-articulation of visemes in context is not taken into account. To evaluate the performance of the proposed method, we collected a large number of visual speech signals from ten Algerian speakers, male and female, recorded at different times while pronouncing 28 Arabic syllables. In our work, we identify 11 final visemes representing the 28 consonantal Arabic phonemes.
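
The abstract does not say how pitch is measured for each viseme. As an illustration only, the sketch below estimates the pitch of one voiced frame with plain autocorrelation, a common textbook approach that is assumed here rather than taken from the paper. The function name `estimate_pitch`, the 60-400 Hz search range, and the synthetic 200 Hz test tone are all hypothetical choices made for this example.

```python
import math


def estimate_pitch(samples, sample_rate, f_min=60.0, f_max=400.0):
    """Estimate the fundamental frequency (Hz) of one voiced frame.

    Illustrative autocorrelation method; not necessarily the paper's method.
    """
    # Remove the mean so a DC offset does not dominate the correlation.
    mean = sum(samples) / len(samples)
    x = [s - mean for s in samples]

    # Search only the lags that correspond to the assumed pitch range.
    min_lag = int(sample_rate / f_max)
    max_lag = min(int(sample_rate / f_min), len(x) - 1)

    best_lag, best_score = 0, 0.0
    for lag in range(min_lag, max_lag + 1):
        # Plain (biased) autocorrelation: longer lags overlap fewer samples,
        # which favors the shortest period over its multiples.
        score = sum(x[i] * x[i + lag] for i in range(len(x) - lag))
        if score > best_score:
            best_lag, best_score = lag, score

    # No positive peak means no periodicity was found (silence or noise).
    return sample_rate / best_lag if best_lag else 0.0


# Synthetic 200 Hz tone standing in for a voiced frame (hypothetical signal).
rate = 8000
frame = [math.sin(2 * math.pi * 200.0 * t / rate) for t in range(800)]
pitch = estimate_pitch(frame, rate)
```

On the synthetic tone the estimate should come out close to 200 Hz. Real recordings would also need framing, a voiced/unvoiced decision, and smoothing across frames before pitch contours could be compared between visemes.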

