Visual Speech Analysis, Application to Arabic Phonemes

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 21 July 2025

Submit your paper

Know more

The week's pick

FORENSIC ANALYSIS FRAMEWORKS FOR ENCRYPTED CLOUD STORAGE INVESTIGATIONS

Joy Awoleye Sarah Mavire Allan Munyira Kelvin Magora

Random Articles

Design of Instruction Service Quality System in Accordance with the Information and Communication Technology Frameworks

March

2016

Novel Notch Detection Algorithm for Detection of Dicrotic Notch in PPG Signals

January

2014

Design and Simulation of OTA using DTMOS Technique in 180 nm CMOS Process

April

2016

A Survey on FM-UWB Transceivers

January

2013

Reseach Article

Visual Speech Analysis, Application to Arabic Phonemes

Published on September 2012 by Fatma Zohra Chelali, Khadidja Sadeddine, Amar Djeradi

Software Engineering, Databases and Expert Systems

Foundation of Computer Science USA

SEDEX - Number 2

September 2012

Authors: Fatma Zohra Chelali, Khadidja Sadeddine, Amar Djeradi

4814582c-1b19-4f53-87be-9f90c08e99b5

Fatma Zohra Chelali, Khadidja Sadeddine, Amar Djeradi . Visual Speech Analysis, Application to Arabic Phonemes. Software Engineering, Databases and Expert Systems. SEDEX, 2 (September 2012), 29-34.

@article{

author = { Fatma Zohra Chelali, Khadidja Sadeddine, Amar Djeradi },

title = { Visual Speech Analysis, Application to Arabic Phonemes },

journal = { Software Engineering, Databases and Expert Systems },

issue_date = { September 2012 },

volume = { SEDEX },

number = { 2 },

month = { September },

year = { 2012 },

issn = 0975-8887,

pages = { 29-34 },

numpages = 6,

url = { /specialissues/sedex/number2/8364-1015/ },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Special Issue Article

%1 Software Engineering, Databases and Expert Systems

%A Fatma Zohra Chelali

%A Khadidja Sadeddine

%A Amar Djeradi

%T Visual Speech Analysis, Application to Arabic Phonemes

%J Software Engineering, Databases and Expert Systems

%@ 0975-8887

%V SEDEX

%N 2

%P 29-34

%D 2012

%I International Journal of Computer Applications

Abstract

The aim of this work is to introduce a primary research on Arabic audiovisual analysis. Each language has multiple phonemes and visemes and each viseme can have multiple phonemes. The first part focuses on how to classify Arabic visemes from still images, whereas the second part shows the variation of Pitch for each viseme. We haven't taken co-articulation of visemes in context. To evaluate the performance of the proposed method, we collected a large number of speech visual signal of ten Algerian speakers male and female at different moments pronouncing 28 Arabic syllabuses. In our work, we demonstrate 11 final visemes representing the 28 consonantal Arabic phonemes.

References

Waters,keith and Levergood, Thomas M, DECface: An automatic lip-synchronization algorithm for synthetic faces, In Technical report series, CRL 93/4, Digital Equipment Corporation, Cambridge Research Lab, September 23, 1993 .
Breen, Dr. A. P, Bowers Ms. E. and Welsh Dr. W, An Investigation into the generation of mouth shapes for a talking head.
Möttönen, Riikka Olivés Jean-Luc, Kulja Janne and Sams, Mikko, Parameterized visual speech synthesis and its evaluation.
Tiddeman, Bernard and Perret, David, Prototyping and transforming visemes for animated speech,
Abdul Rafay Abbasi and Naveed Ahmad, Urdu Viseme Identification", pp68-71.
Tony Ezzat and Tomaso Poggio, Visual Speech Synthesis by Morphing Visemes, In A. I. Memo No. 1658 C. B. C, L, Paper No. 173, Artificial Intelligence ,Laboratory, M. I. T, May 1999.
Michael M. Cohen and Dominic W Massaro, Modeling Coarticulation in synthetic visual speech.
Riikka Möttönen, Jean-Luc Olivés, Janne Kulju, and Mikko Sams, Parameterized visual speech synthesis and its evaluation, Eusipco 2000 X, European Signal Processing Conference,September 4-8, 2000Tampere, Finland.
Hassan Satori, Hussein Hiyassat, Mostafa Harti, and Noureddine Chenfour, Investigation Arabic Speech Recognition Using CMU Sphinx System",The International Arab Journal of Information Technology, Vol. 6, No. 2, April 2009.
Rabiner, L. and B. Juang , Fundamentals of Speech Recognition, Englewood Cliffs, N. J. : Prentice Hall,1993.
WANG Anhong, BAO Huaiqiao, CHEN, Primary research on the viseme system in Standard Chinese,
C. Benoît, T. Lallouache, T. Mohamadi and C. Abry, A set of French visemes for visual speech synthesis, In Talking Machines: Theories Models and Designs, G Bailly and C. Benoît, Editors. Elsevier B. V. p. 485-501.
L. Revéret, Conception et évaluation d'un système de suivi automatique des gestes labiaux en parole, docteur de l'institut national polytechnique de Grenoble,thése préparée au sein de l'institut de la communication parlée.
Salah Werda, Walid Mahdi and Abdelmajid Ben Hamadou, Lip Localization and Viseme Classification for Visual Speech Recognition, International Journal of Computing & Information Sciences Vol. 5, No. 1, April 2007.
Gerasimos Potamianos, Chalapathy Neti, "Audio-Visual Automatic Speech Recognition": An Overview, Chapter to appear in: Issues in Visual and Audio-Visual Speech Processing, G. Bailly, E. Vatikiotis-Bateson, and P. Perrier, Eds. , MIT Press, 2004.
Fatma zohra CHELALI , Amar DJERADI, " Primary research on Arabic visemes, Analysis in space and frequency domain", International Journal of Mobile Computing and Multimedia Communication (IJMCMC), published by IGI Global, USA,pp 1-19, DOI: 10. 4018/IJMCMC, ISSN: 1937-9412, EISSN: 1937-9404, vol. 3 , N°4,2011.
Sergios Theodoridis and Konstantinos koutroumbas. (2003). book pattern recognition, second edition, Elsevier (USA).
Herve Abdi,"Neural networks",Program in Cognition and neurosciences,MS:Gr. 4. 1,The university of Texas at Dallas.
McGurck et J. Mcdonald. "Hearing lips and seeing voice". Nature, 264 : 746-748, Decb 1976.
Naotoshi Seo sonots. (2008). Pitch Detection. (report ENEE632 Project4 Part I). March 24, 2008.

Index Terms

Computer Science

Information Sciences

Keywords

Arabic Visemes Speech Recognition Audiovisual Analysis Pitch