2.5D Visual Speech Synthesis Using Appearance Models

Theobald, B, Cawley, GC, Glauert, JRW, Abider, JA and Matthews, I (2003) 2.5D Visual Speech Synthesis Using Appearance Models. In: British Machine Vision Conference, 2005-09-05 - 2005-09-08.

Full text not available from this repository. (Request a copy)


Two dimensional (2D) shape and appearance models are applied to the problem of creating a near-videorealistic talking head. A speech corpus of a talker uttering a set of phonetically balanced training sentences is analysed using a generative model of the human face. Segments of original parameter trajectories, corresponding to the synthesis unit (e.g.~triphone), are extracted from a codebook, then normalised, blended, concatenated and smoothed before being applied to the model to give natural, realistic animations of novel utterances. The system provides a 2D image sequence corresponding to the face of a talker. It is also used to animate the face of a 3D avatar by displacing the mesh according to movements of points in the shape model and dynamically texturing the face polygons using the appearance model.

Item Type: Conference or Workshop Item (Paper)
Faculty \ School: Faculty of Science > School of Computing Sciences
Faculty of Arts and Humanities > School of Political, Social and International Studies

University of East Anglia > Faculty of Science > Research Groups > Computational Biology (subgroups are shown below) > Machine learning in computational biology
Related URLs:
Depositing User: Vishal Gautam
Date Deposited: 23 Jul 2011 19:45
Last Modified: 07 Sep 2021 23:51
URI: https://ueaeprints.uea.ac.uk/id/eprint/21914

Actions (login required)

View Item View Item