Theobald, Barry J., Bangham, J. Andrew, Matthews, Iain A. and Cawley, Gavin C. ORCID: https://orcid.org/0000-0002-4118-9095 (2002) Towards video realistic synthetic visual speech. In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2002-05-13 - 2002-05-17.
Full text not available from this repository. (Request a copy)Abstract
In this paper we present initial work towards a video-realistic visual speech synthesiser based on statistical models of shape and appearance. A synthesised image sequence corresponding to an utterance is formed by concatenation of synthesis units (in this case phonemes) from a pre-recorded corpus of training data. A smoothing spline is applied to the concatenated parameters to ensure smooth transitions between frames and the resultant parameters applied to the model—early results look promising.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Faculty \ School: | Faculty of Science > School of Computing Sciences |
UEA Research Groups: | Faculty of Science > Research Groups > Interactive Graphics and Audio Faculty of Science > Research Groups > Data Science and Statistics Faculty of Science > Research Groups > Computational Biology Faculty of Science > Research Groups > Centre for Ocean and Atmospheric Sciences |
Depositing User: | Vishal Gautam |
Date Deposited: | 04 Jul 2011 08:49 |
Last Modified: | 22 Apr 2023 02:48 |
URI: | https://ueaeprints.uea.ac.uk/id/eprint/21917 |
DOI: | 10.1109/ICASSP.2002.5745507 |
Actions (login required)
View Item |