Phoneme-to-viseme mappings: the good, the bad, and the ugly

Bear, Helen L.; Harvey, Richard

doi:10.1016/j.specom.2017.07.001

Bear, Helen L. and Harvey, Richard ORCID: https://orcid.org/0000-0001-9925-8316 (2017) Phoneme-to-viseme mappings: the good, the bad, and the ugly. Speech Communication, 95. pp. 40-67. ISSN 0167-6393

PDF (Accepted manuscript) - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Abstract

Visemes are the visual equivalent of phonemes. Although not precisely defined, a working definition of a viseme is “a set of phonemes which have identical appearance on the lips”. Therefore a phoneme falls into one viseme class but a viseme may represent many phonemes: a many-to-one mapping. This mapping introduces ambiguity between phonemes when using viseme classifiers. Not only is this ambiguity damaging to the performance of audio-visual classifiers operating on real expressive speech, but there is also considerable choice between possible mappings. In this paper we explore the issue of this choice of viseme-to-phoneme map. We show that there is a definite difference in performance between viseme-to-phoneme mappings and explore why some maps appear to work better than others. We also devise a new algorithm for constructing phoneme-to-viseme mappings from labeled speech data. These new visemes, ‘Bear’ visemes, are shown to perform better than previously known units.
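
The many-to-one mapping described in the abstract, and the ambiguity it introduces, can be illustrated with a minimal sketch. The mapping below is a toy example invented for illustration: the viseme labels, the phoneme symbols, and the helper names (`to_visemes`, `invert`, `confusable_groups`) are all assumptions, and it is not one of the maps evaluated in the paper nor the ‘Bear’ viseme algorithm.

```python
from collections import defaultdict

# Toy phoneme-to-viseme map (hypothetical; NOT taken from the paper).
# Each phoneme belongs to exactly one viseme class, but a viseme class
# may contain several phonemes: a many-to-one mapping.
PHONEME_TO_VISEME = {
    "p": "V_bilabial", "b": "V_bilabial", "m": "V_bilabial",
    "f": "V_labiodental", "v": "V_labiodental",
    "t": "V_alveolar", "d": "V_alveolar",
    "ae": "V_open",
}

# Hypothetical pronunciations used only to demonstrate the ambiguity.
WORDS = {
    "pat": ["p", "ae", "t"],
    "bat": ["b", "ae", "t"],
    "mat": ["m", "ae", "t"],
    "fat": ["f", "ae", "t"],
}


def to_visemes(phonemes):
    """Translate a phoneme sequence into its viseme sequence."""
    return tuple(PHONEME_TO_VISEME[p] for p in phonemes)


def invert(mapping):
    """Build the one-to-many viseme-to-phoneme view of the same map."""
    inverse = defaultdict(set)
    for phoneme, viseme in mapping.items():
        inverse[viseme].add(phoneme)
    return dict(inverse)


def confusable_groups(words):
    """Group words whose viseme sequences are identical."""
    groups = defaultdict(list)
    for word, phones in words.items():
        groups[to_visemes(phones)].append(word)
    return [sorted(g) for g in groups.values() if len(g) > 1]


if __name__ == "__main__":
    print(invert(PHONEME_TO_VISEME)["V_bilabial"])
    print(confusable_groups(WORDS))
```

Under this toy map, "pat", "bat" and "mat" collapse to the same viseme sequence, so a classifier that outputs only visemes cannot tell them apart without further context, whereas "fat" stays distinct. A different choice of map would merge or separate different words, which is the choice between mappings that the abstract refers to.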

Item Type:	Article
Uncontrolled Keywords:	lipreading, speaker-dependent, viseme, phoneme, resolution, speech recognition, classification, visual speech, visual units
Faculty \ School:	Faculty of Science > School of Computing Sciences
UEA Research Groups:	Faculty of Science > Research Groups > Visual Computing and Signal Processing (former - to 2025); Faculty of Science > Research Groups > Smart Emerging Technologies (former - to 2025)
Related URLs:	http://linkinghub.elsevier.com/retrieve/...
Depositing User:	Pure Connector
Date Deposited:	05 Aug 2017 05:06
Last Modified:	18 Jun 2026 17:34
URI:	https://ueaeprints.uea.ac.uk/id/eprint/64350
DOI:	10.1016/j.specom.2017.07.001
