High-level approaches to confidence estimation in speech recognition

Cox, S. J. and Dasmahapatra, S. (2002) High-level approaches to confidence estimation in speech recognition. IEEE Transactions on Speech and Audio Processing, 10 (7). pp. 460-471. ISSN 1063-6676

Full text not available from this repository. (Request a copy)


We describe some high-level approaches to estimating confidence scores for the words output by a speech recognizer. By "high-level" we mean that the proposed measures do not rely on decoder specific "side information" and so should find more general applicability than measures that have been developed for specific recognizers. Our main approach is to attempt to decouple the language modeling and acoustic modeling in the recognizer in order to generate independent information from these two sources that can then be used for estimation of confidence. We isolate these two information sources by using a phone recognizer working in parallel with the word recognizer. A set of techniques for estimating confidence measures using the phone recognizer output in conjunction with the word recognizer output is described. The most effective of these techniques is based on the construction of "metamodels," which generate alternative word hypotheses for an utterance. An alternative approach requires no other recognizers or extra information for confidence estimation and is based on the notion that a word that is semantically "distant" from the other decoded words in the utterance is likely to be incorrect. We describe a method for constructing "semantic similarities" between words and hence estimating a confidence. Results using the UK version of the Wall Street Journal are given for each technique.

Item Type: Article
Faculty \ School: Faculty of Science > School of Computing Sciences
UEA Research Groups: Faculty of Science > Research Groups > Interactive Graphics and Audio
Faculty of Science > Research Groups > Smart Emerging Technologies
Depositing User: Vishal Gautam
Date Deposited: 07 Mar 2011 14:10
Last Modified: 04 Jan 2024 02:31
URI: https://ueaeprints.uea.ac.uk/id/eprint/22133
DOI: 10.1109/TSA.2002.804304

Actions (login required)

View Item View Item