Cox, S. J. (2004) Using Context to Correct Phone Recognition Errors. In: 8th International Conference on Spoken Language Processing (Interspeech 2004), 2004-10-04 - 2004-10-08.
Full text not available from this repository. (Request a copy)Abstract
There are many circumstances in which it is useful or necessary to recognise phones rather than words, but phone recognition is inherently less accurate than word recognition. We describe here a post-recognition method for "translating" an errorful phone string output by a speech recogniser into a string that more closely matches the transcription. The technique owes something to Kohonen's idea of "dynamically expanding context" in that it learns from the errors made by the recogniser in a particular context, but it uses many contexts rather than a single context to estimate the "translation" of a recognised phone. The weights given to the different contexts in estimating the translation are determined discriminatively. On the WSJCAM0 database, the technique gives a 19.2% relative improvement in phone errors (including insertions) over the baseline, compared with a 6.2% improvement obtained using dynamically expanding context.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Faculty \ School: | Faculty of Science > School of Computing Sciences |
UEA Research Groups: | Faculty of Science > Research Groups > Interactive Graphics and Audio Faculty of Science > Research Groups > Smart Emerging Technologies |
Depositing User: | Vishal Gautam |
Date Deposited: | 14 Jun 2011 18:15 |
Last Modified: | 22 Apr 2023 02:46 |
URI: | https://ueaeprints.uea.ac.uk/id/eprint/21562 |
DOI: |
Actions (login required)
View Item |