Towards speech recogniser assessment using a human reference standard

Cox, Stephen J., Linford, Paul W., Hill, William B. and Johnston, R. Denis (1998) Towards speech recogniser assessment using a human reference standard. Computer Speech and Language, 12 (4). pp. 375-391. ISSN 0885-2308

Full text not available from this repository.


The measurement of the word error rate (WER) of a speech recognizer is valuable for the development of new algorithms but provides only the most limited information about the performance of the recognizer. We propose the use of a human reference standard to assess the performance of speech recognizers, so that the performance of a recognizer could be quoted as being equivalent to the performance of a human hearing speech that is subject to X dB of degradation. This approach should have the major advantage of being independent of the database and speakers used for testing. Furthermore, it would allow factors beyond the word error rate to be measured, such as the performance within an interactive speech system. In this paper, we report on preliminary work to explore the viability of this approach. This has consisted of recording a suitable database for experimentation, devising a method of degrading the speech in a controlled way, and conducting two sets of experiments on listeners to measure their responses to degraded speech in order to establish a reference. Results from these experiments raise several questions about the technique but encourage us to experiment with comparisons with automatic recognizers.

Item Type: Article
Faculty \ School: Faculty of Science > School of Computing Sciences
UEA Research Groups: Faculty of Science > Research Groups > Smart Emerging Technologies
Faculty of Science > Research Groups > Interactive Graphics and Audio
Depositing User: Pure Connector
Date Deposited: 09 Jul 2014 12:00
Last Modified: 04 Mar 2024 16:54
DOI: 10.1006/csla.1998.0109
