Milner, B. P. (2002) A comparison of front-end configurations for robust speech recognition. In: IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2002-05-13 - 2002-05-17.
Abstract
This paper presents a comparative analysis of the processing stages involved in feature extraction for speech recognition. Feature extraction is considered as comprising three processing stages: static feature extraction, normalisation, and inclusion of temporal information. For each stage, techniques are compared both theoretically and in terms of performance. The analysis shows that although some techniques appear significantly different, their effect on the signal can be similar. The comparisons cover MFCC and PLP analysis, RASTA filtering and cepstral mean normalisation, and temporal derivatives and cepstral-time matrices. Experimental results on an unconstrained monophone task compare recognition performance across different front-end configurations.
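As a rough illustration of how two differently formulated techniques named in the abstract can have a similar effect on the signal, the sketch below applies cepstral mean normalisation and a regression-based temporal derivative to a matrix of cepstral frames. It is not taken from the paper. The function names, the regression window `width`, the edge-frame padding, and the random stand-in data are all assumptions made for this example. Both operations cancel a constant per-coefficient offset, which is the form a fixed channel distortion takes in the cepstral domain.

```python
import numpy as np


def cepstral_mean_normalise(cepstra):
    """Subtract each coefficient's mean over time (rows are frames)."""
    return cepstra - cepstra.mean(axis=0, keepdims=True)


def delta(cepstra, width=2):
    """First-order temporal derivative by linear regression over +/- width frames.

    The window width and the edge padding are illustrative assumptions.
    """
    num_frames = cepstra.shape[0]
    # Repeat the edge frames so every frame has a full regression window.
    padded = np.pad(cepstra, ((width, width), (0, 0)), mode="edge")
    denom = 2.0 * sum(n * n for n in range(1, width + 1))
    out = np.zeros_like(cepstra, dtype=float)
    for n in range(1, width + 1):
        ahead = padded[width + n : width + n + num_frames]
        behind = padded[width - n : width - n + num_frames]
        out += n * (ahead - behind)
    return out / denom


# Hypothetical data: 50 frames of 13 cepstral coefficients (random stand-ins,
# not real speech features), plus a constant offset per coefficient.
rng = np.random.default_rng(0)
clean = rng.standard_normal((50, 13))
offset = rng.standard_normal(13)
shifted = clean + offset

# Both front-end stages give the same output with or without the offset.
cmn_matches = np.allclose(cepstral_mean_normalise(clean),
                          cepstral_mean_normalise(shifted))
delta_matches = np.allclose(delta(clean), delta(shifted))
```

Here `cmn_matches` and `delta_matches` both evaluate to `True`: mean subtraction removes the offset explicitly, while the derivative removes it because differences between frames are unaffected by a constant term.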
Item Type: | Conference or Workshop Item (Other) |
---|---|
Faculty \ School: | Faculty of Science > School of Computing Sciences |
UEA Research Groups: | Faculty of Science > Research Groups > Interactive Graphics and Audio; Faculty of Science > Research Groups > Smart Emerging Technologies; Faculty of Science > Research Groups > Data Science and AI |
Depositing User: | EPrints Services |
Date Deposited: | 01 Oct 2010 13:41 |
Last Modified: | 10 Dec 2024 01:15 |
URI: | https://ueaeprints.uea.ac.uk/id/eprint/3089 |
DOI: | 10.1109/ICASSP.2002.5743838 |