Milner, Ben and Darch, Jonathan (2010) Robust acoustic speech feature prediction from noisy mel-frequency cepstral coefficients. IEEE Transactions on Audio, Speech, and Language Processing, 19 (2). pp. 338-347. ISSN 1558-7916
Full text not available from this repository.

Abstract
This paper examines the effect of applying noise compensation to acoustic speech feature prediction from noisy mel-frequency cepstral coefficient (MFCC) vectors within a distributed speech recognition architecture. An acoustic speech feature (comprising fundamental frequency, formant frequencies, speech/nonspeech classification, and voicing classification) is predicted from an MFCC vector in a maximum a posteriori (MAP) framework using phoneme-specific or global models of speech. The effect of noise is considered, and three different noise compensation methods that have been successful in robust speech recognition are integrated within the MAP framework. Experiments show that noise compensation can be applied successfully to prediction, with the best performance given by a model adaptation method that performs only slightly worse than matched training and testing. Further experiments consider application of the predicted acoustic features to speech reconstruction. A series of human listening tests shows that the predicted features are sufficient for speech reconstruction and that noise compensation improves speech quality in noisy conditions.
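
As a rough illustration of the MAP prediction step described in the abstract, the sketch below predicts a single scalar acoustic feature (for example, fundamental frequency) from a feature vector. It assumes one joint Gaussian over the vector and the feature; for that model the posterior is Gaussian, so the MAP estimate equals the conditional mean. The single-Gaussian model, the function names `solve` and `map_predict`, and the toy numbers are all assumptions made for illustration; the record does not give the form of the paper's phoneme-specific or global models, and no noise compensation is included here.

```python
def solve(a, b):
    """Solve a @ w = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [a | b], copied so the inputs are not modified.
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution.
    w = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * w[j] for j in range(i + 1, n))
        w[i] = (m[i][n] - s) / m[i][i]
    return w


def map_predict(x, mu_x, mu_f, cov_xx, cov_fx):
    """MAP estimate of a scalar feature f given vector x (hypothetical helper).

    Assumes (x, f) is jointly Gaussian, so the estimate is
    mu_f + cov_fx^T cov_xx^{-1} (x - mu_x).
    """
    diff = [xi - mi for xi, mi in zip(x, mu_x)]
    w = solve(cov_xx, diff)
    return mu_f + sum(c * wi for c, wi in zip(cov_fx, w))


# Toy numbers, not taken from the paper.
f_hat = map_predict(
    x=[2.0, 2.0],
    mu_x=[0.0, 0.0],
    mu_f=100.0,
    cov_xx=[[2.0, 0.0], [0.0, 1.0]],
    cov_fx=[1.0, 0.5],
)
print(f_hat)
```

With a mixture of such Gaussians, one common extension is to weight the per-component conditional means by the posterior probability of each component given the vector; whether the paper does this is not stated in this record.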
Item Type: | Article |
---|---|
Faculty / School: | Faculty of Science > School of Computing Sciences |
UEA Research Groups: | Faculty of Science > Research Groups > Interactive Graphics and Audio; Faculty of Science > Research Groups > Smart Emerging Technologies; Faculty of Science > Research Groups > Data Science and AI |
Depositing User: | Vishal Gautam |
Date Deposited: | 07 Mar 2011 13:17 |
Last Modified: | 10 Dec 2024 01:18 |
URI: | https://ueaeprints.uea.ac.uk/id/eprint/22034 |
DOI: | 10.1109/TASL.2010.2047811 |