Shao, X. and Milner, B. P. (2003) Clean speech reconstruction from noisy mel-frequency cepstral coefficients using a sinusoidal model. In: IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2005-03-18 - 2005-03-23.
Full text not available from this repository. (Request a copy)Abstract
This paper extends the technique of speech reconstruction from MFCC by considering the effect of noisy speech. To reconstruct a clean speech signal from noise contaminated MFCC an estimate of the clean mel-filterbank vector is required together with a robust estimate of the pitch. This work applies spectral subtraction to the mel-filterbank vector (derived from noisy MFCC) to provide a clean speech spectral estimate. To obtain a reliable estimate of pitch a robust extraction technique is used. Spectrograms and informal listening tests reveal that a clean speech signal can be successfully reconstructed from the noisy MFCC. Pitch errors are shown to manifest themselves as artificial sounding bursts in the reconstructed speech signal. Incorrect estimates of the spectral envelope introduce periods of noise into the reconstructed speech.
Item Type: | Conference or Workshop Item (Other) |
---|---|
Faculty \ School: | Faculty of Science > School of Computing Sciences |
UEA Research Groups: | Faculty of Science > Research Groups > Interactive Graphics and Audio Faculty of Science > Research Groups > Smart Emerging Technologies Faculty of Science > Research Groups > Data Science and AI |
Depositing User: | EPrints Services |
Date Deposited: | 01 Oct 2010 13:41 |
Last Modified: | 10 Dec 2024 01:15 |
URI: | https://ueaeprints.uea.ac.uk/id/eprint/3051 |
DOI: | 10.1109/ICASSP.2003.1198878 |
Actions (login required)
View Item |