Gómez, A. M., Peinado, A. M., Sánchez, V., Milner, B. P. and Rubio, A. J. (2004) Statistical-based reconstruction methods for speech recognition in IP networks. In: COST278 and ISCA Tutorial and Research Workshop (ITRW) on Robustness Issues in Conversational Interaction (Robust2004), 2004-08-30 - 2004-08-31, University of East Anglia.
Full text not available from this repository. (Request a copy)Abstract
This work shows the performance of statistical-based reconstruction techniques when a burst-like packet loss network is used to transmit speech feature vectors on a DSR architecture. Two different approaches to exploit prior information about the speech are outlined. The first models the sequence of quantized vectors through transition probabilities to make estimations based on data-source information, while the second uses prior knowledge of the means and covariances of the feature vector stream to make a maximum a-posteriori (MAP) estimate of lost vectors. These methods provide better results than those obtained by the AURORA nearest repetition, especially in the presence of bursts of losses. However, they require either a notable amount of memory or a high time complexity. Therefore, a novel solution based on the previous methods is proposed and evaluated.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Faculty \ School: | Faculty of Science > School of Computing Sciences |
UEA Research Groups: | Faculty of Science > Research Groups > Interactive Graphics and Audio Faculty of Science > Research Groups > Smart Emerging Technologies Faculty of Science > Research Groups > Data Science and AI |
Related URLs: | |
Depositing User: | Vishal Gautam |
Date Deposited: | 20 Jul 2011 21:16 |
Last Modified: | 10 Dec 2024 01:15 |
URI: | https://ueaeprints.uea.ac.uk/id/eprint/23280 |
DOI: |
Actions (login required)
View Item |