Soft Decoding of Temporal Derivatives for Robust Distributed Speech Recognition in Packet Loss

James, A. B. and Milner, B. P. (2005) Soft Decoding of Temporal Derivatives for Robust Distributed Speech Recognition in Packet Loss. In: IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2005-03-18 - 2005-03-23.

Full text not available from this repository.

Abstract

The aim of this work is to improve distributed speech recognition accuracy in packet loss by considering the effect of loss on the temporal derivatives of the feature vector. Analysis of the temporal derivatives reveals that they suffer severe distortion when static vectors are lost. Missing feature theory and soft-decoding techniques are considered as ways of compensating for packet loss at the decoding stage of recognition. An extension to these methods is developed that treats the static, velocity and acceleration components separately. A series of confidence measures for the temporal derivatives is devised and applied within the soft-decoding framework. Experimental results on both a connected digit task and a large vocabulary task demonstrate significant increases in recognition accuracy under a range of packet loss conditions.
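
The sketch below illustrates why the loss of a single static vector distorts several velocity and acceleration vectors. It is an illustration only, not the authors' implementation. It assumes a common regression formula for temporal derivatives over a window of two frames on either side, and it assumes, purely for demonstration, that a lost static vector is replaced by zeros; the abstract specifies neither. The names delta and distorted_frames are hypothetical.

```python
import numpy as np

def delta(features, window=2):
    """Regression-based temporal derivative over +/- `window` frames.

    Assumed formula (not taken from the abstract):
    d_t = sum_k k * (c_{t+k} - c_{t-k}) / (2 * sum_k k^2)
    """
    features = np.asarray(features, dtype=float)
    num_frames = features.shape[0]
    # Repeat the first and last frames so every frame has a full window.
    padded = np.pad(features, ((window, window), (0, 0)), mode="edge")
    denom = 2.0 * sum(k * k for k in range(1, window + 1))
    out = np.zeros_like(features)
    for k in range(1, window + 1):
        ahead = padded[window + k : window + k + num_frames]
        behind = padded[window - k : window - k + num_frames]
        out += k * (ahead - behind)
    return out / denom

def distorted_frames(clean, corrupted):
    """Return the indices of frames whose values differ between two streams."""
    differs = ~np.isclose(clean, corrupted)
    return np.flatnonzero(differs.any(axis=1)).tolist()

# Toy static stream: 12 frames of a 1-dimensional ramp.
static = np.arange(12, dtype=float).reshape(-1, 1)

# Simulate the loss of frame 6 (zero substitution is an assumption).
received = static.copy()
received[6] = 0.0

# Velocity is the derivative of the statics; acceleration is the
# derivative of the velocity.
velocity_hit = distorted_frames(delta(static), delta(received))
accel_hit = distorted_frames(delta(delta(static)), delta(delta(received)))

print("velocity frames distorted:", velocity_hit)
print("acceleration frames distorted:", accel_hit)
```

In this toy case one lost static frame distorts four velocity frames (4, 5, 7 and 8) and nine acceleration frames (2 to 10). This spreading of the damage is the reason for treating the static, velocity and acceleration components separately.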

Item Type: Conference or Workshop Item (Paper)
Faculty \ School: Faculty of Science > School of Computing Sciences
UEA Research Groups: Faculty of Science > Research Groups > Interactive Graphics and Audio
Faculty of Science > Research Groups > Smart Emerging Technologies
Faculty of Science > Research Groups > Data Science and AI
Depositing User: Vishal Gautam
Date Deposited: 14 Jun 2011 11:40
Last Modified: 10 Dec 2024 01:15
URI: https://ueaeprints.uea.ac.uk/id/eprint/21966
DOI: 10.1109/ICASSP.2005.1415121
