Hierarchical language modeling for audio events detection in a sports game

Huang, Qiang and Cox, Stephen (2010) Hierarchical language modeling for audio events detection in a sports game. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2010-03-14 - 2010-03-19.

Full text not available from this repository.


We investigate the automatic labelling of “events” in an audio recording of a sports game. We describe a technique that uses a hierarchy of language models, comprising a low-level model of acoustic observations and a high-level model of the audio events that occur during a game. These models are integrated using a maximum entropy approach. Our models of the audio events also use duration and voicing information in addition to spectral content, and we show that these features allow further discrimination between events. Results on different tennis games show that these techniques outperform an approach that models neither the dependencies between frames and events nor the additional duration and voicing information.

Item Type: Conference or Workshop Item (Paper)
Faculty \ School: Faculty of Science > School of Computing Sciences
UEA Research Groups: Faculty of Science > Research Groups > Interactive Graphics and Audio
Faculty of Science > Research Groups > Smart Emerging Technologies
Depositing User: Nicola Talbot
Date Deposited: 14 Mar 2011 09:50
Last Modified: 21 Apr 2023 07:33
URI: https://ueaeprints.uea.ac.uk/id/eprint/26048
DOI: 10.1109/ICASSP.2010.5495935
