Finding an Optimal Segmentation for Audio Genre Classification

West, K. and Cox, S. J. (2005) Finding an Optimal Segmentation for Audio Genre Classification. In: 6th International Conference on Music Information Retrieval, 2005-09-11 - 2005-09-15.

Full text not available from this repository. (Request a copy)


In the automatic classification of music many different segmentations of the audio signal have been used to calculate features. These include individual short frames (23 ms), longer frames (200 ms), short sliding textural windows (1 sec) of a stream of 23 ms frames, large fixed windows (10 sec) and whole files. In this work we present an evaluation of these different segmentations, showing that they are sub-optimal for genre classification and introduce the use of an onset detection based segmentation, which appears to outperform all of the fixed and sliding windows segmentation schemes in terms of classification accuracy and model size.

Item Type: Conference or Workshop Item (Paper)
Faculty \ School: Faculty of Science > School of Computing Sciences
UEA Research Groups: Faculty of Science > Research Groups > Interactive Graphics and Audio
Faculty of Science > Research Groups > Smart Emerging Technologies
Related URLs:
Depositing User: Vishal Gautam
Date Deposited: 20 Jul 2011 15:48
Last Modified: 22 Apr 2023 02:45

Actions (login required)

View Item View Item