West, Kris and Cox, Stephen (2010) Incorporating cultural representations of features into audio music similarity estimation. IEEE Transactions on Audio, Speech, and Language Processing, 18 (3). pp. 625-637. ISSN 1558-7916
Full text not available from this repository.
Abstract
We address the problem of estimating automatically from audio signals the similarity between two pieces of music, a technology that has many applications in the online digital music industry. Conventional methods of audio music search use distance measures between features derived from the audio for this task. We describe three techniques that make use of music classifiers to derive representations of audio features that are based on culturally motivated information learned by the classifier. When these representations are used for similarity estimation, they produce very significant reductions in computational complexity over existing techniques (such as those based on the KL-divergence), and also produce metric similarity spaces, which facilitate the use of technologies for the sub-linear scaling of search times. We have evaluated each system using both pseudo-objective techniques and human listeners, and we demonstrate that this efficiency gain is obtained while providing a comparable level of performance when compared with existing techniques.
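
The abstract's point about metric similarity spaces can be illustrated with a minimal sketch. This is not the authors' method: the genre-probability vectors, the track names, the use of Euclidean distance, and the one-dimensional Gaussian models are all assumptions made for illustration only. It shows that a symmetrised KL divergence can violate the triangle inequality, whereas a Euclidean distance between fixed-length classifier output vectors cannot, which is the property that metric indexing structures rely on.

```python
import math

def symmetric_kl_1d(mu1, var1, mu2, var2):
    """Symmetrised KL divergence between two 1-D Gaussians (illustrative only)."""
    kl12 = 0.5 * (var1 / var2 + (mu2 - mu1) ** 2 / var2 - 1.0 + math.log(var2 / var1))
    kl21 = 0.5 * (var2 / var1 + (mu1 - mu2) ** 2 / var1 - 1.0 + math.log(var1 / var2))
    return kl12 + kl21

def euclidean(p, q):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

# Hypothetical per-track Gaussian models (mean, variance) of a single audio feature.
gauss = {"a": (0.0, 1.0), "b": (1.0, 1.0), "c": (2.0, 1.0)}
kl_ab = symmetric_kl_1d(*gauss["a"], *gauss["b"])
kl_bc = symmetric_kl_1d(*gauss["b"], *gauss["c"])
kl_ac = symmetric_kl_1d(*gauss["a"], *gauss["c"])
# The direct "distance" a->c exceeds the detour via b, so this is not a metric.
kl_violates_triangle = kl_ac > kl_ab + kl_bc

# Hypothetical classifier outputs: probabilities over three made-up genre classes.
profiles = {
    "a": [0.7, 0.2, 0.1],
    "b": [0.6, 0.3, 0.1],
    "c": [0.1, 0.1, 0.8],
}
e_ab = euclidean(profiles["a"], profiles["b"])
e_bc = euclidean(profiles["b"], profiles["c"])
e_ac = euclidean(profiles["a"], profiles["c"])
# Euclidean distance always satisfies the triangle inequality.
euclid_satisfies_triangle = e_ac <= e_ab + e_bc + 1e-12

# Rank the other tracks by distance to the query track "a".
ranking = sorted((t for t in profiles if t != "a"),
                 key=lambda t: euclidean(profiles["a"], profiles[t]))
```

Under these assumptions, comparing two tracks costs one pass over a short vector, and the triangle inequality allows an index to skip candidates without computing every pairwise distance.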
Item Type: | Article |
---|---|
Faculty \ School: | Faculty of Science > School of Computing Sciences |
UEA Research Groups: | Faculty of Science > Research Groups > Interactive Graphics and Audio; Faculty of Science > Research Groups > Smart Emerging Technologies |
Depositing User: | Vishal Gautam |
Date Deposited: | 07 Mar 2011 13:14 |
Last Modified: | 21 Apr 2023 07:32 |
URI: | https://ueaeprints.uea.ac.uk/id/eprint/21861 |
DOI: | 10.1109/TASL.2009.2025533 |