Title :
A Simple Method to Determine if a Music Information Retrieval System is a “Horse”
Author_Institution :
Audio Anal. Lab., Aalborg Univ. Copenhagen, Copenhagen, Denmark
Abstract :
We propose and demonstrate a simple method to explain the figure of merit (FoM) of a music information retrieval (MIR) system evaluated in a dataset, specifically, whether the FoM comes from the system using characteristics confounded with the “ground truth” of the dataset. Akin to the controlled experiments designed to test the supposed mathematical ability of the famous horse “Clever Hans,” we perform two experiments to show how three state-of-the-art MIR systems produce excellent FoM in spite of not using musical knowledge. This provides avenues for improving MIR systems, as well as their evaluation. We make available a reproducible research package so that others can apply the same method to evaluating other MIR systems.
Keywords :
information retrieval systems; music; Clever Hans; FoM; MIR evaluation; MIR system; dataset ground truth; figure-of-merit; music information retrieval system; musical knowledge; Accuracy; Feature extraction; Multiple signal classification; Semantics; Silicon; Standards; Vocabulary; 2-WORK system performance; 5-CONT content description and annotation; 5-SEAR multimedia search and retrieval;
Journal_Title :
Multimedia, IEEE Transactions on
DOI :
10.1109/TMM.2014.2330697