Title :
Musical query-by-description as a multiclass learning problem
Author :
Whitman, Brian ; Rifkin, R.
Author_Institution :
Music, Mind & Machine Group, MIT Media Lab, Cambridge, MA, USA
Abstract :
We present the query-by-description (QBD) component of "Kandem", a time-aware music retrieval system. The QBD system we describe learns a relation between descriptive text concerning a musical artist and their actual acoustic output, making such queries as "Play me something loud with an electronic beat" possible by merely analyzing the audio content of a database. We show a novel machine learning technique based on regularized least-squares classification (RLSC) that can quickly and efficiently learn the non-linear relation between descriptive language and audio features by treating the problem as a large number of possible output classes linked to the same set or input features. We show how the RLSC training can easily eliminate irrelevant labels.
Keywords :
audio databases; electronic music; learning (artificial intelligence); pattern recognition; query formulation; Kandem; actual acoustic output; audio features; data collection; data representation; descriptive language; machine learning technique; multiclass learning problem; musical QBD; musical artist; nonlinear relation; regularized least-squares classification; time-aware music retrieval system; Artificial intelligence; Audio databases; Biology computing; Design for quality; Digital audio players; Laboratories; Machine learning; Programmable logic arrays; Rails; Spatial databases;
Conference_Titel :
Multimedia Signal Processing, 2002 IEEE Workshop on
Print_ISBN :
0-7803-7713-3
DOI :
10.1109/MMSP.2002.1203270