Title :
A hybrid music retrieval system using belief networks to integrate multimodal queries and contextual knowledge
Author :
Schuller, Björn ; Zobl, Martin ; Rigoll, Gerhard ; Lang, Manfred
Author_Institution :
Inst. for Human-Computer Commun., Technische Univ. München, Germany
Abstract :
Interest in music retrieval has recently been increasing. Due to the growing amount of music available online and offline and a broadening user spectrum, more efficient query methods are needed. We believe that only a parallel multimodal combination of different input modalities provides the most intuitive way for any user to access the desired media. In this paper we introduce query by humming, speaking, writing, and typing. The strengths of each modality are combined synergistically by soft decision fusion. Songs can be referenced by their melody, artist, title, or other specific information. Furthermore, recognition of the user's current emotion and external contextual knowledge help to build an expectation of the intended song at a given time. This constrains the hypothesis space of possible songs and leads to more robust recognition or even a suggestive query. A combination of artificial neural networks, hidden Markov models, and dynamic time warping, integrated in a Bayesian belief network framework, forms the mathematical background of the chosen hybrid architecture. We address the implementation of a working system and the results achieved by the introduced methods.
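The soft decision fusion and context constraint described in the abstract can be illustrated with a minimal sketch. It is not the authors' Bayesian belief network: it assumes a simple product-rule (naive Bayes style) late fusion, and the function name, song identifiers, and all score values are hypothetical. Each modality is assumed to supply a per-song confidence score, and the contextual knowledge is assumed to be available as a prior over songs.

```python
import math


def fuse_soft_decisions(modality_scores, context_prior, eps=1e-9):
    """Product-rule late fusion (an assumption, not the paper's method).

    posterior(song) is proportional to
    prior(song) * product over modalities of score_m(song).
    Computed in log space; scores missing for a song are floored at eps.
    """
    log_post = {}
    for song, prior in context_prior.items():
        lp = math.log(max(prior, eps))
        for scores in modality_scores.values():
            lp += math.log(max(scores.get(song, 0.0), eps))
        log_post[song] = lp

    # Normalise so the fused scores sum to one.
    peak = max(log_post.values())
    unnorm = {s: math.exp(v - peak) for s, v in log_post.items()}
    total = sum(unnorm.values())
    return {s: v / total for s, v in unnorm.items()}


# Hypothetical per-modality confidences for three candidate songs.
modality_scores = {
    "humming": {"song_a": 0.5, "song_b": 0.4, "song_c": 0.1},
    "speech": {"song_a": 0.3, "song_b": 0.6, "song_c": 0.1},
}

# Hypothetical context prior (e.g. derived from emotion or time of day).
context_prior = {"song_a": 0.5, "song_b": 0.25, "song_c": 0.25}
uniform_prior = {"song_a": 1 / 3, "song_b": 1 / 3, "song_c": 1 / 3}

posterior = fuse_soft_decisions(modality_scores, context_prior)
posterior_no_context = fuse_soft_decisions(modality_scores, uniform_prior)

best = max(posterior, key=posterior.get)
best_no_context = max(posterior_no_context, key=posterior_no_context.get)
```

In this toy setting the two modalities alone favour one candidate, while the context prior shifts the fused decision to another, which is the kind of constraint on the hypothesis space the abstract refers to.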
Keywords :
audio databases; belief networks; hidden Markov models; music; neural nets; query formulation; query processing; Bayesian belief network framework; artificial neural networks; contextual knowledge; dynamic time warping; hidden Markov models; hybrid music retrieval system; multimodal queries; music database; query methods; soft decision fusion; user emotion recognition; Bayesian methods; Context; Databases; Emotion recognition; Hidden Markov models; Multiple signal classification; Music information retrieval; Probability; Robustness; Writing;
Conference_Title :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
DOI :
10.1109/ICME.2003.1220853