Title :
Music identification based on music word model
Author :
Wanyi Yang ; Deshun Yang ; Xiaoou Chen ; Haiqian He
Author_Institution :
Inst. of Comput. Sci. & Technol., Peking Univ., Beijing, China
fDate :
June 29 2015-July 3 2015
Abstract :
For music identification, conventional bag of audio words model methods generally compute a histogram for a piece of music, which ignores the temporal characteristic of music and has a negative influence on the accuracy. In addition, they are usually based on DFT spectrogram, which cannot represent music as well as Constant Q (CQ) spectrogram. To address the above problems, we propose a two-layer representation method based on a set of music words for music identification. Firstly, music words are learned from the CQ spectrogram as typical patterns. Then, based on the obtained music words, a piece of music can be represented as word sequence and word histogram. We can reduce the number of possible similar candidates effectively with the histogram similarity measure, and the final result is determined by the sequence similarity measure. Based on the distribution of music word frequency, a low frequency word filter strategy is devised to increase the identification speed, which is essential for large systems such as a million song library. Experiments demonstrate the effectiveness and efficiency of our proposed method.
Keywords :
audio signal processing; music; pattern clustering; CQ spectrogram; constant Q spectrogram; histogram similarity measure; low frequency word filter; music identification; music word frequency; sequence similarity measure; two-layer representation method; word histogram; word sequence; Accuracy; Dictionaries; Histograms; Indexes; Multiple signal classification; Spectrogram; Training; Constant Q transform; Music Word Model; Music identification;
Conference_Titel :
Multimedia and Expo (ICME), 2015 IEEE International Conference on
Conference_Location :
Turin
DOI :
10.1109/ICME.2015.7177492