Title :
Multimodal structure segmentation and analysis of music using audio and textual information
Author :
Cheng, Heng-Tze ; Yang, Yi-Hsuan ; Lin, Yu-Ching ; Chen, Homer H.
Author_Institution :
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
In this paper, we present a multimodal approach to structure segmentation of music with applications to audio content analysis and music information retrieval. In particular, since lyrics contain rich information about the semantic structure of a song, our approach incorporates lyrics to overcome the existing difficulties associated with large acoustic variation in music. We further design a constrained clustering algorithm for music segmentation and evaluate its performance on commercial recordings. Experimental results show that our method can effectively detect the boundaries and the types of semantic structure of music segments.
Keywords :
audio signal processing; information retrieval; music; pattern clustering; text analysis; acoustic variation; audio content analysis; audio information analysis; constrained clustering algorithm; lyrics processing; multimodal structure segmentation; music information retrieval; semantic structure labeling; textual information analysis; Acoustic signal detection; Algorithm design and analysis; Bridges; Clustering algorithms; Content based retrieval; Contracts; Information analysis; Labeling; Multiple signal classification; Music information retrieval; Music; lyrics; music information; retrieval; segmentation;
Conference_Titel :
Circuits and Systems, 2009. ISCAS 2009. IEEE International Symposium on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-3827-3
Electronic_ISBN :
978-1-4244-3828-0
DOI :
10.1109/ISCAS.2009.5118096