Title :
Learning to segment songs with ordinal linear discriminant analysis
Author :
McFee, Brian ; Ellis, Daniel P. W.
Abstract :
This paper describes a supervised learning algorithm which optimizes a feature representation for temporally constrained clustering. The proposed method is applied to music segmentation, in which a song is partitioned into functional or locally homogeneous segments (e.g., verse or chorus). To facilitate abstraction over multiple training examples, we develop a latent structural repetition feature, which summarizes the repetitive structure of a song of any length in a fixed-dimensional representation. Experimental results demonstrate that the proposed method efficiently integrates heterogeneous features, and improves segmentation accuracy.
Keywords :
acoustic signal processing; learning (artificial intelligence); music; time series; feature representation; latent structural repetition feature; music segmentation; ordinal linear discriminant analysis; supervised learning algorithm; temporally constrained clustering; Accuracy; Clustering algorithms; Feature extraction; Speech; Timbre; Music; automatic segmentation; learning;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854594