Title :
Music Identification with Weighted Finite-State Transducers
Author :
Weinstein, E. ; Moreno, Pablo
Author_Institution :
Courant Inst. of Mathematical Sci., New York, NY, USA
Abstract :
Music identification is the process of matching an audio stream to a particular song. Previous work has relied on hashing, where an exact or almost-exact match between local features of the test and reference recordings is required. In this work we present a new approach to music identification based on finite-state transducers and Gaussian mixture models. We apply an unsupervised training process to learn an inventory of music phone units similar to phonemes in speech. We also learn a unique sequence of music units characterizing each song. We further propose a novel application of transducers for recognition of music phone sequences. Preliminary experiments demonstrate an identification accuracy of 99.5% on a database of over 15,000 songs running faster than real time.
Keywords :
audio signal processing; transducers; Gaussian mixture models; audio stream matching; music identification; music phone units; unsupervised training process; weighted finite-state transducers; Acoustic transducers; Audio recording; Cepstral analysis; Fingerprint recognition; Hidden Markov models; Maximum likelihood decoding; Music; Speech processing; Streaming media; Testing; Music identification; acoustic modeling; finite-state transducers;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.366329