DocumentCode :
463721
Title :
Music Identification with Weighted Finite-State Transducers
Author :
Weinstein, E. ; Moreno, Pablo
Author_Institution :
Courant Inst. of Mathematical Sci., New York, NY, USA
Volume :
2
fYear :
2007
fDate :
15-20 April 2007
Abstract :
Music identification is the process of matching an audio stream to a particular song. Previous work has relied on hashing, where an exact or almost-exact match between local features of the test and reference recordings is required. In this work we present a new approach to music identification based on finite-state transducers and Gaussian mixture models. We apply an unsupervised training process to learn an inventory of music phone units similar to phonemes in speech. We also learn a unique sequence of music units characterizing each song. We further propose a novel application of transducers for recognition of music phone sequences. Preliminary experiments demonstrate an identification accuracy of 99.5% on a database of over 15,000 songs running faster than real time.
Keywords :
audio signal processing; transducers; Gaussian mixture models; audio stream matching; music identification; music phone units; unsupervised training process; weighted finite-state transducers; Acoustic transducers; Audio recording; Cepstral analysis; Fingerprint recognition; Hidden Markov models; Maximum likelihood decoding; Music; Speech processing; Streaming media; Testing; Music identification; acoustic modeling; finite-state transducers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366329
Filename :
4217502
Link To Document :
بازگشت