Title :
Mobile music modeling, analysis and recognition
Author :
Golik, Pavel ; Harb, Boulos ; Misra, Ananya ; Riley, Michael ; Rudnick, Alex ; Weinstein, Eugene
Author_Institution :
Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
Abstract :
We present an analysis of music modeling and recognition techniques in the context of mobile music matching, substantially improving on the techniques presented in [1]. We accomplish this by adapting the features specifically to this task, and by introducing new modeling techniques that enable using a corpus of noisy and channel-distorted data to improve mobile music recognition quality. We report the results of an extensive empirical investigation of the system´s robustness under realistic channel effects and distortions. We show an improvement of recognition accuracy by explicit duration modeling of music phonemes and by integrating the expected noise environment into the training process. Finally, we propose the use of frame-to-phoneme alignment for high-level structure analysis of polyphonic music.
Keywords :
acoustic signal processing; audio signal processing; information retrieval; mobile computing; music; speech recognition; channel-distorted data; duration modeling; frame-to-phoneme alignment; high-level structure analysis; mobile music analysis; mobile music matching; mobile music modeling; mobile music recognition quality improvement; music information retrieval; music phonemes; noisy data; polyphonic music; recognition accuracy improvement; Accuracy; Hidden Markov models; Music; Speech recognition; Training; USA Councils; Music; modeling; music information retrieval; signal analysis;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288387