Title :
A state space model for online polyphonic audio-score alignment
Author :
Duan, Zhiyao ; Pardo, Bryan
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Northwestern Univ., Evanston, IL, USA
Abstract :
We present a novel online audio-score alignment approach for multi-instrument polyphonic music. This approach uses a 2-dimensional state vector to model the underlying score position and tempo of each time frame of the audio performance. The process model is defined by dynamic equations to transition between states. Two representations of the observed audio frame are proposed, resulting in two observation models: a multi-pitch-based and a chroma-based. Particle filtering is used to infer the hidden states from observations. Experiments on 150 music pieces with polyphony from one to four show the proposed approach outperforms an existing offline global string alignment-based score alignment approach. Results also show that the multi-pitch-based observation model works better than the chroma-based one.
Keywords :
audio signal processing; music; particle filtering (numerical methods); chroma based audio frame; hidden state; multiinstrument polyphonic music; multipitch based audio frame; online audio score alignment; online polyphonic audio-score alignment; particle filtering; state space model; Bayesian methods; Computational modeling; Estimation; Hidden Markov models; Instruments; Mathematical model; Real time systems; Score following; audio-score alignment; hidden Markov model; online algorithm; realtime;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5946374