DocumentCode
2149749
Title
A state space model for online polyphonic audio-score alignment
Author
Duan, Zhiyao ; Pardo, Bryan
Author_Institution
Dept. of Electr. Eng. & Comput. Sci., Northwestern Univ., Evanston, IL, USA
fYear
2011
fDate
22-27 May 2011
Firstpage
197
Lastpage
200
Abstract
We present a novel online audio-score alignment approach for multi-instrument polyphonic music. This approach uses a 2-dimensional state vector to model the underlying score position and tempo of each time frame of the audio performance. The process model is defined by dynamic equations to transition between states. Two representations of the observed audio frame are proposed, resulting in two observation models: a multi-pitch-based and a chroma-based. Particle filtering is used to infer the hidden states from observations. Experiments on 150 music pieces with polyphony from one to four show the proposed approach outperforms an existing offline global string alignment-based score alignment approach. Results also show that the multi-pitch-based observation model works better than the chroma-based one.
Keywords
audio signal processing; music; particle filtering (numerical methods); chroma based audio frame; hidden state; multiinstrument polyphonic music; multipitch based audio frame; online audio score alignment; online polyphonic audio-score alignment; particle filtering; state space model; Bayesian methods; Computational modeling; Estimation; Hidden Markov models; Instruments; Mathematical model; Real time systems; Score following; audio-score alignment; hidden Markov model; online algorithm; realtime;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5946374
Filename
5946374
Link To Document