DocumentCode :
21142
Title :
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
Author :
Raczynski, Stanislaw A. ; Vincent, Emmanuel ; Sagayama, Shigeki
Author_Institution :
Inria, Rennes, France
Volume :
21
Issue :
9
fYear :
2013
fDate :
Sept. 2013
Firstpage :
1830
Lastpage :
1840
Abstract :
Symbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of analyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch structure. These models are formulated as linear or log-linear interpolations of up to five sub-models, each of which is responsible for modeling a different type of relation. The ability of the models to predict symbolic pitch data is evaluated in terms of their cross-entropy, and of a newly proposed “contextual cross-entropy” measure. Their performance is then measured on synthesized polyphonic audio signals in terms of the accuracy of multiple pitch estimation in combination with a Nonnegative Matrix Factorization-based acoustic model. In both experiments, the log-linear combination of at least one “vertical” (e.g., harmony) and one “horizontal” (e.g., note duration) sub-model outperformed a pitch-dependent Bernoulli prior by more than 60% in relative cross-entropy and 3% in absolute multiple pitch estimation accuracy. This work provides a proof of concept of the usefulness of model interpolation, which may be used for improved symbolic modeling of other aspects of music in the future.
Keywords :
audio signal processing; belief networks; entropy; interpolation; matrix decomposition; music; probability; signal synthesis; contextual cross-entropy measure; dynamic Bayesian networks; horizontal pitch structure; linear interpolations; log-linear interpolations; multiple pitch estimation; musical information analysis; nonnegative matrix factorization-based acoustic model; pitch-dependent Bernoulli prior; probabilistic symbolic polyphonic pitch models; symbolic pitch data prediction; synthesized polyphonic audio signals; vertical pitch structure; Dynamic Bayesian Networks; multipitch analysis; symbolic pitch modeling;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2013.2258012
Filename :
6502207
Link To Document :
بازگشت