Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling - Publication - Bridge of Knowledge

Search

Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling

Abstract

Symbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of an- alyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch struc- ture. These models are formulated as linear or log-linear interpo- lations of up to fi ve sub-models, each of which is responsible for modeling a different type of relation. The ability of the models to predict symbolic pitch data is evaluated in terms of their cross-en- tropy, and of a newly proposed “contextual cross-entropy” mea- sure. Their performance is then m easuredonsynthesizedpoly- phonic audio signals in terms of the accuracy of multiple pitch estimation in combination with a Nonnegative Matrix Factoriza- tion-based acoustic model. In both experiments, the log-linear com- bination of at least one “vertical” (e.g., harmony) and one “hori- zontal” (e.g., note duration) sub-model outperformed a pitch-de- pendent Bernoulli prior by more than 60% in relative cross-en- tropy and 3% in absolute multiple pitch estimation accuracy. This work provides a proof of concept of the usefulness of model inter- polation, which may be used for improved symbolic modeling of other aspects of music in the future.

Citations

  • 1 9

    CrossRef

  • 0

    Web of Science

  • 2 9

    Scopus

Authors (3)

  • Photo of dr inż. Stanisław Raczyński

    Stanisław Raczyński dr inż.

    • Inria, Francja .
  • Photo of  Emmanuel Vincent

    Emmanuel Vincent

    • Inria, Francja .
  • Photo of  Shigeki Sagayama

    Shigeki Sagayama

    • University of Tokyo, Japonia .

Cite as

Full text

full text is not available in portal

Keywords

Details

Category:
Articles
Type:
artykuł w czasopiśmie wyróżnionym w JCR
Published in:
IEEE Transactions on Audio Speech and Language Processing no. 21, pages 1830 - 1840,
ISSN: 1558-7916
Language:
English
Publication year:
2013
Bibliographic description:
Raczyński S., Vincent E., Sagayama S.: Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling// IEEE Transactions on Audio Speech and Language Processing. -Vol. 21, nr. 9 (2013), s.1830-1840
DOI:
Digital Object Identifier (open in new tab) 10.1109/tasl.2013.2258012
Verified by:
Gdańsk University of Technology

seen 126 times

Recommended for you

Meta Tags