• DocumentCode
    177637
  • Title

    PYIN: A fundamental frequency estimator using probabilistic threshold distributions

  • Author

    Mauch, Matthias ; Dixon, Sam

  • Author_Institution
    Centre for Digital Music, Queen Mary Univ. of London, London, UK
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    659
  • Lastpage
    663
  • Abstract
    We propose the Probabilistic YIN (PYIN) algorithm, a modification of the well-known YIN algorithm for fundamental frequency (F0) estimation. Conventional YIN is a simple yet effective method for frame-wise monophonic F0 estimation and remains one of the most popular methods in this domain. In order to eliminate short-term errors, outputs of frequency estimators are usually post-processed resulting in a smoother pitch track. One shortcoming of YIN is that such post-processing cannot fall back on alternative interpretations of the signal because the method outputs precisely one estimate per frame. To address this problem we modify YIN to output multiple pitch candidates with associated probabilities (PYIN Stage 1). These probabilities arise naturally from a prior distribution on the YIN threshold parameter. We use these probabilities as observations in a hidden Markov model, which is Viterbi-decoded to produce an improved pitch track (PYIN Stage 2). We demonstrate that the combination of Stages 1 and 2 raises recall and precision substantially. The additional computational complexity of PYIN over YIN is low. We make the method freely available online1 as an open source C++ library for Vamp hosts.
  • Keywords
    Viterbi decoding; computational complexity; frequency estimation; hidden Markov models; speech processing; statistical distributions; PYIN algorithm; Vamp hosts; Viterbi; YIN threshold parameter; computational complexity; framewise monophonic F0 estimation; fundamental frequency estimation; fundamental frequency estimators; hidden Markov model; open source C++ library; pitch track; post-processing; probabilistic YIN algorithm; probabilistic threshold distributions; short-term errors; Algorithm design and analysis; Databases; Frequency estimation; Hidden Markov models; Probabilistic logic; Signal processing algorithms; Smoothing methods; Pitch estimation; YIN; pitch tracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853678
  • Filename
    6853678