DocumentCode
177637
Title
PYIN: A fundamental frequency estimator using probabilistic threshold distributions
Author
Mauch, Matthias ; Dixon, Sam
Author_Institution
Centre for Digital Music, Queen Mary Univ. of London, London, UK
fYear
2014
fDate
4-9 May 2014
Firstpage
659
Lastpage
663
Abstract
We propose the Probabilistic YIN (PYIN) algorithm, a modification of the well-known YIN algorithm for fundamental frequency (F0) estimation. Conventional YIN is a simple yet effective method for frame-wise monophonic F0 estimation and remains one of the most popular methods in this domain. In order to eliminate short-term errors, outputs of frequency estimators are usually post-processed resulting in a smoother pitch track. One shortcoming of YIN is that such post-processing cannot fall back on alternative interpretations of the signal because the method outputs precisely one estimate per frame. To address this problem we modify YIN to output multiple pitch candidates with associated probabilities (PYIN Stage 1). These probabilities arise naturally from a prior distribution on the YIN threshold parameter. We use these probabilities as observations in a hidden Markov model, which is Viterbi-decoded to produce an improved pitch track (PYIN Stage 2). We demonstrate that the combination of Stages 1 and 2 raises recall and precision substantially. The additional computational complexity of PYIN over YIN is low. We make the method freely available online1 as an open source C++ library for Vamp hosts.
Keywords
Viterbi decoding; computational complexity; frequency estimation; hidden Markov models; speech processing; statistical distributions; PYIN algorithm; Vamp hosts; Viterbi; YIN threshold parameter; computational complexity; framewise monophonic F0 estimation; fundamental frequency estimation; fundamental frequency estimators; hidden Markov model; open source C++ library; pitch track; post-processing; probabilistic YIN algorithm; probabilistic threshold distributions; short-term errors; Algorithm design and analysis; Databases; Frequency estimation; Hidden Markov models; Probabilistic logic; Signal processing algorithms; Smoothing methods; Pitch estimation; YIN; pitch tracking;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6853678
Filename
6853678
Link To Document