Title :
Discriminative training of hidden Markov models for multiple pitch tracking [speech processing examples]
Author :
Bach, Francis R. ; Jordan, Michael I.
Author_Institution :
Div. of Comput. Sci., California Univ., Berkeley, CA, USA
Abstract :
We present a multiple pitch tracking algorithm that is based on direct probabilistic modeling of the spectrogram of the signal. The model is a factorial hidden Markov model whose parameters are learned discriminatively from the Keele pitch database. Our algorithm can track several pitches and determines the number of pitches that are active at any given time. We present simulation results on mixtures of several speech signals and noise, showing the robustness of our approach.
Keywords :
feature extraction; frequency estimation; hidden Markov models; speech processing; active pitch number determination; discriminative training; factorial hidden Markov model; multiple pitch tracking; pitch extraction; signal spectrogram probabilistic modeling; speech processing; speech signal/noise mixtures; Algorithm design and analysis; Computer science; Graphical models; Hidden Markov models; Inference algorithms; Multiple signal classification; Noise robustness; Signal processing algorithms; Spectrogram; Speech;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
Print_ISBN :
0-7803-8874-7
DOI :
10.1109/ICASSP.2005.1416347