DocumentCode
2790242
Title
A mixture maximization approach to multipitch tracking with factorial hidden Markov models
Author
Wohlmayr, M. ; Stark, M. ; Pernkopf, F.
Author_Institution
Signal Process. & Speech Commun. Lab., Graz Univ. of Technol., Graz, Austria
fYear
2010
fDate
14-19 March 2010
Firstpage
5070
Lastpage
5073
Abstract
We present a simple and efficient feature modeling approach for tracking the pitch of two speakers speaking simultaneously. We model the spectrogram features of single speakers using Gaussian mixture models in combination with the minimum description length model selection criterion. Furthermore, the mixture maximization (MIXMAX) interaction model is employed to yield a probabilistic representation for the mixture of both speakers. Finally, a factorial hidden Markov model is applied for tracking. We demonstrate experimental results on two databases, and show the excellent performance of the proposed method in comparison to a well known multipitch tracking algorithm based on correlogram features.
Keywords
Gaussian processes; audio databases; feature extraction; hidden Markov models; optimisation; speaker recognition; Gaussian mixture model; correlogram feature modeling; factorial hidden Markov model; minimum description length model selection criterion; mixture maximization interaction model; multipitch tracking; probabilistic representation; spectrogram features; Hidden Markov models; Markov processes; Random variables; Signal processing; Signal processing algorithms; Spatial databases; Spectrogram; Speech analysis; Speech processing; Trajectory; Gaussian mixture model; Multipitch tracking; factorial hidden Markov model; mixture maximization;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5495048
Filename
5495048
Link To Document