• DocumentCode
    2790242
  • Title

    A mixture maximization approach to multipitch tracking with factorial hidden Markov models

  • Author

    Wohlmayr, M. ; Stark, M. ; Pernkopf, F.

  • Author_Institution
    Signal Process. & Speech Commun. Lab., Graz Univ. of Technol., Graz, Austria
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    5070
  • Lastpage
    5073
  • Abstract
    We present a simple and efficient feature modeling approach for tracking the pitch of two speakers speaking simultaneously. We model the spectrogram features of single speakers using Gaussian mixture models in combination with the minimum description length model selection criterion. Furthermore, the mixture maximization (MIXMAX) interaction model is employed to yield a probabilistic representation for the mixture of both speakers. Finally, a factorial hidden Markov model is applied for tracking. We demonstrate experimental results on two databases, and show the excellent performance of the proposed method in comparison to a well known multipitch tracking algorithm based on correlogram features.
  • Keywords
    Gaussian processes; audio databases; feature extraction; hidden Markov models; optimisation; speaker recognition; Gaussian mixture model; correlogram feature modeling; factorial hidden Markov model; minimum description length model selection criterion; mixture maximization interaction model; multipitch tracking; probabilistic representation; spectrogram features; Hidden Markov models; Markov processes; Random variables; Signal processing; Signal processing algorithms; Spatial databases; Spectrogram; Speech analysis; Speech processing; Trajectory; Gaussian mixture model; Multipitch tracking; factorial hidden Markov model; mixture maximization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495048
  • Filename
    5495048