• DocumentCode
    417776
  • Title

    Separation of harmonic structures based on tied Gaussian mixture model and information criterion for concurrent sounds

  • Author

    Katmeoka, H. ; Nishimoto, Takuya ; Sagayama, Shigeki

  • Author_Institution
    Graduate Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Japan
  • Volume
    4
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    A method for the separation of harmonic structures of cochannel input concurrent sounds is described. A model for multiple harmonic structures is constructed with a mixture of tied Gaussian mixtures, from which a single harmonic structure is modeled. Our algorithm enables estimation of both the number and the shape of the underlying harmonic structures, based on a maximum likelihood estimation of the model parameters using the EM algorithm and an information criterion. It operates without restriction on the number of mixed sounds and varieties of sound sources, and extracts accurate fundamental frequencies continuously with simple procedures in the spectral domain. Experiments showed high performance of the algorithm for both simultaneous speech and polyphonic music.
  • Keywords
    Gaussian processes; audio signal processing; harmonic analysis; harmonics; maximum likelihood estimation; music; optimisation; spectral analysis; speech processing; EM algorithm; cochannel input concurrent sounds; fundamental frequencies; harmonic structure separation; information criterion; maximum likelihood estimation; mixed sounds; polyphonic music; simultaneous speech; spectral domain; tied Gaussian mixture model; Acoustic signal processing; Frequency; Information science; Instruments; Maximum likelihood estimation; Music; Robustness; Shape; Signal processing algorithms; Speech analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326822
  • Filename
    1326822