• DocumentCode
    730304
  • Title

    Real-time independent vector analysis with Student´s t source prior for convolutive speech mixtures

  • Author

    Harris, Jack ; Rivet, Bertrand ; Naqvi, Syed Mohsen ; Chambers, Jonathon A. ; Jutten, Christian

  • Author_Institution
    GIPSA-Lab., Univ. de Grenoble, Grenoble, France
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    1856
  • Lastpage
    1860
  • Abstract
    A common approach to blind source separation is to use independent component analysis. However when dealing with realistic convolutive audio and speech mixtures, processing in the frequency domain at each frequency bin is required. As a result this introduces the permutation problem, inherent in independent component analysis, across the frequency bins. Independent vector analysis directly addresses this issue by modeling the dependencies between frequency bins, namely making use of a source prior. An alternative source prior for real-time (online) natural gradient independent vector analysis is proposed. A Student´s t probability density function is known to be more suited for speech sources, due to its heavier tails, and is incorporated into a real-time version of natural gradient independent vector analysis. In addition, the importance of the degrees of freedom parameter within the Student´s t distribution is highlighted. The final algorithm is realized as a real-time embedded application on a floating point Texas Instruments digital signal processor platform, where simulated recordings from a reverberant room are used for testing. Results are shown to be better than with the original (super-Gaussian) source prior.
  • Keywords
    independent component analysis; probability; signal processing; speech processing; blind source separation; convolutive speech mixtures; floating point digital signal processor platform; freedom parameter; independent component analysis; probability density function; real-time embedded application; real-time independent vector analysis; reverberant room; speech sources; Three-dimensional displays; embedded application; independent vector analysis; multivariate distribution; real-time; source separation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178292
  • Filename
    7178292