• DocumentCode
    2400655
  • Title

    Convolutive blind speech separation in a room using cross spectral density matrix

  • Author

    Prabhu, C. ; Pradeep, S. ; Baskaran, R. ; Chellappan, C.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Anna Univ., Chennai, India
  • fYear
    2010
  • fDate
    28-29 Dec. 2010
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    The problem of separation of audio sources recorded in a real world situation is well established in modern literature. The method to solve this problem is Blind Speech Separation (BSS). The recording environment is usually modeled as convolutive (i.e. number of speech sources should be equal to or less than number of microphone arrays). In this paper, we propose a new frequency domain approach to convolutive blind speech separation. Matrix Diagonalization method is applied on cross power spectral density matrices of the microphone inputs to determine the mixing system at each frequency bin up to a permutation ambiguity. Then, we propose an efficient algorithm to resolve permutation ambiguity. The inverse of the mixing system is then used to find the separate sources. The performance of the proposed algorithm is demonstrated by experiments conducted in real reverberant rooms.
  • Keywords
    acoustic convolution; blind source separation; matrix algebra; microphone arrays; speech processing; audio sources separation; convolutive blind speech separation; cross power spectral density matrix; matrix diagonalization; microphone; permutation ambiguity; reverberant room; Algorithm design and analysis; Frequency domain analysis; Microphones; Signal to noise ratio; Source separation; Speech; Blind Speech Separation; Cross-Power Spectral Density; Matrix Diagonalization; Permutation ambiguity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Computing Research (ICCIC), 2010 IEEE International Conference on
  • Conference_Location
    Coimbatore
  • Print_ISBN
    978-1-4244-5965-0
  • Electronic_ISBN
    978-1-4244-5967-4
  • Type

    conf

  • DOI
    10.1109/ICCIC.2010.5705742
  • Filename
    5705742