• DocumentCode
    1224366
  • Title

    Adaptive Beamforming With a Minimum Mutual Information Criterion

  • Author

    Kumatani, Kenichi ; Gehrig, Tobias ; Mayer, Uwe ; Stoimenov, Emilian ; McDonough, John ; Wolfel, Matthias

  • Author_Institution
    Inst. for Comput. Sci. & Eng., Univ. of Karlsruhe, Karlsruhe
  • Volume
    15
  • Issue
    8
  • fYear
    2007
  • Firstpage
    2527
  • Lastpage
    2541
  • Abstract
    In this paper, we consider an acoustic beamforming application where two speakers are simultaneously active. We construct one subband-domain beamformer in generalized sidelobe canceller (GSC) configuration for each source. In contrast to normal practice, we then jointly optimize the active weight vectors of both GSCs to obtain two output signals with minimum mutual information (MMI). Assuming that the subband snapshots are Gaussian-distributed, this MMI criterion reduces to the requirement that the cross-correlation coefficient of the subband outputs of the two GSCs vanishes. We also compare separation performance under the Gaussian assumption with that obtained from several super-Gaussian probability density functions (pdfs), namely, the Laplace and pdfs. Our proposed technique provides effective nulling of the undesired source, but without the signal cancellation problems seen in conventional beamforming. Moreover, our technique does not suffer from the source permutation and scaling ambiguities encountered in conventional blind source separation algorithms. We demonstrate the effectiveness of our proposed technique through a series of far-field automatic speech recognition experiments on data from the PASCAL Speech Separation Challenge (SSC). On the SSC development data, the simple delay-and-sum beamformer achieves a word error rate (WER) of 70.4%. The MMI beamformer under a Gaussian assumption achieves a 55.2% WER, which is further reduced to 52.0% with a pdf, whereas the WER for data recorded with a close-talking microphone is 21.6%.
  • Keywords
    Gaussian distribution; acoustic signal processing; array signal processing; probability; speaker recognition; Gaussian-distribution; adaptive acoustic beamforming; cross-correlation coefficient; generalized sidelobe canceller; minimum mutual information criterion; subband-domain beamformer; super Gaussian probability density function; Acoustic applications; Array signal processing; Automatic speech recognition; Blind source separation; Delay; Error analysis; Gaussian processes; Loudspeakers; Mutual information; Probability density function; Beamforming; microphone arrays; source separation; speech recognition;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2007.907430
  • Filename
    4317568