• DocumentCode
    699927
  • Title

    MLP-based log spectral energy mapping for robust overlapping speech recognition

  • Author

    Weifeng Li ; Magimai-Doss, Mathew ; Dines, John ; Bourlard, Herve

  • Author_Institution
    IDIAP Res. Inst., Martigny, Switzerland
  • fYear
    2008
  • fDate
    25-29 Aug. 2008
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This paper investigates a multilayer perceptron (MLP) based acoustic feature mapping to extract robust features for automatic speech recognition (ASR) of overlapping speech. The MLP is trained to learn the mapping from log mel filter bank energies (MFBEs) extracted from the distant microphone recordings, including multiple overlapping speakers, to log MFBEs extracted from the clean speech signal. The outputs of the MLP are then used to generate mel filterbank cepstral coefficient (MFCC) acoustic features, that are subsequently used in acoustic model adaptation and system evaluation. The proposed approach is evaluated through extensive studies on the MONC corpus, which includes both non-overlapping single speaker and overlapping multi-speaker conditions. We demonstrate that by learning the mapping between log MFBEs extracted from noisy and clean signals the performance of ASR system can be significantly improved in overlapping multi-speaker condition compared a conventional delay-sum beamforming approach, while keeping the performance of the system on single non-overlapping speaker condition intact.
  • Keywords
    acoustic signal processing; cepstral analysis; channel bank filters; feature extraction; multilayer perceptrons; speaker recognition; MLP-based log spectral energy mapping; MONC corpus; acoustic feature mapping; acoustic model adaptation; automatic speech recognition; clean speech signal; delay-sum beamforming approach; mel filter bank energy; mel filterbank cepstral coefficient; microphone recording; multilayer perceptron; multispeaker condition; robust feature extraction; robust overlapping speech recognition; Abstracts; Cepstral analysis; Europe; Robustness; Silicon;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2008 16th European
  • Conference_Location
    Lausanne
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7080459