DocumentCode :
699927
Title :
MLP-based log spectral energy mapping for robust overlapping speech recognition
Author :
Weifeng Li ; Magimai-Doss, Mathew ; Dines, John ; Bourlard, Herve
Author_Institution :
IDIAP Res. Inst., Martigny, Switzerland
fYear :
2008
fDate :
25-29 Aug. 2008
Firstpage :
1
Lastpage :
5
Abstract :
This paper investigates a multilayer perceptron (MLP) based acoustic feature mapping to extract robust features for automatic speech recognition (ASR) of overlapping speech. The MLP is trained to learn the mapping from log mel filter bank energies (MFBEs) extracted from the distant microphone recordings, including multiple overlapping speakers, to log MFBEs extracted from the clean speech signal. The outputs of the MLP are then used to generate mel filterbank cepstral coefficient (MFCC) acoustic features, that are subsequently used in acoustic model adaptation and system evaluation. The proposed approach is evaluated through extensive studies on the MONC corpus, which includes both non-overlapping single speaker and overlapping multi-speaker conditions. We demonstrate that by learning the mapping between log MFBEs extracted from noisy and clean signals the performance of ASR system can be significantly improved in overlapping multi-speaker condition compared a conventional delay-sum beamforming approach, while keeping the performance of the system on single non-overlapping speaker condition intact.
Keywords :
acoustic signal processing; cepstral analysis; channel bank filters; feature extraction; multilayer perceptrons; speaker recognition; MLP-based log spectral energy mapping; MONC corpus; acoustic feature mapping; acoustic model adaptation; automatic speech recognition; clean speech signal; delay-sum beamforming approach; mel filter bank energy; mel filterbank cepstral coefficient; microphone recording; multilayer perceptron; multispeaker condition; robust feature extraction; robust overlapping speech recognition; Abstracts; Cepstral analysis; Europe; Robustness; Silicon;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne
ISSN :
2219-5491
Type :
conf
Filename :
7080459
Link To Document :
بازگشت