DocumentCode :
542169
Title :
Robust speech recognition in Noisy Environments: The 2001 IBM spine evaluation system
Author :
Kingsbury, Brian ; Saon, George ; Mangu, Lidia ; Padmanabhan, Mukund ; Sarikaya, Ruhi
Author_Institution :
IBM T. J. Watson Research Center, Yorktown Heights, NY, 10598, USA
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
We report on the system IBM fielded in the second SPeech In Noisy Environments (SPINE-2) evaluation, conducted by the Naval Research Laboratory in October 2001. The key components of the system include an HMM-based automatic segmentation module using a novel set of LDA-transformed voicing and energy features, a multiple-pass decoding strategy that uses several speaker-and environment-normalization operations to deal with the highly variable acoustics of the evaluation, the combination of hypotheses from decoders operating on three distinct acoustic feature sets, and a class-based language model that uses both the SPINE-1 and SPINE-2 training data to estimate reliable probabilities for the new SPINE-2 vocabulary.
Keywords :
Atmospheric modeling; Computational modeling; Hidden Markov models; Speech; Speech recognition; Switches; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743652
Filename :
5743652
Link To Document :
بازگشت