Title :
Robust speech recognition in Noisy Environments: The 2001 IBM spine evaluation system
Author :
Kingsbury, Brian ; Saon, George ; Mangu, Lidia ; Padmanabhan, Mukund ; Sarikaya, Ruhi
Author_Institution :
IBM T. J. Watson Research Center, Yorktown Heights, NY, 10598, USA
Abstract :
We report on the system IBM fielded in the second SPeech In Noisy Environments (SPINE-2) evaluation, conducted by the Naval Research Laboratory in October 2001. The key components of the system include an HMM-based automatic segmentation module using a novel set of LDA-transformed voicing and energy features, a multiple-pass decoding strategy that uses several speaker-and environment-normalization operations to deal with the highly variable acoustics of the evaluation, the combination of hypotheses from decoders operating on three distinct acoustic feature sets, and a class-based language model that uses both the SPINE-1 and SPINE-2 training data to estimate reliable probabilities for the new SPINE-2 vocabulary.
Keywords :
Atmospheric modeling; Computational modeling; Hidden Markov models; Speech; Speech recognition; Switches; Training;
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.2002.5743652