Title :
Missing data techniques for robust speech recognition
Author :
Cooke, Martin ; Morris, Andrew ; Green, Phil
Author_Institution :
Dept. of Comput. Sci., Sheffield Univ., UK
Abstract :
In noisy listening conditions, the information available on which to base speech recognition decisions is necessarily incomplete: some spectro-temporal regions are dominated by other sources. We report on the application of a variety of techniques for missing data in speech recognition. These techniques may be based on marginal distributions or on reconstruction of missing parts of the spectrum. Application of these ideas in the resource management task shows a performance which is robust to random removal of up to 80% of the frequency channels, but falls off rapidly with deletions which more realistically simulate masked speech. We report on a vowel classification experiment designed to isolate some of the RM problems for more detailed exploration. The results of this experiment confirm the general superiority of marginals-based schemes, demonstrate the viability of shared covariance statistics, and suggest several ways in which performance improvements on the larger task may be obtained
Keywords :
Gaussian distribution; acoustic noise; covariance analysis; pattern classification; spectral analysis; speech recognition; deletions; frequency channels; marginal distributions; masked speech; missing data techniques; noisy listening conditions; performance; reconstruction; resource management task; robust speech recognition; shared covariance statistics; spectro-temporal regions; vowel classification experiment; Acoustic noise; Application software; Automatic speech recognition; Computer science; Frequency; Hidden Markov models; Resource management; Robustness; Speech coding; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.596072