Title : 
Perceptually-based features in ASR
         
        
            Author : 
Mason, J.S.D. ; Gu, Y.
         
        
            Author_Institution : 
Dept. of Electr. Eng., Univ. Coll. of Swansea, UK
         
        
        
            fDate : 
1/19/1988 12:00:00 AM
         
        
        
        
            Abstract : 
Perceptually-based linear predictive (PLP) speech analysis, as proposed by Hermansky 1985, can have marked benefits in ASR (automatic speech recognition) systems. Four psychoacoustic factors are considered in PLP analysis, namely critical-band, masking effect, equal-loudness and intensity-loudness law. This paper presents experimental results aimed at illustrating the relative importance of each of these in the context of ASR. It is shown that the [J] SRU filter bank can be incorporated into the PLP process with very similar overall results. The ASR system is based on dynamic time warping (DTW), and a vocabulary consisting of the alphabet and zero-through-nine is used for tests
         
        
            Keywords : 
speech analysis and processing; speech recognition; automatic speech recognition; critical-band; dynamic time warping; equal-loudness; intensity-loudness law; linear predictive; masking effect; psychoacoustic factors; speech analysis;
         
        
        
        
            Conference_Titel : 
Speech Processing, IEE Colloquium on