Title : 
Noise-robust speech recognition using a new spectral estimation method “PHASOR”
         
        
            Author : 
Aikawa, Kiyoaki ; Ishizuka, Kentaro
         
        
            Author_Institution : 
NTT Communication Science Laboratories, NTT Corporation, 3-1 Morinosato-Wakamiya, Atsugi-Shi, Kanagawa 243-0198 Japan
         
        
        
        
        
            Abstract : 
This paper proposes a new noise-robust spectral estimation method for speech recognition. The new method, called PHASOR, is characterized by inside-frame processing: the speech spectrum is estimated from a single impulse response obtained by summing multiple pitch periods within a frame with their phases synchronized. Because of this inside-frame processing, PHASOR improves spectral estimation accuracy and suppresses additive noise. The improvement is more pronounced when the pitch fluctuates or changes within the frame. Speaker-dependent and speaker-independent phoneme recognition experiments demonstrate that PHASOR greatly reduces the recognition error rate for speech data contaminated by noise. It also outperforms two conventional noise reduction methods, cepstral mean normalization and spectral subtraction.
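
The inside-frame idea in the abstract can be illustrated with a toy sketch. This is not the authors' PHASOR implementation, which the abstract does not specify. It only shows the general principle of aligning the pitch periods in one frame, averaging them into a single response, and taking the spectrum of that response. The pitch-mark positions are assumed to be known in advance, and the names `phase_aligned_average`, `magnitude_spectrum`, `response`, and `pitch_marks` are hypothetical.

```python
import cmath
import math


def phase_aligned_average(frame, pitch_marks, length):
    """Average the segments of `frame` that start at each pitch mark.

    Assumption: `pitch_marks` are sample indices of the pitch-period onsets,
    supplied by some external pitch tracker (not shown here).
    """
    segments = [frame[m:m + length] for m in pitch_marks
                if m + length <= len(frame)]
    if not segments:
        raise ValueError("no complete pitch period in frame")
    # Sample-wise mean over the aligned segments.
    return [sum(column) / len(segments) for column in zip(*segments)]


def magnitude_spectrum(x):
    """Naive DFT magnitude for bins 0..N/2 (fine for short toy signals)."""
    n = len(x)
    return [abs(sum(x[t] * cmath.exp(-2j * math.pi * k * t / n)
                    for t in range(n)))
            for k in range(n // 2 + 1)]


# Toy frame: one decaying-sinusoid response repeated at irregular pitch marks,
# mimicking a pitch that fluctuates inside the frame.
response = [math.exp(-0.1 * t) * math.sin(0.5 * t) for t in range(40)]
pitch_marks = [0, 50, 103, 149, 200]
frame = [0.0] * 240
for mark in pitch_marks:
    for t, value in enumerate(response):
        frame[mark + t] += value

averaged = phase_aligned_average(frame, pitch_marks, len(response))
spectrum = magnitude_spectrum(averaged)
```

In this noise-free toy case the averaged response matches the single underlying response even though the pitch marks are irregularly spaced. With uncorrelated additive noise, averaging K aligned periods would be expected to lower the noise variance by roughly a factor of K, which is one plausible reading of why inside-frame processing suppresses noise.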
         
        
            Keywords : 
Cepstral analysis; Speech;
         
        
        
        
            Conference_Title : 
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
         
        
            Conference_Location : 
Orlando, FL, USA
         
        
        
            Print_ISBN : 
0-7803-7402-9
         
        
        
            DOI : 
10.1109/ICASSP.2002.5743738