Title : 
Hands-free speech recognition and communication on PDAs using microphone array technology
         
        
            Author : 
Herbordt, W. ; Horiuchi, T. ; Fujimoto, M. ; Jitsuhiro, T. ; Nakamura, S.
         
        
            Author_Institution : 
ATR Spoken Language Commun. Res. Lab., Kyoto
         
        
        
        
        
        
            Abstract : 
In this paper, a personal digital assistant (PDA) for hands-free speech recognition and communication with a microphone array mounted on the PDA is presented. An outlier-robust generalized sidelobe canceller (RGSC) and a minimum mean-squared error (MMSE) estimator for log Mel-spectral energy coefficients using a Gaussian mixture model (GMM) for clean speech are implemented in real-time and evaluated for speech recognition based on a small experimental multichannel database. It is shown that the joint system of beamformer and single-channel noise suppression highly improves the noise-robustness of a large-vocabulary speech recognizer so that down to SNR = 5 dB more than 91% word accuracy is obtained
         
        
            Keywords : 
Gaussian processes; array signal processing; interference suppression; least mean squares methods; microphone arrays; mobile communication; notebook computers; speech recognition; Gaussian mixture model; PDA; hands-free speech recognition; log Mel-spectral energy; microphone array technology; minimum mean-squared error estimation; multichannel database; personal digital assistant; robust generalized sidelobe canceller; single-channel noise suppression; Acoustic noise; Automatic speech recognition; Microphone arrays; Noise cancellation; Noise reduction; Noise robustness; Personal digital assistants; Sensor arrays; Speech recognition; Universal Serial Bus;
         
        
        
        
            Conference_Titel : 
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
         
        
            Conference_Location : 
San Juan
         
        
            Print_ISBN : 
0-7803-9478-X
         
        
            Electronic_ISBN : 
0-7803-9479-8
         
        
        
            DOI : 
10.1109/ASRU.2005.1566509