Title : 
Robust speech recognition over mobile and IP networks in burst-like packet loss
         
        
            Author : 
Milner, B.; James, Alastair
         
        
            Author_Institution : 
Sch. of Comput. Sci., Univ. of East Anglia, Norwich, UK
         
        
            Abstract : 
This paper addresses the problem of achieving robust distributed speech recognition in the presence of burst-like packet loss. To compensate for packet loss, a number of techniques are investigated for estimating lost feature vectors. Experimental results on both a connected digits task and a large-vocabulary continuous speech recognition task show that simple methods, such as repetition, are less effective than interpolation methods, which better preserve the dynamics of the feature vector stream. The best performance is given by maximum a-posteriori (MAP) estimation of lost vectors, which uses statistics of the feature vector stream. At longer burst lengths, the performance of these compensation techniques deteriorates as the temporal correlation in the received feature vector stream decreases. To compensate for this, interleaving is proposed, which aims to disperse bursts of loss into a series of unconnected smaller bursts. Results show substantial gains in accuracy, to almost that of the no-loss condition, when interleaving is combined with estimation techniques, although at the expense of introducing delay. This leads to the proposal that, for a distributed speech recognition application, it is more beneficial to trade delay for accuracy than to trade bit-rate for accuracy, as forward error correction schemes do.
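
The interaction between interleaving and interpolation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the 3 x 4 block interleaver, the one-dimensional toy feature vectors, and the function names `interleave`, `deinterleave`, and `interpolate_lost` are all assumptions made for illustration, and plain linear interpolation stands in for the interpolation and MAP estimators evaluated in the paper.

```python
from typing import List, Optional, Sequence, TypeVar

T = TypeVar("T")
Vector = List[float]


def interleave(seq: Sequence[T], rows: int, cols: int) -> List[T]:
    """Block interleaver (assumed design): write row by row, read column by column."""
    assert len(seq) == rows * cols
    return [seq[r * cols + c] for c in range(cols) for r in range(rows)]


def deinterleave(seq: Sequence[T], rows: int, cols: int) -> List[T]:
    """Inverse of interleave(): the same permutation with the dimensions swapped."""
    return interleave(seq, cols, rows)


def interpolate_lost(frames: Sequence[Optional[Vector]]) -> List[Vector]:
    """Fill each run of lost frames (None) by linear interpolation between
    the received vectors on either side of the run. A run at the start or
    end of the stream falls back to repeating the nearest received vector."""
    out = list(frames)
    n = len(out)
    i = 0
    while i < n:
        if out[i] is not None:
            i += 1
            continue
        start = i
        while i < n and out[i] is None:
            i += 1
        prev = out[start - 1] if start > 0 else None
        nxt = out[i] if i < n else None
        if prev is None and nxt is None:
            raise ValueError("no received vectors to estimate from")
        gap = i - start + 1
        for k in range(start, i):
            if prev is None:
                out[k] = list(nxt)
            elif nxt is None:
                out[k] = list(prev)
            else:
                w = (k - start + 1) / gap  # weight grows across the burst
                out[k] = [(1 - w) * a + w * b for a, b in zip(prev, nxt)]
    return out


# Toy stream: 12 one-dimensional "feature vectors" with values 0.0 .. 11.0.
original = [[float(t)] for t in range(12)]
sent = interleave(original, rows=3, cols=4)

# A burst wipes out three consecutive positions in the transmitted order.
received = [None if 3 <= p <= 5 else v for p, v in enumerate(sent)]

# After deinterleaving, the burst is dispersed into isolated single losses.
restored_order = deinterleave(received, rows=3, cols=4)
lost_positions = [t for t, v in enumerate(restored_order) if v is None]
restored = interpolate_lost(restored_order)
```

In this toy case the three-frame burst lands on frames 1, 5, and 9 after deinterleaving, so every lost frame has received neighbours on both sides to interpolate from. The cost is the one the abstract names: frames must be buffered for a full `rows * cols` block at each end before they can be sent or decoded, which introduces delay rather than extra bit-rate.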
         
        
            Keywords : 
IP networks; interpolation; maximum likelihood estimation; mobile computing; mobile radio; speech recognition; bit-rate; burst-like packet loss; feature vector stream; forward error correction schemes; interpolation methods; maximum a-posteriori estimation; mobile networks; repetition methods; robust distributed speech recognition; robust speech recognition; Delay estimation; Interleaved codes; Proposals; Robustness; Statistics; Vocabulary; Distributed speech recognition; interleaving; maximum a-posteriori (MAP); packet loss
         
        
            Journal_Title : 
Audio, Speech, and Language Processing, IEEE Transactions on
         
        
            DOI : 
10.1109/TSA.2005.852997