DocumentCode :
763607
Title :
Robust speech recognition over mobile and IP networks in burst-like packet loss
Author :
Milner, B. ; James, Alastair
Author_Institution :
Sch. of Comput. Sci., Univ. of East Anglia, Norwich, UK
Volume :
14
Issue :
1
fYear :
2006
Firstpage :
223
Lastpage :
231
Abstract :
This paper addresses the problem of achieving robust distributed speech recognition in the presence of burst-like packet loss. To compensate for packet loss a number of techniques are investigated to provide estimates of lost vectors. Experimental results on both a connected digits task and a large vocabulary continuous speech recognition task show that simple methods, such as repetition, are not as effective as interpolation methods which are better able to preserve the dynamics of the feature vector stream. Best performance is given by maximum a-posteriori (MAP) estimation of lost vectors which utilizes statistics of the feature vector stream. At longer burst lengths the performance of these compensation techniques deteriorates as the temporal correlation in the received feature vector stream reduces. To compensate for this interleaving is proposed which aims to disperse bursts of loss into a series of unconnected smaller bursts. Results show substantial gains in accuracy, to almost that of the no loss condition, when interleaving is combined with estimation techniques, although this is at the expense of introducing delay. This leads to the proposal that, for a distributed speech recognition application, it is more beneficial to trade delay for accuracy rather than trading bit-rate for accuracy as in forward error correction schemes.
Keywords :
IP networks; interpolation; maximum likelihood estimation; mobile computing; mobile radio; speech recognition; IP networks; bit-rate; burst-like packet loss; feature vector stream; forward error correction schemes; interpolation methods; maximum a-posteriori estimation; mobile networks; repetition methods; robust distributed speech recognition; robust speech recognition; Delay estimation; IP networks; Interleaved codes; Interpolation; Maximum a posteriori estimation; Proposals; Robustness; Speech recognition; Statistics; Vocabulary; Distributed speech recognition; interleaving; interpolation; maximum a-posteriori (MAP); packet loss;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TSA.2005.852997
Filename :
1561279
Link To Document :
بازگشت