DocumentCode :
2594488
Title :
Robust voice recognition over IP and mobile networks
Author :
Milner, B.
Author_Institution :
BT Cellnet Mobile Internet Centre, Suffolk
Volume :
2
fYear :
2000
fDate :
2000
Firstpage :
1197
Abstract :
This work looks at the issues involved in performing robust speech recognition over mobile and IP networks. The conventional method for sending speech across a mobile or IP network is to encode the speech on the terminal device using a low bit-rate codec and then transmit the stream of codec parameters. It is shown in this work that for speech recognition applications an alternative is available whereby the front-end processing part of a network-based speech recogniser is detached and moved onto the terminal device. Recognition features are then sent over the network to the remote recogniser. Simulations demonstrate that sending the speech features in this manner can provide a significant enhancement in recognition performance over the traditional codec-based approach. This technique forms the basis of the ETSI (European Telecommunications Standards Institute) Aurora standard. Problems arising with access over IP networks are also considered and in particular that of packet loss. A novel two-stage identification and estimation strategy is introduced which compensates for this loss of speech packets. Simulation results show that an almost negligible loss in recognition performance is possible at packet losses of up to 50%
Keywords :
Internet telephony; land mobile radio; packet radio networks; speech codecs; speech recognition; subscriber loops; telecommunication standards; transport protocols; ETSI Aurora standard; European Telecommunications Standards Institute; IP networks; VoIP; access networks; codec parameters; front-end processing; low bit-rate codec; mobile networks; network-based speech recogniser; packet losses; recognition performance enhancement; robust speech recognition; robust voice recognition; simulation results; speech coding; speech features; terminal device; two-stage estimation; two-stage identification; IP networks; Internet telephony; Mobile handsets; Performance loss; Robustness; Speech codecs; Speech enhancement; Speech processing; Speech recognition; Telecommunication standards;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Personal, Indoor and Mobile Radio Communications, 2000. PIMRC 2000. The 11th IEEE International Symposium on
Conference_Location :
London
Print_ISBN :
0-7803-6463-5
Type :
conf
DOI :
10.1109/PIMRC.2000.881609
Filename :
881609
Link To Document :
بازگشت