DocumentCode :
2212645
Title :
Ultra low bit rate voice coding
Author :
Ovens, M.J. ; Ponting, K.M. ; Turner, M.E.
Author_Institution :
20/20 Speech Ltd., Great Malvern, UK
fYear :
2000
fDate :
2000
Firstpage :
42614
Lastpage :
915
Abstract :
High frequency (HF) radio is used for long haul or extended range communications in many situations. Under stressed HF channel conditions, the supportable data rate falls below that required by existing low bit rate speech coding algorithms. This paper presents research undertaken at DERA Malvern on the development of a real-time speech coding system which utilises automatic speech recognition (ASR) and synthesis technologies to achieve speech coding at data rates below 300 bps. A continuous speech recogniser is used to transcribe incoming speech as a sequence of sub-word units, termed acoustic segments. Prosodic information (pitch and duration) is combined with segment identity to form a serial data stream suitable for transmission. A parallel formant speech synthesiser is used to synthesise the speech at the receiver, using models trained to a particular talker´s voice to establish talker characteristics
Keywords :
speech coding; DERA Malvern; HF radio; acoustic segments; automatic speech recognition; automatic speech synthesis; continuous speech recogniser; data rate; data rates; duration; extended range communications; high frequency radio; long haul communications; low bit rate speech coding algorithms; parallel formant speech synthesiser; pitch; prosodic information; real-time speech coding system; research; serial data stream; stressed HF channel conditions; sub-word units; talker characteristics; talker voice trained model; ultra low bit rate voice coding;
fLanguage :
English
Publisher :
iet
Conference_Titel :
Speech Coding for Algorithms for Radio Channels (Ref. No. 2000/012), IEE Seminar
Conference_Location :
London
Type :
conf
DOI :
10.1049/ic:20000047
Filename :
855172
Link To Document :
بازگشت