DocumentCode
697837
Title
A statistical framework for artificial bandwidth extension exploiting speech waveform and phonetic transcription
Author
Bauer, P. ; Fingscheidt, T.
Author_Institution
Dept. of Signal Process., Tech. Univ. Braunschweig, Braunschweig, Germany
fYear
2009
fDate
24-28 Aug. 2009
Firstpage
1839
Lastpage
1843
Abstract
In the past, artificial bandwidth extension (ABWE) has primarily been investigated to enhance transmitted narrowband speech signals at the receiving side. State-of-the-art schemes show improved quality versus narrowband speech; however, a clear gap to wideband speech is still reported. This is largely due to the insufficient ABWE performance on fricatives, particularly /s/. We asked ourselves to what extent the speech quality could be improved, if we knew the currently spoken phoneme. In this paper we present a framework using phonetic transcriptions as a-priori knowledge besides the speech waveform. Possible applications are high-quality offline ABWE of telephone, pilot, or historic speech recordings, memory efficient narrowband speech synthesis followed by ABWE, and extension of narrowband telephone databases to train wideband acoustic models for automatic speech recognition. For the classical conversational telephony application, an improved ABWE scheme is also proposed making use of transcription information only during training.
Keywords
speech processing; speech recognition; statistical analysis; ABWE; artificial bandwidth extension; automatic speech recognition; narrowband speech signals; narrowband telephone databases; phonetic transcription; speech quality; speech waveform; statistical framework; wideband acoustic models; wideband speech; Abstracts; Legged locomotion; Robustness; Wideband;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2009 17th European
Conference_Location
Glasgow
Print_ISBN
978-161-7388-76-7
Type
conf
Filename
7077409
Link To Document