DocumentCode
698869
Title
Bandwidth extension of telephone speech using frame-based excitation and robust features
Author
Uysal, Ismail ; Sathyendra, Harsha ; Harris, John G.
Author_Institution
Comput. NeuroEngineering Lab., Univ. of Florida, Gainesville, FL, USA
fYear
2005
fDate
4-8 Sept. 2005
Firstpage
1
Lastpage
4
Abstract
The standards that are still in use for telephone communications since the 1950s limit the information bandwidth to 300-3400Hz. However, in normal conversational speech, the frequency content is mainly between 0-8000Hz. This constraint degrades not only the sound quality but also the intelligibility of the transmitted signal. Instead of modifying the present telecommunication infrastructures, which would cost billions of dollars, many researchers have been studying more efficient methods to increase the quality of telephone speech. This paper develops an innovative solution to bandwidth extension, which is based upon the Linear Source Filter Model that breaks speech up into two parts: the excitation and the spectral envelope. Novel approaches are used to extend the frequency information for both parts. This algorithm particularly emphasizes low frequency reconstruction without neglecting high frequencies. Furthermore, different feature sets to model the spectral envelope are employed for better performance under noisy conditions.
Keywords
spectral analysis; speech enhancement; speech intelligibility; telephony; bandwidth 300 Hz to 3400 Hz; bandwidth extension; frame-based excitation; frequency information; frequency reconstruction; linear source filter model; robust feature; sound quality; spectral envelope; telephone communication; telephone speech; Feature extraction; Frequency modulation; Hidden Markov models; Maximum likelihood detection; Niobium; Nonlinear filters; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2005 13th European
Conference_Location
Antalya
Print_ISBN
978-160-4238-21-1
Type
conf
Filename
7078466
Link To Document