A maximum a Posterior-based reconstruction approach to speech bandwidth expansion in noise

Author

Hyunson Seo ; Hong-Goo Kang ; Soong, Frank

Author_Institution

Dept. of E.E., Yonsei Univ., Seoul, South Korea

fYear

2014

fDate

4-9 May 2014

Firstpage

6087

Lastpage

6091

Abstract

We propose a novel bandwidth expansion algorithm for extending narrowband speech signal to wideband by exploiting segment examples pre-stored in a speaker independent database. Both narrowband and wideband representation of speech signals are pre-stored in the corpus and they are dynamically chopped into variable length segments. Narrowband segments are used dynamically to explain a given narrowband input sentence while the wideband expanded version of the input sentence is constructed correspondingly. The matching process in the narrowband favors a longer segment patch by the chosen Maximum A Posterior (MAP) criterion. As a result, the multiple choices in matching process are significantly reduced with the MAP criterion in decoding. The approach is further generalized to deal with noise corrupted narrowband input signals and the well-known Vector Taylor Series (VTS) noise adaptation algorithm is incorporated into the matching and bandwidth expansion process. A series of experiments is performed to validate the approach on both clean and noise corrupted narrowband speech where both car noise and babble noise corrupted samples are tested.

Keywords

maximum likelihood estimation; signal reconstruction; signal representation; speech processing; MAP criterion; VTS noise adaptation algorithm; babble noise; car noise; clean narrowband speech; matching process; maximum a posterior-based reconstruction approach; narrowband input sentence; narrowband input signals; narrowband representation; narrowband segments; narrowband speech signal; noise corrupted narrowband speech; speaker independent database; speech bandwidth expansion algorithm; speech signal representation; vector taylor series; wideband representation; Hidden Markov models; Narrowband; Noise; Speech; Vectors; Wideband; corpus-model; maximum a posterior; noise reduction; speech bandwidth expansion; vector Taylor series;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location

Florence

Type

conf

DOI

10.1109/ICASSP.2014.6854773

Filename

6854773