• DocumentCode
    179787
  • Title
    A maximum a Posterior-based reconstruction approach to speech bandwidth expansion in noise
  • Author
    Hyunson Seo ; Hong-Goo Kang ; Soong, Frank
  • Author_Institution
    Dept. of E.E., Yonsei Univ., Seoul, South Korea
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    6087
  • Lastpage
    6091
  • Abstract
    We propose a novel bandwidth expansion algorithm that extends a narrowband speech signal to wideband by exploiting segment examples pre-stored in a speaker-independent database. Both narrowband and wideband representations of the speech signals are pre-stored in the corpus and are dynamically chopped into variable-length segments. Narrowband segments are used dynamically to explain a given narrowband input sentence, while the wideband-expanded version of the input sentence is constructed correspondingly. The Maximum A Posterior (MAP) criterion chosen for matching in the narrowband favors longer segment patches. As a result, the number of candidate choices in the matching process is significantly reduced during decoding. The approach is further generalized to deal with noise-corrupted narrowband input signals by incorporating the well-known Vector Taylor Series (VTS) noise adaptation algorithm into the matching and bandwidth expansion process. A series of experiments is performed to validate the approach on both clean and noise-corrupted narrowband speech, with samples corrupted by car noise and babble noise.
  • Keywords
    maximum likelihood estimation; signal reconstruction; signal representation; speech processing; MAP criterion; VTS noise adaptation algorithm; babble noise; car noise; clean narrowband speech; matching process; maximum a posterior-based reconstruction approach; narrowband input sentence; narrowband input signals; narrowband representation; narrowband segments; narrowband speech signal; noise corrupted narrowband speech; speaker independent database; speech bandwidth expansion algorithm; speech signal representation; vector taylor series; wideband representation; Hidden Markov models; Narrowband; Noise; Speech; Vectors; Wideband; corpus-model; maximum a posterior; noise reduction; speech bandwidth expansion; vector Taylor series;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type
    conf
  • DOI
    10.1109/ICASSP.2014.6854773
  • Filename
    6854773