Title :
Language informed bandwidth expansion
Author :
Han, Jinyu ; Mysore, Gautham J. ; Pardo, Bryan
Abstract :
High-level knowledge of language helps the human auditory system understand speech with missing information such as missing frequency bands. The automatic speech recognition community has shown that the use of this knowledge in the form of language models is crucial to obtaining high quality recognition results. In this paper, we apply this idea to the bandwidth expansion problem to automatically estimate missing frequency bands of speech. Specifically, we use language models to constrain the recently proposed non-negative hidden Markov model for this application. We compare the proposed method to a bandwidth expansion algorithm based on non-negative spectrogram factorization and show improved results on two standard signal quality metrics.
Keywords :
hidden Markov models; natural language processing; speech recognition; automatic speech recognition; human auditory system; language informed bandwidth expansion; missing frequency band estimation; nonnegative hidden Markov model; nonnegative spectrogram factorization; speech understanding; Dictionaries; Hidden Markov models; Narrowband; Spectrogram; Speech; Wideband; Bandwidth Expansion; Language Model; Non-negative Hidden Markov Model;
Conference_Titel :
Machine Learning for Signal Processing (MLSP), 2012 IEEE International Workshop on
Conference_Location :
Santander
Print_ISBN :
978-1-4673-1024-6
Electronic_ISBN :
1551-2541
DOI :
10.1109/MLSP.2012.6349783