Title :
Modeling speech with sum-product networks: Application to bandwidth extension
Author :
Peharz, Robert ; Kapeller, Georg ; Mowlaee, Pejman ; Pernkopf, Franz
Abstract :
Sum-product networks (SPNs) are a recently proposed type of probabilistic graphical models allowing complex variable interactions while still granting efficient inference. In this paper we demonstrate the suitability of SPNs for modeling log-spectra of speech signals using the application of artificial bandwidth extension, i.e. artificially replacing the high-frequency content which is lost in telephone signals. We use SPNs as observation models in hidden Markov models (HMMs), which model the temporal evolution of log short-time spectra. Missing frequency bins are replaced by the SPNs using most-probable-explanation inference, where the state-dependent reconstructions are weighted with the HMM state posterior. According to subjective listening and objective evaluation, our system consistently and significantly improves the state of the art.
Keywords :
graph theory; hidden Markov models; speech processing; HMMs; artificial bandwidth extension; hidden Markov models; log short-time spectra; objective evaluation; probabilistic graphical models; speech bandwidth extension; speech signals; subjective listening; sum-product networks; telephone signals; Bandwidth; Computational modeling; Graphical models; Hidden Markov models; Speech; Speech enhancement; HMM; SPN; graphical models; speech bandwidth extension;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854292