Title :
Factorial speech processing models for noise-robust automatic speech recognition
Author :
Khademian, Mahdi ; Homayounpour, Mohammad Mehdi
Author_Institution :
Lab. for Intell. Multimedia Process., Amirkabir Univ. of Technol., Tehran, Iran
Abstract :
This paper presents an introduction of factorial speech processing models for noise-robust automatic speech processing tasks. Factorial models try to use more noise information rather than other robustness techniques for better generative modeling of speech and noise and the way they are combine together. Since factorial models were not completely successful in noise-robust speech processing applications while they have significant achievements in other speech processing areas in the past, we decide to reconsider them and evaluate their effects in the Aurora 2 task. In addition to Aurora noises, two more regular noises are examined in our experiments including Helicopter and Locomotive engine noises. Experiments show that these models are successful when we faced with destructive noises in addition to their unexpected improvements for non-regular non-stationary noises like Babble.
Keywords :
signal denoising; speech recognition; Aurora noise; destructive noise; factorial speech processing model; helicopter engine noise; locomotive engine noise; noise robust automatic speech recognition; nonregular nonstationary noise; regular noise; robustness technique; Conferences; Decision support systems; Electrical engineering; Radio frequency; factorial models of speech processing; state-conditional observation distribution; two-dimensional Viterbi algorithm; weighted stereo sampling;
Conference_Titel :
Electrical Engineering (ICEE), 2015 23rd Iranian Conference on
Conference_Location :
Tehran
Print_ISBN :
978-1-4799-1971-0
DOI :
10.1109/IranianCEE.2015.7146292