DocumentCode :
2352678
Title :
The Markov selection model for concurrent speech recognition
Author :
Smaragdis, Paris ; Raj, Bhiksha
Author_Institution :
Adobe Syst. Inc., Cambridge, MA, USA
fYear :
2010
fDate :
Aug. 29 2010-Sept. 1 2010
Firstpage :
214
Lastpage :
219
Abstract :
In this paper we introduce a new Markov model that is capable of recognizing speech from recordings of simultaneously speaking a priori known speakers. This work is based on recent work on non-negative representations of spectrograms, which has been shown to be very effective in source separation problems. In this paper we extend these approaches to design a Markov selection model that is able to recognize sequences even when they are presented mixed together. We do so without the need to perform separation on the signals. Unlike factorial Markov models which have been used similarly in the past, this approach features a low computational complexity in the number of sources and Markov states, which makes it a highly efficient alternative. We demonstrate the use of this framework in recognizing speech from mixtures of known speakers.
Keywords :
Markov processes; computational complexity; source separation; speech recognition; Markov selection model; computational complexity; concurrent speech recognition; source separation problems; spectrogram nonnegative representations; Computational modeling; Dictionaries; Equations; Hidden Markov models; Markov processes; Mathematical model; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning for Signal Processing (MLSP), 2010 IEEE International Workshop on
Conference_Location :
Kittila
ISSN :
1551-2541
Print_ISBN :
978-1-4244-7875-0
Electronic_ISBN :
1551-2541
Type :
conf
DOI :
10.1109/MLSP.2010.5588124
Filename :
5588124
Link To Document :
بازگشت