DocumentCode :
396228
Title :
Usable speech measures and their fusion
Author :
Yantorno, Robert E. ; Smolenski, Brett Y ; Chandra, Nishant
Author_Institution :
ECE Dept., Temple Univ., Philadelphia, PA, USA
Volume :
3
fYear :
2003
fDate :
25-28 May 2003
Abstract :
Usable speech is a novel concept related to the co-channel speech problem. Co-channel speech occurs when more than one person is talking at the same time. The idea of usable speech is to identify and extract those portions of co-channel speech that are still useful for speech processing applications such as speaker identification or speech recognition, which do not work in cochannel environments. Usable speech measures are features that are extracted from the co-channel signal to detect the presence of usable as well as co-channel (unusable) speech. Several usable speech measures are currently being developed; however, these measures detect only about 75% of the usable speech. To improve on this performance, nonlinear estimation and Bayesian classification are used to fuse the information in two recently proposed usable speech measures. Using fusion resulted in a 15% increase in hits (usable speech frames detected) and a 37% decrease in false alarms.
Keywords :
Bayes methods; nonlinear estimation; sensor fusion; speaker recognition; speech processing; speech recognition; Bayesian classification; co-channel speech; nonlinear estimation; speaker identification; speech processing; speech recognition; usable speech measures fusion; Awards Planning & Policy Committee; Current measurement; Data mining; Feature extraction; Interference; Shape measurement; Signal detection; Speech coding; Speech processing; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on
Conference_Location :
Bangkok
Print_ISBN :
0-7803-7761-3
Type :
conf
DOI :
10.1109/ISCAS.2003.1205124
Filename :
1205124
Link To Document :
بازگشت