DocumentCode
396228
Title
Usable speech measures and their fusion
Author
Yantorno, Robert E. ; Smolenski, Brett Y ; Chandra, Nishant
Author_Institution
ECE Dept., Temple Univ., Philadelphia, PA, USA
Volume
3
fYear
2003
fDate
25-28 May 2003
Abstract
Usable speech is a novel concept related to the co-channel speech problem. Co-channel speech occurs when more than one person is talking at the same time. The idea of usable speech is to identify and extract those portions of co-channel speech that are still useful for speech processing applications such as speaker identification or speech recognition, which do not work in cochannel environments. Usable speech measures are features that are extracted from the co-channel signal to detect the presence of usable as well as co-channel (unusable) speech. Several usable speech measures are currently being developed; however, these measures detect only about 75% of the usable speech. To improve on this performance, nonlinear estimation and Bayesian classification are used to fuse the information in two recently proposed usable speech measures. Using fusion resulted in a 15% increase in hits (usable speech frames detected) and a 37% decrease in false alarms.
Keywords
Bayes methods; nonlinear estimation; sensor fusion; speaker recognition; speech processing; speech recognition; Bayesian classification; co-channel speech; nonlinear estimation; speaker identification; speech processing; speech recognition; usable speech measures fusion; Awards Planning & Policy Committee; Current measurement; Data mining; Feature extraction; Interference; Shape measurement; Signal detection; Speech coding; Speech processing; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on
Conference_Location
Bangkok
Print_ISBN
0-7803-7761-3
Type
conf
DOI
10.1109/ISCAS.2003.1205124
Filename
1205124
Link To Document