DocumentCode
323818
Title
Voicing state determination of co-channel speech
Author
Benincasa, Daniel S. ; Savic, Michael I.
Author_Institution
OCSS, Rome, NY, USA
Volume
2
fYear
1998
fDate
12-15 May 1998
Firstpage
1021
Abstract
This paper presents a voicing state determination algorithm (VSDA) that is used to simultaneously estimate the voicing state of two speakers present in a segment of co-channel speech. Supervised learning trains a Bayesian classifier to predict the voicing states. The possible voicing states are silence, voiced/voiced, voiced/unvoiced, unvoiced/voiced and unvoiced/unvoiced. We have assumed the silent state as a subset of the unvoiced class, except when both speakers are silent. We have chosen a binary tree decision structure. Our feature set is a projection of a 37 dimensional feature vector onto a single dimension applied at each branch of the decision tree, using the Fisher linear discriminant. We have produced co-channel speech from the TIMIT database which is used for training and testing. Preliminary results, at signal to interference ratio of 0 dB, have produced classification accuracy of 82.6%, 73.45%, and 68.24% on male/female, male/male and female/female mixtures respectively
Keywords
Bayes methods; decision theory; learning (artificial intelligence); pattern classification; speech recognition; 37 dimensional feature vector; Bayesian classifier; Fisher linear discriminant; VSDA; binary tree decision structure; classification accuracy; co-channel speech; decision tree; female/female voices; male/female voices; male/male voices; silence; supervised learning; unvoiced/unvoiced state; unvoiced/voiced state; voiced/unvoiced state; voiced/voiced state; voicing state determination; voicing state determination algorithm; Bayesian methods; Binary trees; Decision trees; Interference; Spatial databases; Speech; State estimation; Supervised learning; Testing; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.675441
Filename
675441
Link To Document