Voicing state determination of co-channel speech

Author

Benincasa, Daniel S. ; Savic, Michael I.

Author_Institution

OCSS, Rome, NY, USA

Volume

2

fYear

1998

fDate

12-15 May 1998

Firstpage

1021

Abstract

This paper presents a voicing state determination algorithm (VSDA) that is used to simultaneously estimate the voicing state of two speakers present in a segment of co-channel speech. Supervised learning trains a Bayesian classifier to predict the voicing states. The possible voicing states are silence, voiced/voiced, voiced/unvoiced, unvoiced/voiced and unvoiced/unvoiced. We have assumed the silent state as a subset of the unvoiced class, except when both speakers are silent. We have chosen a binary tree decision structure. Our feature set is a projection of a 37 dimensional feature vector onto a single dimension applied at each branch of the decision tree, using the Fisher linear discriminant. We have produced co-channel speech from the TIMIT database which is used for training and testing. Preliminary results, at signal to interference ratio of 0 dB, have produced classification accuracy of 82.6%, 73.45%, and 68.24% on male/female, male/male and female/female mixtures respectively

Keywords

Bayes methods; decision theory; learning (artificial intelligence); pattern classification; speech recognition; 37 dimensional feature vector; Bayesian classifier; Fisher linear discriminant; VSDA; binary tree decision structure; classification accuracy; co-channel speech; decision tree; female/female voices; male/female voices; male/male voices; silence; supervised learning; unvoiced/unvoiced state; unvoiced/voiced state; voiced/unvoiced state; voiced/voiced state; voicing state determination; voicing state determination algorithm; Bayesian methods; Binary trees; Decision trees; Interference; Spatial databases; Speech; State estimation; Supervised learning; Testing; Vectors;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on

Conference_Location

Seattle, WA

ISSN

1520-6149

Print_ISBN

0-7803-4428-6

Type

conf

DOI

10.1109/ICASSP.1998.675441

Filename

675441