• DocumentCode
    323818
  • Title

    Voicing state determination of co-channel speech

  • Author

    Benincasa, Daniel S. ; Savic, Michael I.

  • Author_Institution
    OCSS, Rome, NY, USA
  • Volume
    2
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    1021
  • Abstract
    This paper presents a voicing state determination algorithm (VSDA) that is used to simultaneously estimate the voicing state of two speakers present in a segment of co-channel speech. Supervised learning trains a Bayesian classifier to predict the voicing states. The possible voicing states are silence, voiced/voiced, voiced/unvoiced, unvoiced/voiced and unvoiced/unvoiced. We have assumed the silent state as a subset of the unvoiced class, except when both speakers are silent. We have chosen a binary tree decision structure. Our feature set is a projection of a 37 dimensional feature vector onto a single dimension applied at each branch of the decision tree, using the Fisher linear discriminant. We have produced co-channel speech from the TIMIT database which is used for training and testing. Preliminary results, at signal to interference ratio of 0 dB, have produced classification accuracy of 82.6%, 73.45%, and 68.24% on male/female, male/male and female/female mixtures respectively
  • Keywords
    Bayes methods; decision theory; learning (artificial intelligence); pattern classification; speech recognition; 37 dimensional feature vector; Bayesian classifier; Fisher linear discriminant; VSDA; binary tree decision structure; classification accuracy; co-channel speech; decision tree; female/female voices; male/female voices; male/male voices; silence; supervised learning; unvoiced/unvoiced state; unvoiced/voiced state; voiced/unvoiced state; voiced/voiced state; voicing state determination; voicing state determination algorithm; Bayesian methods; Binary trees; Decision trees; Interference; Spatial databases; Speech; State estimation; Supervised learning; Testing; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.675441
  • Filename
    675441