• DocumentCode
    3130868
  • Title

    A robust word boundary detection algorithm with application to speech recognition

  • Author

    Agaiby, H. ; Moir, T.J.

  • Author_Institution
    Dept. of Electron. Eng. & Phys., Paisley Univ., UK
  • Volume
    2
  • fYear
    1997
  • fDate
    2-4 Jul 1997
  • Firstpage
    753
  • Abstract
    A new robust word boundary detection algorithm is described that performs well under a variety of noise conditions including competing talkers. The algorithm uses the direction of the signal as the main criterion to differentiate between wanted-speech and background noise. A `viewing zone´ is assumed within which a speech source is considered desired-speech and signals coming from outside this zone are considered noise. The algorithm uses the time delay between signals received at two microphones to estimate the direction of the dominant signal. This estimate together with an estimate of the coherence function between the two signals as well as measures of the signal energy are used to determine word boundaries. Two state-of-the-art speech recognisers were used to evaluated the performance of the algorithm. For each recogniser, the recognition accuracy is measured with manually labelled noisy speech and compared when speech is automatically processed using the proposed algorithm. The results showed that the algorithm performs as well as manual labelling under signal-to-noise ratios as low as 0 dB
  • Keywords
    array signal processing; direction-of-arrival estimation; microphones; noise; signal detection; speech processing; speech recognition; SNR; background noise; coherence function; competing talkers; desired-speech; direction estimation; dominant signal; manually labelled noisy speech; microphones; noise conditions; recognition accuracy; robust word boundary detection algorithm; signal direction; signal energy; signal-to-noise ratios; speech recognition; speech source; time delay; viewing zone; Automatic speech recognition; Background noise; Delay effects; Delay estimation; Detection algorithms; Energy measurement; Microphones; Noise robustness; Speech enhancement; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Signal Processing Proceedings, 1997. DSP 97., 1997 13th International Conference on
  • Conference_Location
    Santorini
  • Print_ISBN
    0-7803-4137-6
  • Type

    conf

  • DOI
    10.1109/ICDSP.1997.628461
  • Filename
    628461