• DocumentCode
    331808
  • Title

    Stochastic method for automatic recognition of topics

  • Author

    Scheffler, K. ; du Preez, J.A.

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Stellenbosch Univ., South Africa
  • fYear
    1998
  • fDate
    7-8 Sep 1998
  • Firstpage
    67
  • Lastpage
    72
  • Abstract
    The field of topic spotting in conversational speech has been receiving growing attention in recent years. The goal of this field is to develop a system that can identify topics of interest among large volumes of speech data. In order to cope with practical considerations, researchers are concentrating on phoneme-based methods, which eliminate the need for topic specific data to be hand-transcribed. A number of different phoneme-based approaches have recently been proposed, of which the Euclidean nearest wrong neighbour (ENWN) system (Kuhn et al, 1997) has yielded the most promising experimental results. A phoneme-based topic spotter makes use of a phoneme recogniser to transcribe the speech data. The main problem of this approach is that the accuracy of such transcriptions is very poor. Typically, only between 40 and 50 percent of the phonemes are transcribed correctly. It is therefore important to compensate for the low quality of the transcriptions. However, existing techniques make no use of statistical modelling to compensate for transcription errors. In this research, a stochastic method for automatic recognition of topics (SMART) was developed to address the above mentioned problem. The resulting system is an extension of the existing ENWN algorithm. Comparative results indicate an improvement of SMART over ENWN characterized by a 26% reduction in ROC (receiver operating characteristic) error area. This difference was found to be statistically significant
  • Keywords
    natural languages; speech processing; speech recognition; stochastic processes; ENWN algorithm; Euclidean nearest wrong neighbour system; SMART; automatic topic recognition; conversational speech; phoneme recogniser; phoneme-based methods; phoneme-based topic spotter; receiver operating characteristic; speech data; statistical modelling; stochastic method; transcription errors; transcriptions; Africa; Broadcasting; Character recognition; Dynamic programming; Speech recognition; Stochastic processes; System performance; System testing; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications and Signal Processing, 1998. COMSIG '98. Proceedings of the 1998 South African Symposium on
  • Conference_Location
    Rondebosch
  • Print_ISBN
    0-7803-5054-5
  • Type

    conf

  • DOI
    10.1109/COMSIG.1998.736924
  • Filename
    736924