• DocumentCode
    419660
  • Title

    Spoken document classification with SVMs using linguistic unit weighting and probabilistic couplers

  • Author

    Iurgel, Uri ; Rigoll, Gerhard

  • Author_Institution
    Inst. of Inf. Technol., Duisburg-Essen Univ., Duisburg, Germany
  • Volume
    2
  • fYear
    2004
  • fDate
    23-26 Aug. 2004
  • Firstpage
    667
  • Abstract
    The task addressed by this paper is spoken document classification (SDC) of German TV news with support vector machines (SVMs). It shows the benefits of weighting different linguistic units when combined into one feature vector. Further experiments show that probabilistic SVMs (pSVMs) with couplers perform well on a SDC task. New couplers for multi-category classification, both for pSVMs and non-pSVMs, are discussed. They are easy to implement and show good and promising results. It turns out that using the distance instead of the decision value can be favorable. Theoretical justification is given for our approaches, and some results are explained theoretically.
  • Keywords
    document handling; natural languages; support vector machines; German TV news; linguistic unit weighting; multicategory classification; probabilistic couplers; spoken document classification; support vector machines; Automatic speech recognition; Computerized monitoring; Couplers; Indexing; Information technology; Man machine systems; Support vector machine classification; Support vector machines; TV; Voting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-2128-2
  • Type

    conf

  • DOI
    10.1109/ICPR.2004.1334347
  • Filename
    1334347