• DocumentCode
    3426701
  • Title

    An iterative unsupervised learning method for information distillation

  • Author

    Kamangar, Kamand ; Hakkani-Tür, Dilek ; Tur, Gokhan ; Levit, Michael

  • Author_Institution
    Int. Comput. Sci. Inst., Berkeley, CA
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4949
  • Lastpage
    4952
  • Abstract
    Information distillation techniques are used to analyze and interpret large volumes of speech and text archives in multiple languages and produce structured information of interest to the user. In this work, we propose an iterative unsupervised sentence extraction method to answer open-ended natural language queries about an event. The approach consists of finding the subset of sentences that are very likely to be relevant or irrelevant for the query from candidate documents, and iteratively training a classification model using these examples. Our results indicate that performance of the system may be improved by around 30% relative in terms of F-measure, by using the proposed method.
  • Keywords
    iterative methods; query processing; unsupervised learning; F-measure; classification model; information distillation; iterative method; natural language queries; sentence extraction; speech; text archives; unsupervised learning; Computer science; Data mining; Information analysis; Information retrieval; Iterative methods; Machine learning; Natural languages; Speech analysis; Strips; Unsupervised learning; information distillation; machine learning; question answering; unsupervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518768
  • Filename
    4518768