• DocumentCode
    2039539
  • Title

    Connectionist models for sentence-based text extracts

  • Author

    Demiros, Iason ; Antonopoulos, Vassilios ; Georgantopoulos, Byron ; Triantafyllou, Yannis ; Piperidis, Stelios

  • Author_Institution
    Inst. for Language & Speech Process., Athens, Greece
  • Volume
    4
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    2648
  • Abstract
    This paper addresses the problem of creating a summary by extracting a set of sentences that are likely to represent the content of a document. A small scale experiment is conducted leading to the compilation of an evaluation corpus for the Greek language. Two models of sentence extraction are then described, along the lines of shallow linguistic analysis, feature combination and machine learning. Both models are based on term extraction and statistical filtering. After extracting the individual features of the text, we apply them to two neural networks that classify each sentence depending on its feature vector, the term weight being the feature with the best discriminant capacity. A three-layer feedforward network trained with the highly popular backpropagation algorithm and a competitive learning self-organizing map characterized by the formation of a topographic map, both trained on a small manually annotated corpus of summaries, perform the sentence extraction task. Both methods could be used for rapid light information retrieval-oriented summarization
  • Keywords
    feedforward neural nets; learning (artificial intelligence); self-organising feature maps; backpropagation algorithm; learning self-organizing map; machine learning; neural networks; self-organizing maps; shallow linguistic analysis; statistical filtering; summarization; term extraction; topographic map; Bayesian methods; Data mining; Feature extraction; Information filtering; Information filters; Information retrieval; Machine learning; Natural languages; Neural networks; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 2001 IEEE International Conference on
  • Conference_Location
    Tucson, AZ
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-7087-2
  • Type

    conf

  • DOI
    10.1109/ICSMC.2001.972964
  • Filename
    972964