• DocumentCode
    2962845
  • Title

    Extracting Events and Temporal Expressions from Text

  • Author

    UzZaman, Naushad ; Allen, James F.

  • Author_Institution
    Comput. Sci. Dept., Univ. of Rochester, Rochester, NY, USA
  • fYear
    2010
  • fDate
    22-24 Sept. 2010
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Extracting temporal information from raw text is fundamental for deep language understanding, and key to many applications like question answering, information extraction, and document summarization. Our long-term goal is to build complete temporal structure of documents and apply the temporal structure in other applications like textual entailment, question answering, dialog systems or others. In this paper, we present a first step, a system for extracting event, event features, temporal expression and its normalized values from raw text. Our system is a combination of deep semantic parsing with extraction rules, Markov Logic Network classifiers and Conditional Random Field classifiers. To compare with existing systems, we evaluated our system on the TimeBank corpus. Our system outperforms or does equally well with all existing systems that evaluate on the TimeBank corpus and our performance is very close to inter-annotator agreement of the TimeBank annotators.
  • Keywords
    Markov processes; grammars; pattern classification; temporal logic; text analysis; Markov logic network classifier; TimeBank corpus; conditional random field classifier; event extraction; extraction rule; raw text; semantic parsing; temporal information extraction; Data mining; Feature extraction; Filtering; Helium; Markov processes; Ontologies; Semantics; TRIOS; TRIPS; TempEval; TimeBank; event extraction; information extraction; temporal expression extraction; temporal information processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing (ICSC), 2010 IEEE Fourth International Conference on
  • Conference_Location
    Pittsburgh, PA
  • Print_ISBN
    978-1-4244-7912-2
  • Electronic_ISBN
    978-0-7695-4154-9
  • Type

    conf

  • DOI
    10.1109/ICSC.2010.45
  • Filename
    5628792