Title :
Benchmarking of semantic annotation with conditional random fields
Author :
Grilheres, Bruno ; Beauce, Christophe ; Canu, Stephane ; Brunessaux, Stephan
Author_Institution :
EADS DCS, CHRIS, Val de Reuil, France
fDate :
Nov. 30 2005-Dec. 1 2005
Abstract :
The Semantic Web requires document annotation with various meta-data. But for end-users, doing it manually would be extremely time consuming and unfeasible for billion of documents. To reduce this burden, Information Extraction techniques should be applied. This paper describes the use of a recent probabilistic sequence model, Conditional Random Fields, to annotate semi-automatically sets of documents. It introduces the model principles and how to configure it to maximise the detection capabilities. The approach is evaluated on a task of event detection in news press articles related to terrorism events (the MUC-LAT corpus).
Keywords :
document handling; information analysis; information retrieval; meta data; random processes; semantic Web; conditional random fields; detection capability; document annotation; event detection; information extraction; meta-data; news press article; probabilistic sequence model; semantic Web; semantic annotation; terrorism event;
Conference_Titel :
Integration of Knowledge, Semantics and Digital Media Technology, 2005. EWIMT 2005. The 2nd European Workshop on the (Ref. No. 2005/11099)
Conference_Location :
London
Print_ISBN :
0-86341-595-4