Title :
Constructing Corpus for Query-Oriented XML Text Summarization
Author :
Wu, Shihan ; Liu, Dexi ; Jiao, Xianpei
Author_Institution :
Jiangxi Key Lab. of Data & Knowledge Eng., Jiangxi Univ. of Finance &Econ., Nanchang, China
Abstract :
XML Retrieval is becoming the focus study of the field of Information Retrieval and Database. Summarization of the results which come from the XML search engines will alleviate the read burden of user´s. However, as the basis of this study, the construction of the query-oriented XML text summarization corpus has not yet received enough attention. In this paper, we introduce our works on constructing this kind of corpus, including the selection of topics and XML elements/documents, construction process and the feature of the constructed corpus. Up to now, the corpus has 25 English query topics, including 422 elements for summarization, and 32 Chinese topics which including 402 elements. For each topic, 4 pieces of extracted summaries and 4 pieces of generated summaries are made manually by 4 experts.
Keywords :
XML; query processing; search engines; text analysis; Chinese topics; English query topics; XML documents; XML elements; XML retrieval; XML search engine; corpus construction; information retrieval; query-oriented XML text summarization; result summarization; Databases; Education; Feature extraction; Machine learning; Pragmatics; Security; XML; Automatic Summarization; Corpus; Query-oriented; XML;
Conference_Titel :
Management of e-Commerce and e-Government (ICMeCG), 2010 Fourth International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-8507-9
DOI :
10.1109/ICMeCG.2010.18