DocumentCode :
259484
Title :
An Expansion Method of XML Element Retrieval Techniques into Web Documents
Author :
Keyaki, Atsushi ; Miyazaki, Jun ; Hatano, Kenji
Author_Institution :
Grad. Sch. of Inf. Sci. & Eng., Tokyo Inst. of Technol., Tokyo, Japan
fYear :
2014
fDate :
Aug. 31 2014-Sept. 4 2014
Firstpage :
853
Lastpage :
858
Abstract :
In this paper, we propose a method for extending XML element retrieval techniques to Web documents. XML element retrieval techniques return partial documents (sub-documents) as search results, and they are expected to be applicable to structured documents other than XML documents, notably Web documents. The difficulty is that the physical document structures of Web documents are often disorganized, because Web documents are generated not to manage data but to be rendered in a Web browser. Web documents also contain much content that is incomprehensible to human readers. To address the challenges caused by these features, we propose 1) a method for reconstructing document structures according to the logical structure of the content and 2) a filter for removing unimportant content that does not convey useful information to users. Our experimental evaluations showed that the proposed method improved search accuracy compared with both a naive XML element retrieval approach and a document retrieval approach.
Keywords :
Internet; XML; information retrieval; online front-ends; Web browser; Web document retrieval approach; XML documents; XML element retrieval techniques; Containers; Erbium; HTML; Reconstruction algorithms; Sections; Standards; XML; Web documents; XML element retrieval; filter; logical document structure; physical document structure;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Advanced Applied Informatics (IIAI-AAI), 2014 IIAI 3rd International Conference on
Conference_Location :
Kitakyushu
Print_ISBN :
978-1-4799-4174-2
Type :
conf
DOI :
10.1109/IIAI-AAI.2014.170
Filename :
6913414