Title :
Linguistic Web: Bridging between text information sources and Semantic Web
Author :
Jingmin, Hao ; Lejian, Liao
Author_Institution :
Sch. of Comput. Sci. & Technol., Beijing Inst. of Technol., Beijing
Abstract :
The goal of the Semantic Web is to enable computers to understand and process data that the current Web can only display. However, the huge amount of data on the current Web cannot all be annotated with semantic labels in a short time. This paper proposes the concept of a Linguistic Web, which provides a bridge between the text information sources of HTML Web pages and the Semantic Web. Ontologies are the core of the Semantic Web, but at present it is rather difficult to automatically acquire the world knowledge or domain-specific knowledge needed to build them. Compared with semantic knowledge based on a domain-specific ontology, the grammatical knowledge of a text can be acquired easily, and it is more determinate. In Information Retrieval, searching on keywords alone is not enough. In this situation, it is worth considering whether some Web applications can employ grammatical knowledge to improve performance. The Linguistic Web focuses on building a linguistic ontology that provides grammatical knowledge for Web applications. A linguistic ontology based on HPSG (Head-driven Phrase Structure Grammar) has been built.
Keywords :
grammars; hypermedia markup languages; information retrieval; ontologies (artificial intelligence); semantic Web; text analysis; HTML; Web pages; head-driven phrase structure grammar; linguistic Web; ontologies; semantic knowledge; text information sources; Automation; Buildings; Intelligent control; Knowledge acquisition; Knowledge engineering; OWL; Linguistic Ontology
Conference_Title :
Intelligent Control and Automation, 2008. WCICA 2008. 7th World Congress on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4244-2113-8
Electronic_ISBN :
978-1-4244-2114-5
DOI :
10.1109/WCICA.2008.4593035