Discovering Domain Concepts and Hyponymy Relations by Text Relevance Classifying Based Iterative Web Searching

Author

Lili Mou ; Ge Li ; Zhi Jin ; Yangyang Lu ; Yiyang Hao

Author_Institution

Software Inst., Peking Univ., Beijing, China

Volume

1

fYear

2012

fDate

4-7 Dec. 2012

Firstpage

213

Lastpage

222

Abstract

Domain concepts and taxonomic relationships are an essential part of a domain ontology. They are used in a number of applications, including natural language processing, information retrieval, knowledge management and so on. Nowadays, with the continuous permeation of various kinds of Internet knowledge applications, numerous new concepts are emerged and released on to the Internet. So, the Internet has become an invaluable source of new concepts for almost every possible domain of knowledge. In order to ensure the domain ontologies keep pace with fast changing knowledge, we proposed an web searching based concepts and taxonomic relationships discovering approach. By our approach, the potential concepts on the Internet, which are taxonomically related with the give seeds concepts, can be discovered autonomously and iteratively. In this paper, the approach and a corresponding application in Chinese web pages are reported in detail. The experiments show that, our approach can catch the related domain concepts precisely, meanwhile, can reject irrelevant concepts and figure out the domain knowledge border definitely.

Keywords

Internet; Web sites; information retrieval; knowledge management; natural language processing; ontologies (artificial intelligence); pattern classification; search engines; text analysis; Chinese Web page; Internet knowledge application; Web searching based concept; domain concept; domain ontology; hyponymy relation; information retrieval; iterative Web searching; knowledge management; natural language processing; taxonomic relationships discovering approach; text relevance classification; Biology; Chemicals; Computer languages; Internet; Ontologies; Web pages; domain knowledge; ontology learning; taxonomy learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Software Engineering Conference (APSEC), 2012 19th Asia-Pacific

Conference_Location

Hong Kong

ISSN

1530-1362

Print_ISBN

978-1-4673-4930-7

Type

conf

DOI

10.1109/APSEC.2012.96

Filename

6462656