Author_Institution :
Software Inst., Peking Univ., Beijing, China
Abstract :
Domain concepts and taxonomic relationships are an essential part of a domain ontology. They are used in a number of applications, including natural language processing, information retrieval, knowledge management and so on. Nowadays, with the continuous permeation of various kinds of Internet knowledge applications, numerous new concepts are emerged and released on to the Internet. So, the Internet has become an invaluable source of new concepts for almost every possible domain of knowledge. In order to ensure the domain ontologies keep pace with fast changing knowledge, we proposed an web searching based concepts and taxonomic relationships discovering approach. By our approach, the potential concepts on the Internet, which are taxonomically related with the give seeds concepts, can be discovered autonomously and iteratively. In this paper, the approach and a corresponding application in Chinese web pages are reported in detail. The experiments show that, our approach can catch the related domain concepts precisely, meanwhile, can reject irrelevant concepts and figure out the domain knowledge border definitely.
Keywords :
Internet; Web sites; information retrieval; knowledge management; natural language processing; ontologies (artificial intelligence); pattern classification; search engines; text analysis; Chinese Web page; Internet knowledge application; Web searching based concept; domain concept; domain ontology; hyponymy relation; information retrieval; iterative Web searching; knowledge management; natural language processing; taxonomic relationships discovering approach; text relevance classification; Biology; Chemicals; Computer languages; Internet; Ontologies; Web pages; domain knowledge; ontology learning; taxonomy learning;