DocumentCode :
1760180
Title :
Self-Adaptive Semantic Focused Crawler for Mining Services Information Discovery
Author :
Hai Dong ; Hussain, Farookh Khadeer
Author_Institution :
Sch. of Inf. Syst., Curtin Univ. of Technol., Perth, WA, Australia
Volume :
10
Issue :
2
fYear :
2014
fDate :
41760
Firstpage :
1616
Lastpage :
1626
Abstract :
It is well recognized that the Internet has become the largest marketplace in the world, and online advertising is very popular with numerous industries, including the traditional mining service industry where mining service advertisements are effective carriers of mining service information. However, service users may encounter three major issues - heterogeneity, ubiquity, and ambiguity, when searching for mining service information over the Internet. In this paper, we present the framework of a novel self-adaptive semantic focused crawler - SASF crawler, with the purpose of precisely and efficiently discovering, formatting, and indexing mining service information over the Internet, by taking into account the three major issues. This framework incorporates the technologies of semantic focused crawling and ontology learning, in order to maintain the performance of this crawler, regardless of the variety in the Web environment. The innovations of this research lie in the design of an unsupervised framework for vocabulary-based ontology learning, and a hybrid algorithm for matching semantically relevant concepts and metadata. A series of experiments are conducted in order to evaluate the performance of this crawler. The conclusion and the direction of future work are given in the final section.
Keywords :
Internet; advertising; indexing; information retrieval; learning (artificial intelligence); meta data; mining industry; ontologies (artificial intelligence); vocabulary; Internet; SASF crawler; Web environment; ambiguity; concept matching; heterogeneity; metadata; mining service advertisements; mining service industry; mining service information formatting; mining service information indexing; mining services information discovery; online advertising; self-adaptive semantic focused crawler; ubiquity; unsupervised framework; vocabulary-based ontology learning; Advertising; Business; Crawlers; Industries; Internet; Ontologies; Semantics; Mining service industry; ontology learning; semantic focused crawler; service advertisement; service information discovery;
fLanguage :
English
Journal_Title :
Industrial Informatics, IEEE Transactions on
Publisher :
ieee
ISSN :
1551-3203
Type :
jour
DOI :
10.1109/TII.2012.2234472
Filename :
6384736
Link To Document :
بازگشت