DocumentCode :
3036362
Title :
A binary-categorization approach for classifying multiple-record Web documents using application ontologies and a probabilistic model
Author :
Ng, Yiu-Kai ; Tang, June ; Goodrich, Michael
Author_Institution :
Dept. of Comput. Sci., Brigham Young Univ., Provo, UT, USA
fYear :
2001
fDate :
21-21 April 2001
Firstpage :
58
Lastpage :
65
Abstract :
The amount of information available on the World Wide Web has been increasing dramatically in recent years. To enhance speedy searching and retrieving Web documents of interest, researchers and practitioners have partially relied on various information retrieval techniques. We propose a probabilistic model to classify Web documents into relevant documents and irrelevant documents with respect to a particular application ontology, which is a conceptual-model snippet of standard ontologies. Our probabilistic model is based on multivariate statistical analysis and is different from the conventional probabilistic information retrieval models. The experiments we have conducted on a set of representative Web documents indicate that the proposed probabilistic model is promising in binary-categorization of multiple-record Web documents.
Keywords :
Internet; classification; information resources; information retrieval; probability; statistical analysis; Internet; Web document retrieval; World Wide Web; application ontologies; binary categorization approach; conceptual model; experiments; information retrieval; multiple-record Web document classification; multivariate statistical analysis; probabilistic model; searching; Application software; Computer science; Decision theory; Information retrieval; Internet; Ontologies; Organizing; Probability; Statistical analysis; Web sites;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database Systems for Advanced Applications, 2001. Proceedings. Seventh International Conference on
Conference_Location :
Hong Kong, China
Print_ISBN :
0-7695-0996-7
Type :
conf
DOI :
10.1109/DASFAA.2001.916365
Filename :
916365
Link To Document :
بازگشت