DocumentCode :
3190036
Title :
Exploiting Network Structure for Active Inference in Collective Classification
Author :
Rattigan, Matthew J. ; Maier, Marc ; Jensen, David
Author_Institution :
Univ. of Massachusetts, Amherst
fYear :
2007
fDate :
28-31 Oct. 2007
Firstpage :
429
Lastpage :
434
Abstract :
Active inference seeks to maximize classification performance while minimizing the amount of data that must be labeled ex ante. This task is particularly relevant in the context of relational data, where statistical dependencies among instances can be exploited to improve classification accuracy. We show that efficient methods for indexing network structure can be exploited to select high-value nodes for labeling. This approach substantially outperforms random selection and selection based on simple measures of local structure. We demonstrate the relative effectiveness of this selection approach through experiments with a relational neighbor classifier on a variety of real and synthetic data sets, and identify the necessary characteristics of the data set that allow this approach to perform well.
Keywords :
database indexing; inference mechanisms; pattern classification; relational databases; active inference; collective classification; network structure indexing; relational data; Computer networks; Computer science; Conferences; Data mining; Electronic mail; Humans; Indexing; Inference algorithms; Labeling; Laboratories;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops, 2007. ICDM Workshops 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
Print_ISBN :
978-0-7695-3019-2
Electronic_ISBN :
978-0-7695-3033-8
Type :
conf
DOI :
10.1109/ICDMW.2007.124
Filename :
4476703
Link To Document :
بازگشت