مرکز منطقه ای اطلاع رساني علوم و فناوري - Scalable learning and knowledge discovery via adaptive sampling

DocumentCode :

596614

Title :

Scalable learning and knowledge discovery via adaptive sampling

Author :

Jianhua Chen

Author_Institution :

Dept. of Comput. Sci., Louisiana State Univ. Baton Rouge, Baton Rouge, LA, USA

fYear :

2012

fDate :

18-20 Oct. 2012

Firstpage :

412

Lastpage :

417

Abstract :

Scalability is an important issue in data mining and knowledge discovery in real-world applications where hugedata sets often render ordinary learning algorithms infeasible. As an important technique for parameter estimation and hypothesis testing widely used in statistical analysis, random sampling can be exploited to address the issue of scalable learning and knowledge discovery. Adaptive sampling is typically more efficient than traditional batch sampling methods because it can determine the sample size based on the samples seen so far. Recently a new adaptive sampling method for estimating the mean of a Bernoulli random variable was proposed in [2], which was empirically shown to require significantly lower sample size (i.e., the number of sampled instances) while maintaining competitive accuracy and confidence when compared with existing approaches. This paper presents theoretical analysis of properties of the proposed sampling method, as well as a brief outline on how to utilize the proposed sampling method to develop an efficient ensemble learning method with Boosting.

Keywords :

data mining; learning (artificial intelligence); parameter estimation; random processes; sampling methods; Bernoulli random variable; adaptive sampling method; data mining; hypothesis testing; knowledge discovery; mean estimation; ordinary learning algorithms; parameter estimation; scalable learning; statistical analysis; Accuracy; Algorithm design and analysis; Boosting; Classification algorithms; Estimation; Knowledge discovery; Sampling methods;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Advanced Computational Intelligence (ICACI), 2012 IEEE Fifth International Conference on

Conference_Location :

Nanjing

Print_ISBN :

978-1-4673-1743-6

Type :

conf

DOI :

10.1109/ICACI.2012.6463196

Filename :

6463196

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=596614