مرکز منطقه ای اطلاع رساني علوم و فناوري - A Coincidence-Based Test for Uniformity Given Very Sparsely Sampled Discrete Data

DocumentCode :

864673

Title :

A Coincidence-Based Test for Uniformity Given Very Sparsely Sampled Discrete Data

Author :

Paninski, L.

Author_Institution :

Dept. of Stat., Columbia Univ., New York, NY

Volume :

Issue :

fYear :

2008

Firstpage :

4750

Lastpage :

4755

Abstract :

How many independent samples N do we need from a distribution p to decide that p is epsiv-distant from uniform in an L₁ sense, Sigma_i=1 ^m |p(i) - 1/m| > epsiv? (Here m is the number of bins on which the distribution is supported, and is assumed known a priori.) Somewhat surprisingly, we only need N epsiv² Gt m ^1/2 to make this decision reliably (this condition is both sufficient and necessary). The test for uniformity introduced here is based on the number of observed ldquocoincidencesrdquo (samples that fall into the same bin), the mean and variance of which may be computed explicitly for the uniform distribution and bounded nonparametrically for any distribution that is known to be epsiv-distant from uniform. Some connections to the classical birthday problem are noted.

Keywords :

information theory; statistical distributions; classical birthday problem; coincidence-based test; convex bounds; hypothesis testing; minimax; uniform distribution; very sparsely sampled discrete data; Computer errors; Computer science; Distributed computing; Engineering profession; Entropy; Minimax techniques; Statistical analysis; Statistical distributions; Testing; Upper bound; Convex bounds; hypothesis testing; minimax;

fLanguage :

English

Journal_Title :

Information Theory, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9448

Type :

jour

DOI :

10.1109/TIT.2008.928987

Filename :

4626074

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=864673