Some analyses of interval data

Author

Billard, Lynne

Author_Institution

Dept. of Stat., Univ. of Georgia, Athens, GA

fYear

2008

fDate

23-26 June 2008

Firstpage

3

Lastpage

11

Abstract

Contemporary computers bring us very large datasets, datasets which can be too large for those same computers to analyse properly. One approach is to aggregate these data (by some suitably scientific criteria) to provide more manageably-sized datasets. These aggregated data will perforce be symbolic data consisting of lists, intervals, histograms, etc. Now an observation is a p-dimensional hypercube or Cartesian product of p distributions in R^p, instead of the p-dimensional point in in R^p of classical data. Other data can be naturally symbolic. We give a brief overview of interval-valued data and show briefly that it is important to use symbolic analysis methodology since, e.g., analyses based on classical surrogates ignore some of the information in the dataset.

Keywords

data analysis; statistical distributions; symbol manipulation; very large databases; Cartesian product; data aggregation; interval-valued data analysis; p distributions; p-dimensional hypercube; symbolic data analysis methodology; very large datasets; Aggregates; Data analysis; Data mining; Histograms; History; Hospitals; Hypercubes; Information analysis; Random variables; Statistical analysis;

fLanguage

English

Publisher

ieee

Conference_Titel

Information Technology Interfaces, 2008. ITI 2008. 30th International Conference on

Conference_Location

Dubrovnik

ISSN

1330-1012

Print_ISBN

978-953-7138-12-7

Electronic_ISBN

1330-1012

Type

conf

DOI

10.1109/ITI.2008.4588377

Filename

4588377