Abstract :
We classify an image by generating a list of salient visual features present in the luminance channel, and matching the resulting variable-length feature list to category-specific generative models for such features. To facilitate quick computation, we use thresholded Viola-Jones rectangular features, each represented by a five-dimensional descriptor For each image category, a probability distribution for feature-lists is given by a latent conditional independence (LCI) model and classification is maximum likelihood. On the NIST tax forms database (Dimmick et al., 1991), where intra-category variations include variable scan-lightness, skew, noise, and machine-printed form-filling, our method improves performance over published results, while requiring very little training data, and without relying on an extensive set of handcrafted features
Keywords :
image classification; image segmentation; statistical distributions; category-specific generative model; image category; image classification; latent conditional independence model; luminance channel; probability distribution; thresholded Viola-Jones rectangular features; visual features; Atomic measurements; Error analysis; Image analysis; Image classification; Image databases; Image generation; NIST; Probability distribution; Spatial databases; Text analysis;