Title :
New kernels for analyzing multimodal data in multimedia using kernel machines
Author :
Aradhye, Hrishikesh ; Dorai, Chitra
Author_Institution :
SRI Int., Menlo Park, CA, USA
Abstract :
Research in automated analysis of digital media content has led to a large collection of low-level feature extractors, such as face detectors, videotext extractors, speech and speaker identifiers, people/vehicle trackers, and event locators. These media metadata are often symbolic rather than continuous-valued, and pose significant difficulty to subsequent tasks such as classification and dimensionality reduction which traditionally deal with continuous-valued data. This paper proposes a novel mechanism that extends tasks traditionally limited to continuous-valued feature spaces, such as (a) dimensionality reduction, (b) de-noising, and (c) clustering, to domains with symbolic features. To this end, we introduce new kernels based on well-known distance metrics, and prove Mercer validity of these kernels for analyzing symbolic feature spaces. We demonstrate their usefulness within the context of kernel-space methods such as Kernel PCA and SVM, in classifying machine learning datasets from the UCI repository and in temporal clustering and tracking of videotext in multimedia. We show that the generalized kernels help capture information from symbolic feature spaces, visualize symbolic data, and aid tasks such as classification and clustering, and therefore are useful in multimodal analysis of multimedia.
Keywords :
data analysis; data visualisation; feature extraction; learning (artificial intelligence); learning automata; multimedia databases; pattern clustering; principal component analysis; Kernel PCA; Mercer validity; SVM; UCI repository; automated analysis; classification; de-noising; dimensionality reduction; distance metrics; kernel machines; machine learning datasets; multimedia; multimodal analysis; multimodal data analysis; symbolic data visualization; symbolic feature spaces; temporal clustering; videotext tracking; Data analysis; Data mining; Detectors; Event detection; Face detection; Feature extraction; Kernel; Speech analysis; Vehicle detection; Vehicles;
Conference_Titel :
Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7803-7304-9
DOI :
10.1109/ICME.2002.1035368