Title :
An MDL-based change-detection algorithm with its applications to learning piecewise stationary memoryless sources
Author :
Kanazawa, Hideko ; Yamanishi, Kenji
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
Abstract :
Kleinberg has proposed an algorithm for detecting bursts from a data sequence, which has turned out to be effective in the scenario of data mining, such as topic detection, change-detection. In this paper we extend Kleinberg´s algorithm in an information-theoretic fashion to obtain a new class of algorithms and apply it into learning of piecewise stationary memoryless sources (PSMSs). The keys of the proposed algorithm are; 1) the parameter space is discretized so that discretization scale depends on the Fisher information, and 2) the optimal path over the discretized parameter space is efficiently computed using the dynamic programming method so that the sum of the data and parameter description lengths is minimized on the basis of the MDL principle. We prove that an upper bound on the total code-length for the proposed algorithm asymptotically matches Merhav´s lower bound.
Keywords :
data mining; dynamic programming; learning (artificial intelligence); Kleinberg algorithm; MDL-based change detection algorithm; PSMS learning; data mining; data sequence; discretization scale; dynamic programming method; minimum description length; parameter description length; parameter space; piecewise stationary memoryless source learning; topic detection; Erbium;
Conference_Titel :
Information Theory Workshop (ITW), 2012 IEEE
Conference_Location :
Lausanne
Print_ISBN :
978-1-4673-0224-1
Electronic_ISBN :
978-1-4673-0222-7
DOI :
10.1109/ITW.2012.6404736