Title :
Compound record clustering algorithm for design pattern detection by decision tree learning
Author :
Jing Dong ; Sun, Yongtao ; Zhao, Yajing
Author_Institution :
Computer Science Department, University of Texas at Dallas, Richardson, 75083, USA
Abstract :
Recovering design patterns applied in a system can help refactoring the system. Machine learning algorithms have been successfully applied in mining data patterns. However, one of the main obstacles of applying them for design pattern detection is the difficulty of collecting training examples. Unlike other applications, a design pattern instance typically includes a group of classes with certain relationships. Thus, the possible combinations of the group of classes can be enormous which results in huge training sets making the application of machine learning algorithms impracticable. In this paper, we propose an innovative method using matrix transformations to cluster the training examples. Our method can significantly reduce the size of training examples, thus making it possible to be efficiently applied in machine learning algorithm.
Keywords :
Algorithm design and analysis; Classification tree analysis; Clustering algorithms; Computer science; Data mining; Decision trees; Machine learning; Machine learning algorithms; Software systems; Weather forecasting; decision tree; design pattern; detection; machine learning; training example;
Conference_Titel :
Information Reuse and Integration, 2008. IRI 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV, USA
Print_ISBN :
978-1-4244-2659-1
Electronic_ISBN :
978-1-4244-2660-7
DOI :
10.1109/IRI.2008.4583034