Title :
Similarity Assessment of Program Samples Based on Theory of Fuzzy
Author :
Yuxiang Li ; Yinliang Zhao ; Bin Liu
Author_Institution :
Dept. of Comput. Sci. & Technol., Xi´an Jiaotong Univ., Xi´an, China
Abstract :
Using machine learning has proven effective at choosing the right set of optimizations for a particular program. For machine learning techniques to be most effective, compiler writers have to develop expressive means of characterizing the program being optimized. The start-of-art techniques for characterizing programs include using a fixed-length feature vector of either source code features extracted during compile time or performance counters collected when running the program. According to the program features, similarity values are calculated to assess the similar degree. In this paper, we introduce a novel way of assessing the similarity of two program samples using Theory of Fuzzy. We firstly calculate the Euclidean Distance of two different program samples as the input, and then assess the overall similarity degree as well as respective similarity degree, using corresponding Fuzzy Function. The graph-based characterization technique is cited, which shows great advantages than state-of-the-art characterization techniques found in the literature. By using the sequences predicted to be the best by the graph-based model, we compare the known sequences with unknown ones, and calculate the similarity between them, then obtain the similarity degree. Using these degrees, we are directed to make better determination to cluster or classify.
Keywords :
feature extraction; fuzzy set theory; graph theory; learning (artificial intelligence); program compilers; source code (software); Euclidean Distance; compile time; compiler writers; fixed-length feature vector; fuzzy function; graph-based characterization technique; graph-based model; machine learning techniques; performance counters; program characterization; program features; program sample similarity assessment; similar degree assessment; similarity degree Assessment; similarity values; source code feature; theory of fuzzy; Euclidean distance; Feature extraction; Flow graphs; Optimization; Predictive models; Program processors; Vectors; Similarity Assessment; Theory of Fuzzy; code features;
Conference_Titel :
Computational Science and Engineering (CSE), 2013 IEEE 16th International Conference on
Conference_Location :
Sydney, NSW
DOI :
10.1109/CSE.2013.40