Title :
Performance evaluation of categorizing technical support requests using advanced K-Means algorithm
Author :
Nadaf, Mubina A. ; Patil, Sachin S.
Author_Institution :
Dept. of CSE, Rajarambapu Inst. of Technol., Islampur, India
Abstract :
Technical support service providers receive thousands of customer queries daily. Traditionally, such organizations discard the data due to lack of storage capacity. However, value of storing such data is needed for the better results of analysis and to improve the closure rate of the daily customer queries. Data mining is the process of finding important and meaningful information, patterns through the large amount of data. Clustering is used as one of the best concept for data analysis, using machine learning approach with mathematical and statistical methods. Cluster analysis is widely applicable for practical applications in emerging trends in data mining. Analysis of clustering algorithms such as K-Means, Dirichlet, Fuzzy K-Means Canopy algorithms is done by means of the practical approach, in this research work. Performance of algorithm is observed based on the execution or computational time and results are compared with each of these algorithms. This paper proposes the streaming K-Means algorithm which resolves the queries as it arrives and analyses the data. Cosine distance measure plays an important role in clustering dataset. Sum of Square error is measured to check the quality of the cluster.
Keywords :
data analysis; data mining; learning (artificial intelligence); mathematical analysis; pattern clustering; query processing; statistical analysis; technical support services; Dirichlet; cluster analysis; clustering algorithms; cosine distance measure; customer queries; data analysis; data mining; data storing; fuzzy k-means canopy algorithms; machine learning approach; mathematical methods; performance evaluation; statistical methods; storage capacity; sum of square error; technical support requests; technical support service providers; Algorithm design and analysis; Clustering algorithms; Data mining; File systems; Machine learning algorithms; Partitioning algorithms; Sparks; Data mining; Machine Learning; MapReduce; Streaming K-Means;
Conference_Titel :
Advance Computing Conference (IACC), 2015 IEEE International
Conference_Location :
Banglore
Print_ISBN :
978-1-4799-8046-8
DOI :
10.1109/IADCC.2015.7154740