DocumentCode :
3343260
Title :
A process to mining issues of software repositories
Author :
Bautista, Ana Maria ; San Feliu, Tomas
Author_Institution :
Dept. Lenguajes y Sist. Inf. e Ing. de Software, Univ. Politec. de Madrid, Madrid, Spain
fYear :
2015
fDate :
17-20 June 2015
Firstpage :
1
Lastpage :
6
Abstract :
Public software repositories offer a great opportunity for researchers. GitHub, one of the most widely used public repositories, hosts more than 10 million projects and includes a defect tracking system. This paper describes a process developed to extract defects from GitHub repositories. Besides the process itself, it presents the difficulties that arose during data mining. The data obtained are intended to be used with neural networks for defect prediction.
Keywords :
data mining; neural nets; program diagnostics; software engineering; GitHub repository; data mining; defect prediction; defect tracking system; issue mining process; neural networks; public software repository; Benchmark testing; Biological neural networks; Data mining; Encoding; Internet; Media; Software; Defect Tracking; Defects Prediction; Repositories;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Systems and Technologies (CISTI), 2015 10th Iberian Conference on
Conference_Location :
Aveiro
Type :
conf
DOI :
10.1109/CISTI.2015.7170552
Filename :
7170552