DocumentCode
3343260
Title
A process to mining issues of software repositories
Author
Bautista, Ana Maria ; San Feliu, Tomas
Author_Institution
Dept. Lenguajes y Sist. Inf. e Ing. de Software, Univ. Politec. de Madrid, Madrid, Spain
fYear
2015
fDate
17-20 June 2015
Firstpage
1
Lastpage
6
Abstract
Public software repositories offer a great opportunity for researchers. GitHub is a repository with more than 10 million projects. GitHub has an implementation of a defect tracking system. This paper describes the process developed to extract defects from GitHub repository, one of the most widely used public repositories. In this work, besides of the process, it is presented the appeared difficulties, during data mining. With obtained data, it is pretended to apply neural networks to get defects prediction.
Keywords
data mining; neural nets; program diagnostics; software engineering; GitHub repository; data mining; defect prediction; defect tracking system; issue mining process; neural networks; public software repository; Benchmark testing; Biological neural networks; Data mining; Encoding; Internet; Media; Software; Defect Tracking; Defects Prediction; Repositories;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Systems and Technologies (CISTI), 2015 10th Iberian Conference on
Conference_Location
Aveiro
Type
conf
DOI
10.1109/CISTI.2015.7170552
Filename
7170552
Link To Document