DocumentCode :
2211320
Title :
An empirical study on the risks of using off-the-shelf techniques for processing mailing list data
Author :
Bettenburg, Nicolas ; Shihab, Emad ; Hassan, Ahmed E.
Author_Institution :
Software Anal. & Intell. Lab., Queen´´s Univ., Kingston, ON, Canada
fYear :
2009
fDate :
20-26 Sept. 2009
Firstpage :
539
Lastpage :
542
Abstract :
Mailing list repositories contain valuable information about the history of a project. Research is starting to mine this information to support developers and maintainers of long-lived software projects. However, such information exists as unstructured data that needs special processing before it can be studied. In this paper, we identify several challenges that arise when using off-the-shelf techniques for processing mailing list data. Our study highlights the importance of proper processing of mailing list data to ensure accurate research results.
Keywords :
electronic mail; software maintenance; electronic mail; mailing list data processing; mailing list repositories; off-the-shelf techniques; unstructured data; Computer networks; Data mining; Electronic mail; File servers; History; Information analysis; Risk analysis; Software maintenance; Tag clouds; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Maintenance, 2009. ICSM 2009. IEEE International Conference on
Conference_Location :
Edmonton, AB
ISSN :
1063-6773
Print_ISBN :
978-1-4244-4897-5
Electronic_ISBN :
1063-6773
Type :
conf
DOI :
10.1109/ICSM.2009.5306383
Filename :
5306383
Link To Document :
بازگشت