Title :
Text mining and software engineering: an integrated source code and document analysis approach
Author :
Witte, R. ; Li, Q. ; Zhang, Y. ; Rilling, J.
Author_Institution :
Inst. fur Programmstrukturen und Datenorganisation, Univ. Karlsruhe, Karlsruhe
fDate :
2/1/2008
Abstract :
Documents written in natural languages constitute a major part of the artefacts produced during the software engineering life cycle. Especially during software maintenance or reverse engineering, semantic information conveyed in these documents can provide important knowledge for the software engineer. A text mining system capable of populating a software ontology with information detected in documents is presented. A particular novelty is the integration of results from automated source code analysis into a natural language processing pipeline, which allows software artefacts represented in code and in natural language to be cross-linked on a semantic level.
Keywords :
natural language processing; reverse engineering; software maintenance; text analysis; document analysis; integrated source code analysis; natural language processing pipeline; software engineering; text mining system
Journal_Title :
Software, IET
DOI :
10.1049/iet-sen:20070110