DocumentCode
3691696
Title
Heuristic-based part-of-speech tagging of source code identifiers and comments
Author
Reem S. Alsuhaibani;Christian D. Newman;Michael L. Collard;Jonathan I. Maletic
Author_Institution
Computer Science Kent State University Kent, OH, USA
fYear
2015
fDate
9/1/2015 12:00:00 AM
Firstpage
1
Lastpage
6
Abstract
An approach for using heuristics and static program analysis information to markup part-of-speech for program identifiers is presented. It does not use a natural language part-ofspeech tagger for identifiers within the code. A set of heuristics is defined akin to natural language usage of identifiers usage in code. Additionally, method stereotype information, which is automatically derived, is used in the tagging process. The approach is built using the srcML infrastructure and adds part-of-speech information directly into the srcML markup.
Keywords
"Speech","Object recognition","Tagging","Natural languages","Conferences","Software","Computational linguistics"
Publisher
ieee
Conference_Titel
Mining Unstructured Data (MUD), 2015 IEEE 5th Workshop on
Type
conf
DOI
10.1109/MUD.2015.7327960
Filename
7327960
Link To Document