Title :
Exploring Technical Phrase Frames from Research Paper Titles
Author :
Win, Yuzana ; Masada, Tomonari
Author_Institution :
Grad. Sch. of Eng., Nagasaki Univ., Nagasaki, Japan
Abstract :
This paper proposes a method for exploring technical phrase frames by extracting word n-grams that match our information needs and interests from research paper titles. Technical phrase frames, the outcome of our method, are phrases with wildcards that may be substituted for any technical term. Our method, first of all, extracts word trigrams from research paper titles and constructs a co-occurrence graph of the trigrams. Even by simply applying Page Rank algorithm to the co-occurrence graph, we obtain the trigrams that can be regarded as technical key phrases at the higher ranks in terms of Page Rank score. In contrast, our method assigns weights to the edges of the co-occurrence graph based on Jaccard similarity between trigrams and then apply weighted Page Rank algorithm. Consequently, we obtain widely different but more interesting results. While the top-ranked trigrams obtained by unweighted Page Rank have just a self-contained meaning, those obtained by our method are technical phrase frames, i.e., A word sequence that forms a complete technical phrase only after putting a technical word (or words) before or/and after it. We claim that our method is a useful tool for discovering important phrase logical patterns, which can expand query keywords for improving information retrieval performance and can also work as candidate phrasings in technical writing to make our research papers attractive.
Keywords :
graph theory; query processing; Jaccard similarity; Page Rank algorithm; cooccurrence graph; information retrieval; query keywords; research paper titles; technical phrase frames; trigrams; word n-grams extraction; word sequence; Algorithm design and analysis; Data mining; Feature extraction; Information retrieval; Natural language processing; Pragmatics; Probability; Jaccard similarity; PageRank; keyphrase extraction; phrase frames; word n-grams;
Conference_Titel :
Advanced Information Networking and Applications Workshops (WAINA), 2015 IEEE 29th International Conference on
Conference_Location :
Gwangiu
Print_ISBN :
978-1-4799-1774-7
DOI :
10.1109/WAINA.2015.37