• DocumentCode
    2562978
  • Title

    Exploring Technical Phrase Frames from Research Paper Titles

  • Author

    Win, Yuzana ; Masada, Tomonari

  • Author_Institution
    Grad. Sch. of Eng., Nagasaki Univ., Nagasaki, Japan
  • fYear
    2015
  • fDate
    24-27 March 2015
  • Firstpage
    558
  • Lastpage
    563
  • Abstract
    This paper proposes a method for exploring technical phrase frames by extracting word n-grams that match our information needs and interests from research paper titles. Technical phrase frames, the outcome of our method, are phrases with wildcards that may be substituted for any technical term. Our method, first of all, extracts word trigrams from research paper titles and constructs a co-occurrence graph of the trigrams. Even by simply applying Page Rank algorithm to the co-occurrence graph, we obtain the trigrams that can be regarded as technical key phrases at the higher ranks in terms of Page Rank score. In contrast, our method assigns weights to the edges of the co-occurrence graph based on Jaccard similarity between trigrams and then apply weighted Page Rank algorithm. Consequently, we obtain widely different but more interesting results. While the top-ranked trigrams obtained by unweighted Page Rank have just a self-contained meaning, those obtained by our method are technical phrase frames, i.e., A word sequence that forms a complete technical phrase only after putting a technical word (or words) before or/and after it. We claim that our method is a useful tool for discovering important phrase logical patterns, which can expand query keywords for improving information retrieval performance and can also work as candidate phrasings in technical writing to make our research papers attractive.
  • Keywords
    graph theory; query processing; Jaccard similarity; Page Rank algorithm; cooccurrence graph; information retrieval; query keywords; research paper titles; technical phrase frames; trigrams; word n-grams extraction; word sequence; Algorithm design and analysis; Data mining; Feature extraction; Information retrieval; Natural language processing; Pragmatics; Probability; Jaccard similarity; PageRank; keyphrase extraction; phrase frames; word n-grams;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications Workshops (WAINA), 2015 IEEE 29th International Conference on
  • Conference_Location
    Gwangiu
  • Print_ISBN
    978-1-4799-1774-7
  • Type

    conf

  • DOI
    10.1109/WAINA.2015.37
  • Filename
    7096236