Title :
Acquisition of useful expressions from English research papers
Author :
Sakai, Yuta ; Sugiki, Kenji ; Matsubara, Shigeki
Author_Institution :
Grad. Sch. of Inf. Sci., Nagoya Univ., Nagoya, Japan
Abstract :
This paper proposes a method for extracting useful expressions from English research papers. The method extracts sequences of words from research papers and refine them into phrasal expressions (PEs). We use base-phrases for acquiring such the expressions. The method extracts PEs from the set of sequences of base-phrases by using three kinds of statistical information: frequency, length, and the number of kinds of the succeeding base-phrases. In our experiment using 1,232 research papers, the precision of acquisition at the top-200 was 62.0%. The precision was higher than all of the baselines, and therefore, we confirmed the feasibility of our method.
Keywords :
information retrieval; natural languages; search engines; statistical analysis; English research paper; base-phrase; phrasal expression acquisition; statistical information; word sequence extraction; Data mining; Dictionaries; Frequency; Marketing and sales; Natural language processing; Search engines;
Conference_Titel :
Natural Language Processing, 2009. SNLP '09. Eighth International Symposium on
Conference_Location :
Bangkok
Print_ISBN :
978-1-4244-4138-9
Electronic_ISBN :
978-1-4244-4139-6
DOI :
10.1109/SNLP.2009.5340948