Title :
Category-associated collocative concept primitives extraction
Author :
Zhejie Chi ; Quan Zhang
Author_Institution :
Inst. of Acoust., Beijing, China
Abstract :
Collocation is studied as an essential linguistic phenomenon in traditional natural language processing. Similarly, collocative concept primitives are introduced in the HNC Concept Primitive Space to represent pairs of concept primitives that co-occur frequently. Because concept primitives usually carry category information, collocative concept primitives can be studied together with categories. To explore the collocation phenomenon in the field of HNC and to apply collocative information to language processing, this paper presents a two-stage approach for extracting category-associated collocative concept primitives from a classification corpus. In the first stage, collocative concept primitives are extracted from each sub-category corpus; in the second, category-associated collocative concept primitives are extracted from the summarized corpus, yielding a list of category-associated collocative concept primitives for each category. Our experiments show that the extracted items are consistent with reality and are of significance.
Keywords :
feature extraction; natural language processing; pattern classification; statistical distributions; category information; chi-squared test; classification corpus; collocative concept primitive extraction; Acoustics; Agriculture; Computational linguistics; Economics; Frequency measurement; Pragmatics; Presses; HNC; collocation extraction; concept primitives; simple-ll measure
Conference_Title :
Asian Language Processing (IALP), 2014 International Conference on
Conference_Location :
Kuching
DOI :
10.1109/IALP.2014.6973475