Title :
The retrieval research of non-adjacent keywords in Chinese corpus — A case study of “Yi…Jiu…” construction
Author :
Xiaoping Tan ; Lijiao Yang
Author_Institution :
Inst. of Chinese Inf. Process., Beijing Normal Univ., Beijing, China
Abstract :
Corpus Concordancing is a popular research topic. The function of retrieving data from corpus by providing non-adjacent keywords is widely used by users. However, the precision of retrieval results is not very high because the machine can´t recognize the relationship of the non-adjacent keywords. To deal with this problem, this paper proposed a rule-based method for the “Yi...Jiu...” construction, which could exclude the unrelated data, even though the data include the keywords. The experiments show that the precision is close to 82%.
Keywords :
information retrieval; knowledge based systems; natural language processing; word processing; Chinese corpus; Chinese language; Yi...Jiu... construction; nonadjacent keywords retrieval research; rule-based method; Accuracy; Educational institutions; Electronic mail; Information processing; Legged locomotion; Testing; Corpus; Yi…Jiu…; concordancing; non-adjacent keywords; retrieval;
Conference_Titel :
Asian Language Processing (IALP), 2014 International Conference on
Conference_Location :
Kuching
DOI :
10.1109/IALP.2014.6973507