DocumentCode :
682361
Title :
Study on Tibetan new meaning word extraction
Author :
Sun Yuan
Author_Institution :
Sch. of Inf. Eng., Minzu Univ. of China, Beijing, China
fYear :
2013
fDate :
23-24 Dec. 2013
Firstpage :
404
Lastpage :
407
Abstract :
This paper proposes a model for automatically extracting Tibetan new meaning words. Using a dynamic Tibetan corpus built from 2009 to 2012, which covers more than 16 Tibetan online media outlets in Tibet, Qinghai, Sichuan, Gansu and Yunnan, we investigate the key techniques of Tibetan new meaning word extraction: (1) constructing a Tibetan new word electronic dictionary; (2) using the word frequency ratio to find new meaning words; (3) using word co-occurrence techniques to extract and analyze Tibetan new meaning words.
Keywords :
natural language processing; Gansu; Qinghai; Sichuan; Tibet; Tibetan new meaning word extraction; Tibetan new word electronic dictionary; Yunnan; dynamic Tibetan corpus; word cooccurrence techniques; word frequency ratio; Data mining; Dictionaries; Educational institutions; Information processing; Standardization; Statistical analysis; Tibetan new meaning word; co-occurrence word; extraction method;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Instrumentation and Measurement, Sensor Network and Automation (IMSNA), 2013 2nd International Symposium on
Conference_Location :
Toronto, ON
Type :
conf
DOI :
10.1109/IMSNA.2013.6743301
Filename :
6743301