DocumentCode
1652544
Title
Advances in Chinese Natural Language Processing and Language resources
Author
Tao, Jianhua ; Zheng, Fang ; Li, Ya ; Ya Li
Author_Institution
Nat. Lab. of Pattern Recognition, Chinese Acad. of Sci., Beijing, China
fYear
2009
Firstpage
13
Lastpage
18
Abstract
In the past few years, there have been a significant number of activities in the area of Chinese natural language processing (CNLP) including the language resource construction and assessment. This paper summarized the major tasks and key technologies in natural language processing (NLP), which encompasses both text processing and speech processing by extension. The Chinese language resources, including linguistic data, speech data, evaluation data and language toolkits which are elaborately constructed for CNLP related fields and some language resource consortiums are also introduced in this paper. Aimed to promote the development of corpus-based technologies, many resource consortiums commit themselves to collect, create and distribute many kinds of resources. The goal of these organizations is to set up a universal and well accepted Chinese resources database so that to push forward the CNLP.
Keywords
natural language processing; speech processing; Chinese language resources; Chinese natural language processing; Chinese resources database; speech processing; text processing; Data mining; Databases; Frequency; Information retrieval; Laboratories; Natural language processing; Natural languages; Speech analysis; Speech processing; Tagging; Chinese Natural Language Processing; Language Resource; Resource Consortium;
fLanguage
English
Publisher
ieee
Conference_Titel
Speech Database and Assessments, 2009 Oriental COCOSDA International Conference on
Conference_Location
Urumqi
Print_ISBN
978-1-4244-4400-7
Type
conf
DOI
10.1109/ICSDA.2009.5278384
Filename
5278384
Link To Document