DocumentCode :
2040981
Title :
The research of proofreading for the Uigur character
Author :
Gulila
Author_Institution :
Dept. of Electron., Xinjiang Univ., Urmuqi, China
Volume :
2
fYear :
2001
fDate :
2001
Firstpage :
874
Abstract :
Uigur language belongs to the Altaic language branch of the Turkic language. This paper analyses common error types of pre-proofread text of Uigur,and discusses how to establish a corpus, rule base, part-of-speech tagging and word class ambiguity syncopate etc. It also presents a method with part-of-speech tagging of word class grammatical character, a combined method of a rule base and corpus statistics
Keywords :
grammars; text analysis; text editing; Altaic language; Turkic language; Uigur language; corpus; corpus statistics; part-of-speech tagging; pre-proofread text errors; proofreading; rule base; word class ambiguity syncopate; word class grammatical character; Character recognition; Computer errors; Dictionaries; Educational institutions; Information science; Keyboards; Libraries; Natural languages; Speech analysis; Tagging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics, 2001 IEEE International Conference on
Conference_Location :
Tucson, AZ
ISSN :
1062-922X
Print_ISBN :
0-7803-7087-2
Type :
conf
DOI :
10.1109/ICSMC.2001.973026
Filename :
973026
Link To Document :
بازگشت