Title :
The pretreatment of Chinese character database based on IS010646
Author :
Xian, Wu ; Zhu, Qiaoming ; Li, Peifeng ; Qian, Peide
Author_Institution :
Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
Abstract :
When developing Chinese character input methods and its associated software, we are always suffering from the multiple Chinese character encodings. The article firstly introduces a universal Chinese character encoding on simplified-Chinese platform and traditional-Chinese platform. Then it discusses how to build an 118N Chinese character database. After analyzing the relationship between native Chinese character encodings and this ISO standard, the article describes how to convert between these native Chinese character encoding and ISO/IEC 10646-1:2000 standard. At last, the article gives an instance to use in the real-time screen dictionary system.
Keywords :
IEC standards; ISO standards; dictionaries; encoding; information retrieval systems; natural languages; 118N Chinese character database; Chinese character input methods; ISO standard; ISO-IEC 10646-1:2000 standard; character set; multiple Chinese character encodings; native Chinese character encoding; real-time screen dictionary system; simplified-Chinese platform; traditional-Chinese platform; universal Chinese character encoding; Code standards; Databases; Dictionaries; Encoding; ISO standards; Information processing; Operating systems; Real time systems; Software standards; Standards publication;
Conference_Titel :
Computer and Information Technology, 2004. CIT '04. The Fourth International Conference on
Print_ISBN :
0-7695-2216-5
DOI :
10.1109/CIT.2004.1357347