DocumentCode :
2091514
Title :
Using electronic texts for an annotated corpus building
Author :
Galicia-Haro, Sofía N.
Author_Institution :
Computational Res. Center, Nat. Polytech. Inst., Mexico City, Mexico
fYear :
2003
fDate :
8-12 Sept. 2003
Firstpage :
26
Lastpage :
32
Abstract :
Collections of texts with annotations on several levels are useful resources. They are employed for diverse tasks in theoretical research and natural language processing. The most important collections are dedicated to English. However, huge efforts are required to develop the corresponding resource for other languages. In this work, we present the initial steps for the compilation of an annotated Mexican corpus using electronic texts obtained from the Web.
Keywords :
Internet; computational linguistics; natural languages; text analysis; World Wide Web; annotated Mexican corpus; annotated corpus building; electronic texts; natural language processing; Acceleration; Computer languages; Information analysis; Laboratories; Natural language processing; Natural languages; Problem-solving; Proposals; Text analysis; Text processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science, 2003. ENC 2003. Proceedings of the Fourth Mexican International Conference on
Print_ISBN :
0-7695-1915-6
Type :
conf
DOI :
10.1109/ENC.2003.1232870
Filename :
1232870
Link To Document :
بازگشت