Title :
Using electronic texts for an annotated corpus building
Author :
Galicia-Haro, Sofía N.
Author_Institution :
Computational Res. Center, Nat. Polytech. Inst., Mexico City, Mexico
Abstract :
Collections of texts with annotations on several levels are useful resources. They are employed for diverse tasks in theoretical research and natural language processing. The most important collections are dedicated to English. However, huge efforts are required to develop the corresponding resource for other languages. In this work, we present the initial steps for the compilation of an annotated Mexican corpus using electronic texts obtained from the Web.
Keywords :
Internet; computational linguistics; natural languages; text analysis; World Wide Web; annotated Mexican corpus; annotated corpus building; electronic texts; natural language processing; Acceleration; Computer languages; Information analysis; Laboratories; Natural language processing; Natural languages; Problem-solving; Proposals; Text analysis; Text processing;
Conference_Titel :
Computer Science, 2003. ENC 2003. Proceedings of the Fourth Mexican International Conference on
Print_ISBN :
0-7695-1915-6
DOI :
10.1109/ENC.2003.1232870