Title :
The ACL data collection initiative
Author :
Liberman, Mark Y.
Author_Institution :
Dept. of Linguistics, Pennsylvania Univ., Philadelphia, PA, USA
Abstract :
The Data Collection Initiative, established by the Association for Computational Linguistics, is described. Its aim is to acquire a large and diverse text corpus, to transform it into a common format based on the Standardized General Markup Language, and to make it available for scientific research at low cost with minimal restrictions. The rationale for this effort is discussed
Keywords :
computational linguistics; ACL data collection initiative; Computational Linguistics; Standardized General Markup Language; scientific research; text corpus; Computational linguistics; Costs; Databases; Europe; Machine assisted indexing; Natural languages; SGML; Speech analysis; Speech recognition; Text recognition;
Conference_Titel :
Information Technology, 1990. 'Next Decade in Information Technology', Proceedings of the 5th Jerusalem Conference on (Cat. No.90TH0326-9)
Conference_Location :
Jerusalem
Print_ISBN :
0-8186-2078-1
DOI :
10.1109/JCIT.1990.128361