Title :
Database of grammatical sentences of Croatian language
Author :
Tepes, B. ; Mateljan, Vladimir
Author_Institution :
Fac. of Philosophy, Zagreb Univ., Croatia
Abstract :
The paper describes work on a linguistic database of grammatical sentences of the Croatian language. In the database, sentences are represented as constituent structure trees, and words are stored with their features and feature values. The main project concerned with language databases is the Penn Treebank project of the Linguistic Data Consortium (A. Bies et al., 1995). The database is the result of theoretical research in computational linguistics and its application to the Croatian language. It can also be accessed through the Internet.
Keywords :
computational linguistics; context-free grammars; database management systems; linguistics; natural languages; word processing; Croatian language; Internet; Linguistic Data Consortium; Penn Treebank project; constituent structure trees; grammatical sentence database; language databases; linguistic database; words; Computer interfaces; Context modeling; Information technology; Maximum likelihood estimation; Probability; Production systems; Spatial databases
Conference_Title :
Information Technology Interfaces, 2001. ITI 2001. Proceedings of the 23rd International Conference on
Print_ISBN :
953-96769-3-2
DOI :
10.1109/ITI.2001.938051