Title :
Corpora building and processing
Author :
Marija Brkic;Maja Matetic;Igor Jugo
Author_Institution :
University of Rijeka, Department of Informatics, Croatia
Abstract :
Creativity is a basic feature of language: it is always possible to create a completely new context that has never occurred before. This feature allows us to express our ideas, thoughts, knowledge and fears, but it also complicates human-machine communication. Since it became obvious that natural languages cannot be formalized and described as a whole, the idea of combining linguistic knowledge with corpora has arisen. This combination has proven to give the best results, and our research is based on that notion. Since data sparsity poses a huge problem, this work presents a practical solution for overcoming the data sparsity problem and gives a detailed account of an advanced data processing technique.
Keywords :
"Natural languages","Databases","Weather forecasting","Search engines","Dictionaries","Telephony","ISO standards","Computer networks","Information retrieval","Learning systems"
Conference_Titel :
Human System Interactions, 2009. HSI '09. 2nd Conference on
Print_ISBN :
978-1-4244-3959-1
Electronic_ISBN :
2158-2254
DOI :
10.1109/HSI.2009.5090987