DocumentCode :
3631780
Title :
Corpora building and processing
Author :
Marija Brkic;Maja Matetic;Igor Jugo
Author_Institution :
University of Rijeka, Department of Informatics, Croatia
fYear :
2009
Firstpage :
251
Lastpage :
254
Abstract :
Creativity is a basic feature of a language. Therefore, it is perfectly possible to create a completely new context that has never occurred before. This feature allows us to express our ideas, thoughts, knowledge and fears, but it also complicates the idea of human-machine communication. Since it became obvious that natural languages cannot be formalized and described as a whole, the idea of combining linguistic knowledge and corpora has arisen. The combination of these techniques has proven to give the best results and our research is based on that notion. Since data sparsity poses a huge problem, this work presents a practical solution in overcoming data sparsity problem and gives a detailed account of an advanced data processing technique.
Keywords :
"Natural languages","Databases","Weather forecasting","Search engines","Dictionaries","Telephony","ISO standards","Computer networks","Information retrieval","Learning systems"
Publisher :
ieee
Conference_Titel :
Human System Interactions, 2009. HSI ´09. 2nd Conference on
ISSN :
2158-2246
Print_ISBN :
978-1-4244-3959-1
Electronic_ISBN :
2158-2254
Type :
conf
DOI :
10.1109/HSI.2009.5090987
Filename :
5090987
Link To Document :
بازگشت