Title :
Computing Aggregations from Linguistic Web Resources: A Case Study in Czech Republic Sector/Traffic Accidents
Author :
Jan Dedek;Peter Vojtá
Author_Institution :
Dept. of Software Eng., Charles Univ. in Prague, Prague
Abstract :
Semantic computing aims to connect the intention of humans with computational content. We present a study of a problem of this type: extract information from large number of similar linguistic Web resources to compute various aggregations (sum, average,...). In our motivating example we calculate the sum of injured people in traffic accidents in a certain period in a certain region. We restrict ourselves to pages written in Czech language. Our solution exploits existing linguistic tools created originally for a syntactically annotated corpus, Prague Dependency Treebank (PDT 2.0). We propose a solutions which learns tree queries to extract data from PDT2.0 annotations and transforms the data in an ontology. This method is not limited to Czech language and can be used with any structured linguistic representation. We present a proof of concept of our method. This enables to compute various aggregations over linguistic Web resources.
Keywords :
"Data mining","Ontologies","Accidents","Chapters","Driver circuits","Humans","Fires"
Conference_Titel :
Advanced Engineering Computing and Applications in Sciences, 2008. ADVCOMP ´08. The Second International Conference on
DOI :
10.1109/ADVCOMP.2008.17