DocumentCode :
2708013
Title :
A new matching algorithm for Chinese place names
Author :
Cheng, Gang ; Wang, Fei ; Lv, Haiyang ; Zhang, Yinling
Author_Institution :
Key Lab. of Mine Spatial Inf. Technol. of SBSM, Henan Polytech. Univ., Jiaozuo, China
fYear :
2011
fDate :
24-26 June 2011
Firstpage :
1
Lastpage :
4
Abstract :
Matching algorithm for Place Names is one of the most important research topics in the construction of digital gazetteers. Taking account of the morphological characteristics of Chinese Place Names and ontology method, we first pretreat the Chinese Place Names to remove illegal characters, decompose the Chinese Place Names into special names and generic terms, and then use both the Levenshtein Distance(LD) method and the semantic distance method to calculate relationship of similarity between them. Finally, we get the comprehensive similarity for Chinese Place Names by calculating the weighted average for both similarity of special names and that of generic terms. In this study, the example shows that similarity index for special names, generic terms and the overall place names, enhance the completeness and accuracy of the place names matching theory.
Keywords :
geographic information systems; natural language processing; ontologies (artificial intelligence); pattern matching; Chinese place name; Levenshtein distance method; comprehensive similarity; digital gazetteer; illegal character removal; matching algorithm; morphological characteristic; ontology; place names matching theory; semantic distance method; special name similarity; weighted average; Accuracy; Algorithm design and analysis; Encoding; Indexes; Ontologies; Semantics; Chinese Place Names; matching algorithm; morphological characteristics of place names; ontology; semantic similarity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Geoinformatics, 2011 19th International Conference on
Conference_Location :
Shanghai
ISSN :
2161-024X
Print_ISBN :
978-1-61284-849-5
Type :
conf
DOI :
10.1109/GeoInformatics.2011.5980801
Filename :
5980801
Link To Document :
بازگشت