Title :
ME-Match: Tonal Grouping Based Approach in Cross-Script Name Matching
Author :
Phyu, Kyaw Zar Zar ; Tun, Khin Mar Lar
Author_Institution :
Univ. of Comput. Studies, Yangon, Myanmar
Abstract :
Even though matching between different scripts could be immensely useful for news organizations, author recognition with cross-script matches in digital libraries and homeland security, it is impossible to automatically match. Now, we propose a new approach, ME-match, for matching the proper names across different scripts. The foremost concept of our approach is to match them via phoneme strings. The main steps in ME-match are creation of bilingual pronouncing mapping, tokenization of query names, transformation of query names to IPA forms based on tonal grouping approach, searching possible various words in both scripts for each query IPA phoneme string, combination of various words to become full name strings and then searching names. The performance is measured by standard information-retrieval metrics: recall, precision, and f-measures.
Keywords :
information retrieval; natural language processing; string matching; ME-match; author recognition; bilingual pronouncing mapping; cross-script name matching; digital libraries; f-measures; homeland security; information-retrieval metrics; news organizations; query IPA phoneme string; query names; tonal grouping; tonal grouping approach; Dictionaries; Home computing; Humans; Information retrieval; Measurement standards; Natural languages; Software libraries; Speech analysis; Terrorism; Writing; approximate string matching; cross-script name matching; phoneme strings;
Conference_Titel :
Future Computer and Communication, 2009. ICFCC 2009. International Conference on
Conference_Location :
Kuala Lumpar
Print_ISBN :
978-0-7695-3591-3
DOI :
10.1109/ICFCC.2009.24