Title :
An Automatic Approach for Domain-Specific Dictionary Expansion Based on Web Mining
Author :
Sun, Yueheng ; Ni, Weijie ; Men, Rui
Author_Institution :
Sch. of Comput. Sci. & Technol., Tianjin Univ., Tianjin, China
fDate :
Nov. 30 2009-Dec. 1 2009
Abstract :
This paper proposes an automatic expansion approach for an existing domain-specific dictionary based on Web mining. Using the terminology pairs in a dictionary as queries, we first extract snippet fragments that potentially contain the Chinese translations of current English phrases. Based on matching patterns extracted from the Web, we can get the most likely translation for a new English phrase. Finally we use the vector space model to filter out the translation equivalents belong to our expected domain, and a domain-specific dictionary expanded by these new terminology pairs is thereby built. The performance of our approach is verified on a dictionary of finance and accounting, and a precision between 85-90% is achieved on considering different thresholds.
Keywords :
Internet; data mining; dictionaries; language translation; natural language processing; query processing; Chinese translations; English phrases; Web mining; automatic expansion approach; domain-specific dictionary; domain-specific dictionary expansion; pattern matching; queries; snippet fragment extraction; vector space model; Data mining; Dictionaries; Filtering; Filters; Information retrieval; Natural languages; Pattern matching; Search engines; Terminology; Web mining; automatic expansion; domain-specific dictionary; matching patterns; translation equivalents; web mining;
Conference_Titel :
Knowledge Acquisition and Modeling, 2009. KAM '09. Second International Symposium on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3888-4
DOI :
10.1109/KAM.2009.54