DocumentCode :
2347620
Title :
Web-based technical term translation pairs mining for patent document translation
Author :
Ren, Feiliang ; Zhu, Jingbo ; Wang, Huizhen
Author_Institution :
Northeastern Univ., Shenyang, China
fYear :
2010
fDate :
21-23 Aug. 2010
Firstpage :
1
Lastpage :
8
Abstract :
This paper proposes a simple but powerful approach for obtaining technical term translation pairs in patent domain from Web automatically. First, several technical terms are used as seed queries and submitted to search engineering. Secondly, an extraction algorithm is proposed to extract some key word translation pairs from the returned web pages. Finally, a multi-feature based evaluation method is proposed to pick up those translation pairs that are true technical term translation pairs in patent domain. With this method, we obtain about 8,890,000 key word translation pairs which can be used to translate the technical terms in patent documents. And experimental results show that the precision of these translation pairs are more than 99%, and the coverage of these translation pairs for the technical terms in patent documents are more than 84%.
Keywords :
Internet; data mining; document handling; information retrieval; natural language processing; patents; Web-based technical term translation; extraction algorithm; multifeature based evaluation method; patent document translation; seed queries; Chemistry; Patents; Term translation; key word extraction; key word selection; machine translation; patent document translation; web-based;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering (NLP-KE), 2010 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-6896-6
Type :
conf
DOI :
10.1109/NLPKE.2010.5587775
Filename :
5587775
Link To Document :
بازگشت