DocumentCode
1841006
Title
OOV Translation Mining from Mixed-Language Snippets from a Search Engine
Author
Yun-Qian Qu ; Jian-Min Yao ; Jun Sun ; Meng Sun
Author_Institution
Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou
fYear
2008
fDate
18-21 Nov. 2008
Firstpage
931
Lastpage
935
Abstract
In this paper, we describe an approach to OOV translation mining based on web search. Without comparable corpus or parallel corpus, we first expand the Chinese OOV phrase with English segments by a C-E dictionary. Then the OOV and the expansion will be submitted to the web search engine. From the snippets we can mine the translation of the OOV. Experiment shows that the algorithm is simple and the approach is feasible.
Keywords
data mining; dictionaries; language translation; natural languages; query processing; search engines; vocabulary; C-E dictionary; Chinese OOV phrase; OOV translation mining; Web search engine; mixed-language snippets; out-of-vocabulary; query expansion; Computer science; Data mining; Databases; Dictionaries; Registers; Search engines; Sun; Testing; Web pages; Web search; OOV; query expansion; search engine; translation mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Young Computer Scientists, 2008. ICYCS 2008. The 9th International Conference for
Conference_Location
Hunan
Print_ISBN
978-0-7695-3398-8
Electronic_ISBN
978-0-7695-3398-8
Type
conf
DOI
10.1109/ICYCS.2008.91
Filename
4709099
Link To Document