Title :
Mining events and new name translations from online daily news
Author :
Lam, Wai ; Cheung, Pik-Shan ; Huang, Ruizhang
Author_Institution :
Dept. of Syst. Eng. & Eng. Manage., Chinese Univ. of Hong Kong, Shatin, China
Abstract :
We develop a system for mining events and unseen name translations from online daily Web news. This system first automatically discovers bilingual events by analyzing the content of the news stories. The discovered event can be treated as comparable bilingual news and can be used for generating name candidates. A name matching algorithm is developed to discover new unseen name translations based on phonetic and context clues. The experimental results show that our system is effective for mining new knowledge and information from online Web news.
Keywords :
Internet; data mining; digital libraries; electronic publishing; language translation; linguistics; meta data; natural languages; speech processing; string matching; bilingual event discovery; bilingual news; context clues; event mining system; metadata generation; multilingual information processing; name candidate generation; name matching algorithm; new knowledge mining; online daily Web news mining; phonetics; unseen name translations; Councils; Event detection; Information processing; Information retrieval; Information systems; Internet; Permission; Research and development management; Software libraries; Systems engineering and theory;
Conference_Titel :
Digital Libraries, 2004. Proceedings of the 2004 Joint ACM/IEEE Conference on
Print_ISBN :
1-58113-832-6
DOI :
10.1109/JCDL.2004.1336138