مرکز منطقه ای اطلاع رساني علوم و فناوري - Research on cross-language text similarity calculation

DocumentCode :

3664416

Title :

Research on cross-language text similarity calculation

Author :

Sun Yuan;Zhao Qian

Author_Institution :

School of Information Engineering, Minzu University of China, Minority Languages Branch, National Language, Resource and Monitoring Research Center, Beijing, China

fYear :

2015

fDate :

5/1/2015 12:00:00 AM

Firstpage :

423

Lastpage :

426

Abstract :

Cross-language text similarity calculation is a critical and fundamental problem in natural language processing. It is widely used in cross-language research, such as cross-language information retrieval. In this paper, we used the LDA (Latent Dirichlet Allocation) model to calculate similarities of Tibetan and Chinese texts at the topic level. Through topic modelling and forecasting, the texts are mapped to the feature space of topics. This method reduced the dimensions of text space vector and improved the speed and efficiency of computation.

Keywords :

"Dictionaries","Computational modeling","Computational linguistics","Accuracy","Natural language processing","Internet"

Publisher :

ieee

Conference_Titel :

Electronics Information and Emergency Communication (ICEIEC), 2015 5th International Conference on

Print_ISBN :

978-1-4799-7283-8

Type :

conf

DOI :

10.1109/ICEIEC.2015.7284573

Filename :

7284573

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3664416