DocumentCode :
1787083
Title :
Name deciphering in unrelated languages: The case study of Farsi and English
Author :
Bakhshaei, Somayeh ; Khadivi, Shahram ; Safabakhsh, Reza ; Zafarian, Atefeh
Author_Institution :
Comput. Eng. & Inf. Technol. Dept., Amirkabir Univ. of Technol., Tehran, Iran
fYear :
2014
fDate :
9-11 Sept. 2014
Firstpage :
529
Lastpage :
534
Abstract :
In this work we propose an unsupervised model for deciphering names in two unrelated languages, English and Farsi. The proposed model is a generative non-parametric model that is a customized version of [3] model for name extraction. We show that this unsupervised model is able to achieve competitive results in comparison with a supervised model. Although the accuracy of the unsupervised model is lower than the supervised model, using this model makes it possible to produce list of parallel names without parallel corpora.
Keywords :
natural language processing; English; Farsi; generative nonparametric model; name deciphering; name extraction; parallel names; unrelated languages; unsupervised model; Accuracy; Bayes methods; Ciphers; Computational modeling; Data models; Probability distribution; Vectors; Deciphering; English-Farsi; Name extraction; Scarce resource languages; Unrelated languages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Telecommunications (IST), 2014 7th International Symposium on
Conference_Location :
Tehran
Print_ISBN :
978-1-4799-5358-5
Type :
conf
DOI :
10.1109/ISTEL.2014.7000761
Filename :
7000761
Link To Document :
بازگشت