DocumentCode
1787083
Title
Name deciphering in unrelated languages: The case study of Farsi and English
Author
Bakhshaei, Somayeh ; Khadivi, Shahram ; Safabakhsh, Reza ; Zafarian, Atefeh
Author_Institution
Comput. Eng. & Inf. Technol. Dept., Amirkabir Univ. of Technol., Tehran, Iran
fYear
2014
fDate
9-11 Sept. 2014
Firstpage
529
Lastpage
534
Abstract
In this work we propose an unsupervised model for deciphering names in two unrelated languages, English and Farsi. The proposed model is a generative non-parametric model that is a customized version of [3] model for name extraction. We show that this unsupervised model is able to achieve competitive results in comparison with a supervised model. Although the accuracy of the unsupervised model is lower than the supervised model, using this model makes it possible to produce list of parallel names without parallel corpora.
Keywords
natural language processing; English; Farsi; generative nonparametric model; name deciphering; name extraction; parallel names; unrelated languages; unsupervised model; Accuracy; Bayes methods; Ciphers; Computational modeling; Data models; Probability distribution; Vectors; Deciphering; English-Farsi; Name extraction; Scarce resource languages; Unrelated languages;
fLanguage
English
Publisher
ieee
Conference_Titel
Telecommunications (IST), 2014 7th International Symposium on
Conference_Location
Tehran
Print_ISBN
978-1-4799-5358-5
Type
conf
DOI
10.1109/ISTEL.2014.7000761
Filename
7000761
Link To Document