Title :
Name deciphering in unrelated languages: The case study of Farsi and English
Author :
Bakhshaei, Somayeh ; Khadivi, Shahram ; Safabakhsh, Reza ; Zafarian, Atefeh
Author_Institution :
Comput. Eng. & Inf. Technol. Dept., Amirkabir Univ. of Technol., Tehran, Iran
Abstract :
In this work we propose an unsupervised model for deciphering names in two unrelated languages, English and Farsi. The proposed model is a generative non-parametric model that customizes the model of [3] for name extraction. We show that this unsupervised model achieves competitive results in comparison with a supervised model. Although the accuracy of the unsupervised model is lower than that of the supervised model, using it makes it possible to produce a list of parallel names without parallel corpora.
Keywords :
natural language processing; English; Farsi; generative nonparametric model; name deciphering; name extraction; parallel names; unrelated languages; unsupervised model; Accuracy; Bayes methods; Ciphers; Computational modeling; Data models; Probability distribution; Vectors; Deciphering; English-Farsi; Name extraction; Scarce resource languages; Unrelated languages;
Conference_Title :
Telecommunications (IST), 2014 7th International Symposium on
Conference_Location :
Tehran
Print_ISBN :
978-1-4799-5358-5
DOI :
10.1109/ISTEL.2014.7000761