• DocumentCode
    1787083
  • Title

    Name deciphering in unrelated languages: The case study of Farsi and English

  • Author

    Bakhshaei, Somayeh ; Khadivi, Shahram ; Safabakhsh, Reza ; Zafarian, Atefeh

  • Author_Institution
    Comput. Eng. & Inf. Technol. Dept., Amirkabir Univ. of Technol., Tehran, Iran
  • fYear
    2014
  • fDate
    9-11 Sept. 2014
  • Firstpage
    529
  • Lastpage
    534
  • Abstract
    In this work we propose an unsupervised model for deciphering names in two unrelated languages, English and Farsi. The proposed model is a generative non-parametric model that is a customized version of [3] model for name extraction. We show that this unsupervised model is able to achieve competitive results in comparison with a supervised model. Although the accuracy of the unsupervised model is lower than the supervised model, using this model makes it possible to produce list of parallel names without parallel corpora.
  • Keywords
    natural language processing; English; Farsi; generative nonparametric model; name deciphering; name extraction; parallel names; unrelated languages; unsupervised model; Accuracy; Bayes methods; Ciphers; Computational modeling; Data models; Probability distribution; Vectors; Deciphering; English-Farsi; Name extraction; Scarce resource languages; Unrelated languages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Telecommunications (IST), 2014 7th International Symposium on
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-4799-5358-5
  • Type

    conf

  • DOI
    10.1109/ISTEL.2014.7000761
  • Filename
    7000761