• DocumentCode
    3342481
  • Title

    Intertextual distance for Arabic texts classification

  • Author

    Ayadi, R. ; Maraoui, M. ; Zrigui, M.

  • Author_Institution
    UTIC Lab., ISIM-Sfax Inst., Sfax, Tunisia
  • fYear
    2009
  • fDate
    9-12 Nov. 2009
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Our researches works are interested on the application of the intertextual distance theory on the Arabic language as a tool for the classification of texts. This theory assumes the classification of texts according to criteria of lexical statistics, and it is based on the lexical connection approach. Our objective is to integrate this theory as a tool of classification of texts in Arabic language. It requires the integration of a metrics for the classification of texts using a database of lemmatized and identified corpus which can be considered as a literature reference for times, kinds, literary themes and authors and this in order to permit the classification of anonymous texts.
  • Keywords
    natural languages; pattern classification; statistical analysis; text analysis; Arabic texts classification; anonymous text classification; intertextual distance theory; lexical connection approach; lexical statistics; Character recognition; Databases; Fusion power generation; Indexing; Laboratories; Merging; Statistics; Text categorization; Text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Internet Technology and Secured Transactions, 2009. ICITST 2009. International Conference for
  • Conference_Location
    London
  • Print_ISBN
    978-1-4244-5647-5
  • Type

    conf

  • DOI
    10.1109/ICITST.2009.5402564
  • Filename
    5402564