• DocumentCode
    3278013
  • Title

    Automatic Classification of Uighur Web Pages

  • Author

    Xu Guixian ; Gao Xu ; Zhao Xiaobing ; Yang Guosheng

  • Author_Institution
    Coll. of Inf. Eng., Minzu Univ. of China, Beijing, China
  • fYear
    2013
  • fDate
    16-18 Jan. 2013
  • Firstpage
    390
  • Lastpage
    393
  • Abstract
    In this paper, we introduce a classification approach for Uighur web pages. It uses a class feature dictionary and cosine similarity computation to classify Uighur web pages into predefined classes rapidly and accurately. The experimental results show that the approach achieves good classification performance on Uighur web pages. It is useful for the construction of statistical and rule-based classification of Uighur texts, as well as for the construction of a high-quality Uighur corpus.
  • Keywords
    Internet; knowledge based systems; natural language processing; pattern classification; statistical analysis; text analysis; Uighur Web page; Uighur corpus; Uighur text; Web page classification; class feature dictionary; classification performance; cosine similarity computation; rule-based classification; statistical classification; Dictionaries; Feature extraction; Information processing; Kernel; Text categorization; Web pages; Classification of Web Pages; Text classification; Uighur Information Processing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent System Design and Engineering Applications (ISDEA), 2013 Third International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4673-4893-5
  • Type

    conf

  • DOI
    10.1109/ISDEA.2012.97
  • Filename
    6456317