• DocumentCode
    1908441
  • Title

    Research of Modern Uyghur Word Frequency Statistical Technology

  • Author

    Azragul ; Nian Mei ; Yasen Yimin

  • Author_Institution
    Anal. Lab., Xinjiang Normal Univ., Urumqi, China
  • fYear
    2013
  • fDate
    17-19 Aug. 2013
  • Firstpage
    60
  • Lastpage
    63
  • Abstract
    With the development of our society, the languages are also constantly evolving. Word is the smallest meaningful language composition which able to activity independently, and is also important carrier of knowledge and the basic operation unit in the natural language processing system. Uyghur word frequency statistics technology is the process by computer automatic identification term boundary in the texts. It is the most important pretreatment of information processing technology. However, there is no a really mature Uighur word frequency statistics system, which became one of the bottlenecks that hampered the development of information processing in Uighur language seriously at present. This paper discusses the idea and algorithms of the Uyghur word frequency statistics system in detail. Secondly introduces functional design process of the word frequency statistics system. Third I describe methods and techniques of this system. Finally it states statement of the test results.
  • Keywords
    natural language processing; statistical analysis; Uighur language; Uyghur word frequency statistical technology; computer automatic identification term boundary; functional design process; information processing technology; language composition; natural language processing system; Algorithm design and analysis; Databases; Dictionaries; Frequency conversion; Standards; Time-frequency analysis; Functional design; Implementation method; Modern Uygur language; Word frequency statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2013 International Conference on
  • Conference_Location
    Urumqi
  • Type

    conf

  • DOI
    10.1109/IALP.2013.20
  • Filename
    6646004