• DocumentCode
    477517
  • Title

    A New Parallel Segmentation Model Based on Dictionary and Mutual Information

  • Author

    Bo Liu ; Liang Gui Tang ; Can Tang

  • Author_Institution
    Comput. Sci. & Inf. Eng. Coll., Chongqing Technol. & Bus. Univ., Chongqing
  • Volume
    1
  • fYear
    2008
  • fDate
    20-22 Oct. 2008
  • Firstpage
    554
  • Lastpage
    558
  • Abstract
    It is difficult to compute the word frequency for mutual information segmentation. Statistic of word frequency of parallel mutual information is integrated with dictionary segmentation to improve efficiency in this paper. The parallel model and dispatching policy are presented, the paper also gives the speed up ratio of parallel model at the same time, periods pattern string and non periods pattern string are optimized in parallel model. Experiment show that the algorithm is available. The parallel model also can use for other segmentation algorithms that base on statistic of word frequency.
  • Keywords
    computational linguistics; natural language processing; parallel algorithms; Chinese word segmentation; dictionary segmentation; dispatching policy; parallel algorithm; parallel mutual information segmentation; statistics; word frequency; Automation; Computer science; Concurrent computing; Dictionaries; Distribution strategy; Educational institutions; Frequency; Mutual information; Pattern matching; Statistics; PVM; mutual information; parallel segmentation; statistic frequency;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Computation Technology and Automation (ICICTA), 2008 International Conference on
  • Conference_Location
    Hunan
  • Print_ISBN
    978-0-7695-3357-5
  • Type

    conf

  • DOI
    10.1109/ICICTA.2008.369
  • Filename
    4659546