• Title of article

    Discovering Term Occurrence Structure in Text

  • Author/Authors

    Bookstein, Abraham; Raita, T.

  • Issue Information
    Monthly, with serial issue number, year 2001
  • Pages
    -475
  • From page
    476
  • To page
    0
  • Abstract
    This article examines some consequences for information control of the tendency of occurrences of content-bearing terms to appear together, or clump. Properties of previously defined clumping measures are reviewed and extended, and the significance of these measures for devising retrieval strategies is discussed. A new type of clumping measure, which extends the earlier measures by permitting gaps within a clump, is defined, and several variants are examined. Experiments are carried out that indicate the relation between the new measure and one of the earlier measures, as well as the ability of the two types of measure to predict compression efficiency.
  • Keywords
    optical music recognition, musical data acquisition, Pattern recognition, Document image analysis
  • Journal title
    Journal of the American Society for Information Science and Technology
  • Serial Year
    2001
  • Record number

    35090