• DocumentCode
    2929253
  • Title

    Analysis and Transformation of Textual Energy Distribution

  • Author

    Molina, Arturo ; Torres-Moreno, Juan-Manuel ; Sanjuan, Eric ; Sierra, Gerardo ; Rojas-Mora, Julio

  • Author_Institution
    Lab. Inf. d'Avignon, Univ. d'Avignon et des Pays de Vaucluse, Avignon, France
  • fYear
    2013
  • fDate
    24-30 Nov. 2013
  • Firstpage
    203
  • Lastpage
    208
  • Abstract
    In this paper we revisit the Textual Energy model. We deal with the two major disadvantages of the Textual Energy: the asymmetry of the distribution and the unboundedness of the maximum value. Although this model has been successfully used in several NLP tasks like summarization, clustering and sentence compression, no correction of these problems has been proposed until now. Concerning the maximum value, we analyze the computation of the Textual Energy matrix and we conclude that energy values are dominated by the lexical richness in quadratic growth of the vocabulary size. Using the Box-Cox transformation, we show empirical evidence that a log transformation could correct both problems.
  • Keywords
    natural language processing; pattern clustering; NLP tasks; Box-Cox transformation; clustering; lexical richness; log transformation; maximum value unboundedness; quadratic vocabulary size growth; sentence compression; summarization; textual energy distribution; textual energy matrix; Adaptation models; Computational modeling; Energy measurement; Numerical models; Physics; Vectors; Vocabulary; Measures of Informativity; Sentence Importance; textual energy; weight model
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Artificial Intelligence (MICAI), 2013 12th Mexican International Conference on
  • Conference_Location
    Mexico City
  • Print_ISBN
    978-1-4799-2604-6
  • Type

    conf

  • DOI
    10.1109/MICAI.2013.32
  • Filename
    6714669