• DocumentCode
    2662825
  • Title

    Applying latent semantic analysis to classify emotions in Thai text

  • Author

    Inrak, Piyatida ; Sinthupinyo, Sukree

  • Author_Institution
    Dept. of Comput. Eng., Chulalongkorn Univ., Bangkok, Thailand
  • Volume
    6
  • fYear
    2010
  • fDate
    16-18 April 2010
  • Abstract
    With a rapid growth of the internet communication, many types of text are produced. They can convey the meanings that can contribute to text categorization. Emotion classification also becomes more interesting, but emotion classification in Thai text is still not able to be correctly classified. Thus, this paper proposes a novel approach that takes advantage of bi-words occurrence to classify emotion hidden in a short sentence. In this paper, we classify Thai text into six basic universal emotions including anger, disgust, fear, happiness, sadness, and surprise based on latent semantic analysis approach. We compared the results between two models which construct features from the sentences and applied to three classification methods, i.e. Naïve Bayes, SVM, and Decision Tree. The first feature model uses only single word occurrence in the classification. The second model uses single word combined with bi-words occurrence in the classification. The results show that the second model can yield higher accuracy than the first model based on the Naïve Bayes classification method.
  • Keywords
    Internet; decision trees; emotion recognition; feature extraction; natural language processing; support vector machines; text analysis; word processing; Internet communication; SVM; Thai text; bi-word occurrence; decision tree; emotion classification; latent semantic analysis; naive Bayes classification method; single word occurrence; text categorization; Decision trees; Dictionaries; Internet; Linear algebra; Matrix decomposition; Singular value decomposition; Statistical analysis; Support vector machine classification; Support vector machines; Text categorization; Affective Computing; Emotions in Text; Latent Semantic Analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Engineering and Technology (ICCET), 2010 2nd International Conference on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4244-6347-3
  • Type

    conf

  • DOI
    10.1109/ICCET.2010.5486137
  • Filename
    5486137