Title :
Pre-processing online financial text for sentiment classification: A natural language processing approach
Author :
Fan Sun ; Belatreche, Ammar ; Coleman, Sonya ; McGinnity, Thomas Martin ; Yuhua Li
Author_Institution :
Intell. Syst. Res. Centre, Univ. of Ulster, Magee, Derry, UK
Abstract :
Online financial textual information contains a large amount of investor sentiment, i.e. subjective assessment and discussion with respect to financial instruments. An effective solution to automate the sentiment analysis of such large amounts of online financial texts would be extremely beneficial. This paper presents a natural language processing (NLP) based pre-processing approach both for noise removal from raw online financial texts and for organizing such texts into an enhanced format that is more usable for feature extraction. The proposed approach integrates six NLP processing steps, including a developed syntactic and semantic combined negation handling algorithm, to reduce noise in the online informal text. Three-class sentiment classification is also introduced in each system implementation. Experimental results show that the proposed pre-processing approach outperforms other pre-processing methods. The combined negation handling algorithm is also evaluated against three standard negation handling approaches.
Keywords :
Internet; emotion recognition; feature extraction; financial data processing; natural language processing; pattern classification; text analysis; NLP based preprocessing approach; feature extraction; financial instruments; investor sentiment analysis; natural language processing approach; negation handling algorithm; noise removal; online financial text preprocessing; online financial textual information; online informal text; raw online financial texts; three-class sentiment classification; Companies; Feature extraction; Niobium; Semantics; Sentiment analysis; Tagging;
Conference_Titel :
Computational Intelligence for Financial Engineering & Economics (CIFEr), 2104 IEEE Conference on
Conference_Location :
London
DOI :
10.1109/CIFEr.2014.6924063