Title :
A fuzzy model of a European index based on automatically extracted content information
Author :
Milea, Viorel ; Almeida, Rui J. ; Kaymak, Uzay ; Frasincar, Flavius
Author_Institution :
Erasmus Sch. of Econ., Erasmus Univ. Rotterdam, Rotterdam, Netherlands
Abstract :
In this paper we build on previous work related to predicting the MSCI EURO index based on content analysis of ECB statements. Our focus is on reducing the number of features employed for prediction through feature selection. For this purpose we rely on two methodologies: (stepwise) linear regression and greedy forward feature subset selection. The original dataset consists of 13 features (General Inquirer content categories). Both methodologies provide an improvement in the overall accuracy of the model, while reducing the number of features employed. Through linear regression we achieve an accuracy of 67.58% on the testing set by relying on six features, while greedy forward selection enables an accuracy on the test set of 69.50% while relying on eight features.
Keywords :
economic indicators; financial data processing; fuzzy set theory; information retrieval; regression analysis; ECB statements; European index; MSCI EURO index; automatic content information extraction; content analysis; feature selection; fuzzy model; greedy forward feature subset selection; linear regression; Accuracy; Data models; Economics; Indexes; Linear regression; Predictive models; Training;
Conference_Titel :
Computational Intelligence for Financial Engineering and Economics (CIFEr), 2011 IEEE Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-9933-5
DOI :
10.1109/CIFER.2011.5953571