DocumentCode
1999710
Title
A Case Study: News Classification Based on Term Frequency
Author
Kroha, Petr ; Baeza-Yates, Ricardo
Author_Institution
Fac. of Comput. Sci., Univ. of Technol., Chemnitz
fYear
2005
fDate
26-26 Aug. 2005
Firstpage
428
Lastpage
432
Abstract
In this paper, we investigate how much similarity good news and bad news have in context of long-terms market trends and we discuss the relation between information retrieval and text mining. We have analyzed about 400 thousand news stories coming from the years 1999 to 2002 and we argue that classification methods of information retrieval are not strong enough to solve problems like this one because the meaning of news is given not only by the used words and their frequency but also by the structure of sentences and their context. We present results of our experiments and examples of news that support this statement
Keywords
business data processing; classification; data mining; information retrieval; stock markets; text analysis; business news classification; financial markets; information retrieval; term frequency; text mining; Business; Chemical technology; Computer aided software engineering; Computer science; Data mining; Databases; Engines; Frequency; Information retrieval; Text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Applications, 2005. Proceedings. Sixteenth International Workshop on
Conference_Location
Copenhagen
ISSN
1529-4188
Print_ISBN
0-7695-2424-9
Type
conf
DOI
10.1109/DEXA.2005.6
Filename
1508310
Link To Document