Title :
An empirical text categorizing computational model based on stylistic aspects
Author :
Michos, S.E. ; Stamatatos, E. ; Fakotakis, N. ; Kokkinakis, G.
Author_Institution :
Dept. of Electr. & Comput. Eng., Patras Univ., Greece
Abstract :
The presented work is strongly motivated by the need for categorizing unrestricted text in terms of a functional style (FS) in order to attain a satisfying outcome in style processing. Towards this aim, a three level description of FS is given that comprises: (a) the basic categories of FS; (b) the main features that characterize each one of the above categories; and (c) the linguistic identifiers that act as style markers in text for the identification of the above features. Special emphasis is put on the problems that faced the computational implementation of the aforementioned findings, as well as the selection of the most appropriate stylometrics (i.e., stylistic scores) to achieve better results on text categorization. This approach is language independent, empirically driven, and can be used in various applications including grammar and style checking, natural language generation, style verification in real world text, and recognition of style shift between adjacent portions of text.
Keywords :
computational linguistics; grammars; natural languages; pattern recognition; word processing; computational implementation; empirical text categorizing computational model; functional style; linguistic identifiers; natural language generation; real world text; style checking; style markers; style processing; style shift; style verification; stylistic aspects; stylistic scores; stylometrics; text categorization; three level description; unrestricted text; Computational modeling; Educational institutions; Humans; Information technology; Marine vehicles; Natural languages; Statistical analysis; Telecommunication computing; Text categorization; Text recognition;
Conference_Titel :
Tools with Artificial Intelligence, 1996., Proceedings Eighth IEEE International Conference on
Print_ISBN :
0-8186-7686-7
DOI :
10.1109/TAI.1996.560403