DocumentCode
3109599
Title
Tuning semantic association for modelling textual data
Author
Otero, Juan M. ; Rodriguez, Ansel Y. ; Medina-Pagola, José E.
Author_Institution
Dept. of Appl. Math., Havana Univ., Havana, Cuba
fYear
2011
fDate
19-21 May 2011
Firstpage
1
Lastpage
6
Abstract
Text information processing depends critically on the proper representation of documents. Traditional models, like the vector space model, have significant limitations because they do not consider semantic relations amongst terms. Global Association Distance Model (GADM) is an alternative that includes this consideration for document representation, assuming basically that two documents should be closer if the shortest formal distances amongst terms in each document are similar. The association strength function used to model the semantic relations among terms, based on its formal distances is a critical feature of GADM. In this paper the association strength function is analyzed, a family of piecewise association strength functions is proposed and a Simulated Annealing algorithm is used to tune it and to obtain an optimal model of semantic relation. We evaluate this significance for topic classification task.
Keywords
document handling; sensor fusion; simulated annealing; association strength function; document representation; formal distances; global association distance model; simulated annealing algorithm; text information processing; textual data modelling; topic classification task; tuning semantic association; Annealing; Semantics; Simulated annealing; association strength function; representation of documents; simulated annealing; vector space model;
fLanguage
English
Publisher
ieee
Conference_Titel
Research Challenges in Information Science (RCIS), 2011 Fifth International Conference on
Conference_Location
Gosier
ISSN
2151-1349
Print_ISBN
978-1-4244-8670-0
Electronic_ISBN
2151-1349
Type
conf
DOI
10.1109/RCIS.2011.6006821
Filename
6006821
Link To Document