Title :
Entropy-Based Age Estimation of Blog Authors
Author :
Izumi, Masataka ; Miura, Takao ; Shioya, Isamu
Author_Institution :
Dept.of Elect, & Electr. Eng., Hosei Univ., Tokyo
fDate :
July 28 2008-Aug. 1 2008
Abstract :
In this investigation, we propose a probabilistic approach for estimating the ages of blog authors in Japan by means of naive Bayesian classifier. We can learn context of characteristic words appeared in training data in terms of entropy. The key idea is that we extract feature words specific to authors´ ages, and we estimate ages of the blog authors. We show the effectiveness of our approach by experimental results.
Keywords :
Bayes methods; Internet; data mining; feature extraction; probability; blog authors; entropy-based age estimation; feature word extraction; naive Bayesian classifier; probabilistic approach; Bayesian methods; Data mining; Engineering profession; Feature extraction; Information services; Internet; Support vector machine classification; Support vector machines; Training data; Web sites; software as a service;
Conference_Titel :
Computer Software and Applications, 2008. COMPSAC '08. 32nd Annual IEEE International
Conference_Location :
Turku
Print_ISBN :
978-0-7695-3262-2
Electronic_ISBN :
0730-3157
DOI :
10.1109/COMPSAC.2008.201