Title :
Estimating The Date of Blog Authors by CRF
Author :
Izumi, Masataka ; Miura, Takao ; Shioya, Isamu
Author_Institution :
Hosei Univ., Tokyo
Abstract :
In this investigation, we propose a sophisticated approach for estimating the ages of blog authors by means of stochastic process. In this technique, we give weights on every word appeared in training data, and we extract a collection of feature words to each age. Then we examine articles on Blog based on the feature information and estimate the age by obtaining label to each word by means of conditional random fields (CRF). We show the effectiveness of our approach by some experiments.
Keywords :
Web sites; stochastic processes; blog author age estimation; blog date estimation; conditional random field; feature information; feature word extraction; stochastic process; Data mining; Engineering management; Engineering profession; Hidden Markov models; Informatics; Information services; Internet; Stochastic processes; Training data; Web sites; Age of Author; BLOG; CRF; Stochastic Process;
Conference_Titel :
Communications, Computers and Signal Processing, 2007. PacRim 2007. IEEE Pacific Rim Conference on
Conference_Location :
Victoria, BC
Print_ISBN :
978-1-4244-1189-4
Electronic_ISBN :
1-4244-1190-4
DOI :
10.1109/PACRIM.2007.4313222