DocumentCode
1943940
Title
Alignment between a technical paper and presentation sheets using a hidden Markov model
Author
Hayama, Tessai ; Nanba, Hidetsugu ; Kunifuji, Susumu
Author_Institution
Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
fYear
2005
fDate
19-21 May 2005
Firstpage
102
Lastpage
106
Abstract
We have been studying the automatic generation of presentation sheets from a technical paper. Our approach consists of obtaining a set of rules for generating presentation sheets by applying machine learning techniques to many pairs of technical papers and their presentation sheets collected from the World Wide Web. As a first step, in this paper, we propose a method for aligning technical papers and presentation sheets. Our method is based on Jing´s method, which uses a hidden Markov model (HMM). Although this method is useful to align short sentences in newspaper articles, it is inapplicable to align sentences in a paper including charts and long sentences. Therefore, we analyse features of papers and sheets, such as information from text appearance, and propose an alignment method that combines the use of these features and Jing´s method. The evaluation shows that our alignment method performed effectively.
Keywords
Internet; document handling; hidden Markov models; knowledge acquisition; learning (artificial intelligence); natural languages; Jing method; World Wide Web; document handling; hidden Markov model; knowledge acquisition; machine learning techniques; presentation sheet; rule generation; technical paper alignment; Cities and towns; Equations; Hidden Markov models; Information analysis; Machine learning; Paper technology; Performance evaluation; TV; Teletext; Web sites;
fLanguage
English
Publisher
ieee
Conference_Titel
Active Media Technology, 2005. (AMT 2005). Proceedings of the 2005 International Conference on
Print_ISBN
0-7803-9035-0
Type
conf
DOI
10.1109/AMT.2005.1505278
Filename
1505278
Link To Document