• DocumentCode
    1943940
  • Title

    Alignment between a technical paper and presentation sheets using a hidden Markov model

  • Author

    Hayama, Tessai ; Nanba, Hidetsugu ; Kunifuji, Susumu

  • Author_Institution
    Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
  • fYear
    2005
  • fDate
    19-21 May 2005
  • Firstpage
    102
  • Lastpage
    106
  • Abstract
    We have been studying the automatic generation of presentation sheets from a technical paper. Our approach consists of obtaining a set of rules for generating presentation sheets by applying machine learning techniques to many pairs of technical papers and their presentation sheets collected from the World Wide Web. As a first step, in this paper, we propose a method for aligning technical papers and presentation sheets. Our method is based on Jing´s method, which uses a hidden Markov model (HMM). Although this method is useful to align short sentences in newspaper articles, it is inapplicable to align sentences in a paper including charts and long sentences. Therefore, we analyse features of papers and sheets, such as information from text appearance, and propose an alignment method that combines the use of these features and Jing´s method. The evaluation shows that our alignment method performed effectively.
  • Keywords
    Internet; document handling; hidden Markov models; knowledge acquisition; learning (artificial intelligence); natural languages; Jing method; World Wide Web; document handling; hidden Markov model; knowledge acquisition; machine learning techniques; presentation sheet; rule generation; technical paper alignment; Cities and towns; Equations; Hidden Markov models; Information analysis; Machine learning; Paper technology; Performance evaluation; TV; Teletext; Web sites;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Active Media Technology, 2005. (AMT 2005). Proceedings of the 2005 International Conference on
  • Print_ISBN
    0-7803-9035-0
  • Type

    conf

  • DOI
    10.1109/AMT.2005.1505278
  • Filename
    1505278