• DocumentCode
    1811859
  • Title

    Automatic topic detection strategy for information retrieval in spoken document

  • Author

    Jin, Shan ; Misra, Hemant ; Sikora, Thomas ; Jose, Joemon

  • Author_Institution
    Dept. of Telecommun. Syst., Tech. Univ. of Berlin, Berlin
  • fYear
    2009
  • fDate
    6-8 May 2009
  • Firstpage
    300
  • Lastpage
    303
  • Abstract
    This paper suggests an alternative solution for the task of spoken document retrieval (SDR). The proposed system runs retrieval on multi-level transcriptions (word and phone) produced by word and phone recognizers respectively, and their outputs are combined. We propose to use latent Dirichlet allocation (LDA) model for capturing the semantic information on word transcription. The LDA model is employed for estimating topic distribution in queries and word transcribed spoken documents, and the matching is performed at the topic level. Acoustic matching between query words and phonetically transcribed spoken documents is performed using phone-based matching algorithm. The results of acoustic and topic level matching methods are compared and shown to be complementary.
  • Keywords
    information retrieval; natural language processing; text analysis; acoustic matching; automatic topic detection strategy; information retrieval; latent Dirichlet allocation; multilevel transcriptions retrieval; phone-based matching algorithm; query words; semantic information; spoken document retrieval; word transcription; Acoustic signal detection; Automatic speech recognition; Content based retrieval; Databases; Distributed computing; Information retrieval; Linear discriminant analysis; Streaming media; Telecommunication computing; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Analysis for Multimedia Interactive Services, 2009. WIAMIS '09. 10th Workshop on
  • Conference_Location
    London
  • Print_ISBN
    978-1-4244-3609-5
  • Electronic_ISBN
    978-1-4244-3610-1
  • Type

    conf

  • DOI
    10.1109/WIAMIS.2009.5031492
  • Filename
    5031492