• DocumentCode
    2576226
  • Title

    Automatic audio archiving system for panel discussions

  • Author

    Akita, Yuya ; Hasegawa, Masahiro ; Kawahara, Tatsuya

  • Author_Institution
    Sch. of Informatics, Kyoto Univ., Japan
  • Volume
    3
  • fYear
    2004
  • fDate
    27-30 June 2004
  • Firstpage
    1859
  • Abstract
    We present an automatic audio archiving system suitable for panel discussions. In our archive framework, audio data, speech transcription, speaker and content based indices are integrated in order to realize efficient archive browsing. Speaker indexing is performed in a totally unsupervised manner. The speaker information is also used for enhancing the automatic speech recognition system. These results are aligned with audio segments. Moreover we also introduce a novel indexing of utterances based on discourse tags that represent intentions and importance of utterances. A discourse tagger combining rule based and statistical methods is developed to automatically generate high-level indices. Finally, these results are combined and encoded using an MPEG-7 framework, resulting in highly portable archives.
  • Keywords
    audio databases; indexing; information retrieval; multimedia databases; speech coding; speech recognition; MPEG-7 coding; archive browsing; audio data indices; automatic panel discussion audio archiving system; automatic speech recognition system; discourse tagger; discourse tags; multimedia content archiving; rule based methods; speech content based indices; speech transcription; statistical methods; unsupervised speaker indexing; utterance importance; utterance indexing; utterance intentions;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
  • Print_ISBN
    0-7803-8603-5
  • Type

    conf

  • DOI
    10.1109/ICME.2004.1394620
  • Filename
    1394620