DocumentCode :
2576226
Title :
Automatic audio archiving system for panel discussions
Author :
Akita, Yuya ; Hasegawa, Masahiro ; Kawahara, Tatsuya
Author_Institution :
Sch. of Informatics, Kyoto Univ., Japan
Volume :
3
fYear :
2004
fDate :
27-30 June 2004
Firstpage :
1859
Abstract :
We present an automatic audio archiving system suitable for panel discussions. In our archive framework, audio data, speech transcription, speaker and content based indices are integrated in order to realize efficient archive browsing. Speaker indexing is performed in a totally unsupervised manner. The speaker information is also used for enhancing the automatic speech recognition system. These results are aligned with audio segments. Moreover we also introduce a novel indexing of utterances based on discourse tags that represent intentions and importance of utterances. A discourse tagger combining rule based and statistical methods is developed to automatically generate high-level indices. Finally, these results are combined and encoded using an MPEG-7 framework, resulting in highly portable archives.
Keywords :
audio databases; indexing; information retrieval; multimedia databases; speech coding; speech recognition; MPEG-7 coding; archive browsing; audio data indices; automatic panel discussion audio archiving system; automatic speech recognition system; discourse tagger; discourse tags; multimedia content archiving; rule based methods; speech content based indices; speech transcription; statistical methods; unsupervised speaker indexing; utterance importance; utterance indexing; utterance intentions;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
Print_ISBN :
0-7803-8603-5
Type :
conf
DOI :
10.1109/ICME.2004.1394620
Filename :
1394620
Link To Document :
بازگشت