DocumentCode
2015540
Title
Large-scale processing, indexing and search system for Czech audio-visual cultural heritage archives
Author
Nouza, Jan ; Blavka, Karel ; Zdansky, Jindrich ; Cerva, Petr ; Silovsky, Jan ; Bohac, Marek ; Chaloupka, Josef ; Kucharova, Michaela ; Seps, Ladislav
Author_Institution
Inst. of Inf. Technol. & Electron., Tech. Univ. of Liberec, Liberec, Czech Republic
fYear
2012
fDate
17-19 Sept. 2012
Firstpage
337
Lastpage
342
Abstract
This paper describes a complex system developed for processing, indexing and accessing data collected in large audio and audio-visual archives that make an important part of Czech cultural heritage. Recently, the system is being applied to the Czech Radio archive, namely to its oral history segment with more than 200.000 individual recordings covering almost ninety years of broadcasting in the Czech Republic and former Czechoslovakia. The ultimate goals are a) to transcribe a significant portion of the archive - with the support of speech, speaker and language recognition technology, b) index the transcriptions, and c) make the audio and text files fully searchable. So far, the system has processed and indexed over 75.000 spoken documents. Most of them come from the last two decades, but the recent demo collection includes also a series of presidential speeches since 1934. The full coverage of the archive should be available by the end of 2014.
Keywords
audio-visual systems; document handling; history; indexing; information retrieval systems; Czech audio-visual cultural heritage archives; Czechoslovakia; indexing; large-scale processing; oral history segment; search system; spoken documents; Acoustics; Databases; Servers; Speech; Speech recognition; Training; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Signal Processing (MMSP), 2012 IEEE 14th International Workshop on
Conference_Location
Banff, AB
Print_ISBN
978-1-4673-4570-5
Electronic_ISBN
978-1-4673-4571-2
Type
conf
DOI
10.1109/MMSP.2012.6343465
Filename
6343465
Link To Document