DocumentCode
3200298
Title
Automatic synchronization of electronic and audio books via TTS alignment and silence filtering
Author
Anguera, Xavier ; Perez, Nestor ; Urruela, Andreu ; Oliver, Nuria
Author_Institution
Telefonica Res., Barcelona, Spain
fYear
2011
fDate
11-15 July 2011
Firstpage
1
Lastpage
6
Abstract
The e-book industry is starting to flourish due, in part, to the availability of affordable and user-friendly e-book readers. As users increasingly move from traditional paper books to e-books, there is an opportunity to reinvent and enhance their reading experience, for example by leveraging the multimedia capabilities of these devices to turn the act of reading into a real multimedia experience. In this paper, we focus on augmenting the written text with its associated audiobook, so that users can listen to the book they are currently reading. We propose an audiobook-to-ebook alignment system that applies a text-to-speech (TTS)-based text-to-audio alignment algorithm and enhances it with a silence filtering algorithm to cope with the differences in reading style between the TTS output and the speakers in the audiobook recordings. Experiments using 12 five-minute excerpts from 6 different audiobooks (read by men and women) yield usable word alignment errors below 120 ms for 90% of the words. Finally, we also show a user interface implementation on the iPad for synchronized e-book reading while listening to the associated audiobook.
Keywords
electronic publishing; filtering theory; speech synthesis; TTS alignment; audio books; audiobook-to-ebook alignment system; book synchronization; e-book industry; e-book readers; electronic books; multimedia experience; silence filtering; text to audio alignment algorithm; text-to-speech; audio processing; audiobook; e-book; multimodal synchronization
fLanguage
English
Publisher
IEEE
Conference_Titel
Multimedia and Expo (ICME), 2011 IEEE International Conference on
Conference_Location
Barcelona
ISSN
1945-7871
Print_ISBN
978-1-61284-348-3
Electronic_ISBN
1945-7871
Type
conf
DOI
10.1109/ICME.2011.6012185
Filename
6012185