Title :
Automatic correspondence calculation between text and speech for authoring digital talking book
Author :
Watanabe, Katsuyuki ; Sugiyama, Masahide
Author_Institution :
Grad. Sch. of Comput. Sci. & Eng., Univ. of Aizu, Aizu-Wakamatsu
Abstract :
This paper proposes applying the voice-pause (VP) method to the authoring of DAISY talking books used by visually impaired people. The method lets authors automatically calculate the timing of sentence-level correspondence between Japanese text and its audio recording, reducing the time required to perform searches. Several related studies also calculate this timing information, but they require the input audio to have a specific speech style and to be short in duration. The present paper therefore evaluates the VP method on databases with different speech styles and on long-duration input audio, measuring the average gap time and the sentence detection rate. The experimental results show an average gap time of approximately 0.38 sec and a sentence detection rate of approximately 94%, both independent of speech style. The proposed VP method performs well and is efficient compared with methods proposed in previous studies.
Keywords :
authoring systems; handicapped aids; speech synthesis; DAISY talking books; Japanese text; automatic correspondence calculation; digital talking book; digital talking book authoring; sentence detection rate; sentence-based correspondence; speech style; visually impaired people; voice-pause method; Audio databases; Availability; Books; Broadband communication; Computer science; Information systems; Motion pictures; Personal communication networks; Speech analysis; User-generated content
Conference_Title :
Content-Based Multimedia Indexing, 2008. CBMI 2008. International Workshop on
Conference_Location :
London
Print_ISBN :
978-1-4244-2043-8
Electronic_ISBN :
978-1-4244-2044-5
DOI :
10.1109/CBMI.2008.4564941