DocumentCode
2139410
Title
Automatic correspondence calculation between text and speech for authoring digital talking book
Author
Watanabe, Katsuyuki ; Sugiyama, Masahide
Author_Institution
Grad. Sch. of Comput. Sci. & Eng., Univ. of Aizu, Aizu-Wakamatsu
fYear
2008
fDate
18-20 June 2008
Firstpage
155
Lastpage
161
Abstract
The present paper proposes applying the voice-pause (VP) method to authoring DAISY talking books used by visually impaired people. The proposed method enables authors to automatically calculate the time information of sentence-based correspondence between Japanese text and the corresponding audio data, reducing the time required to perform searches. While there have been several related studies that calculate the time information of the correspondence, they require the input audio data to have a specific speech style and to be short in duration. Therefore, in the present paper, the proposed VP method was used to determine the average gap time and the sentence detection rate for databases having different speech styles and for input audio data having long durations. The experimental results show that the average gap time was approximately 0.38 sec and the sentence detection rate was approximately 94% and these are independent of speech style. The proposed VP method performs well and is efficient compared with methods proposed in previous studies.
Keywords
authoring systems; handicapped aids; speech synthesis; DAISY talking books; Japanese text; automatic correspondence calculation; digital talking book; digital talking book authoring; sentence detection rate; sentence-based correspondence; speech style; visually impaired people; voice-pause method; Audio databases; Availability; Books; Broadband communication; Computer science; Information systems; Motion pictures; Personal communication networks; Speech analysis; User-generated content;
fLanguage
English
Publisher
ieee
Conference_Titel
Content-Based Multimedia Indexing, 2008. CBMI 2008. International Workshop on
Conference_Location
London
Print_ISBN
978-1-4244-2043-8
Electronic_ISBN
978-1-4244-2044-5
Type
conf
DOI
10.1109/CBMI.2008.4564941
Filename
4564941
Link To Document