• DocumentCode
    2139410
  • Title

    Automatic correspondence calculation between text and speech for authoring digital talking book

  • Author

    Watanabe, Katsuyuki ; Sugiyama, Masahide

  • Author_Institution
    Grad. Sch. of Comput. Sci. & Eng., Univ. of Aizu, Aizu-Wakamatsu
  • fYear
    2008
  • fDate
    18-20 June 2008
  • Firstpage
    155
  • Lastpage
    161
  • Abstract
    The present paper proposes applying the voice-pause (VP) method to authoring DAISY talking books used by visually impaired people. The proposed method enables authors to automatically calculate the time information of sentence-based correspondence between Japanese text and the corresponding audio data, reducing the time required to perform searches. While there have been several related studies that calculate the time information of the correspondence, they require the input audio data to have a specific speech style and to be short in duration. Therefore, in the present paper, the proposed VP method was used to determine the average gap time and the sentence detection rate for databases having different speech styles and for input audio data having long durations. The experimental results show that the average gap time was approximately 0.38 sec and the sentence detection rate was approximately 94% and these are independent of speech style. The proposed VP method performs well and is efficient compared with methods proposed in previous studies.
  • Keywords
    authoring systems; handicapped aids; speech synthesis; DAISY talking books; Japanese text; automatic correspondence calculation; digital talking book; digital talking book authoring; sentence detection rate; sentence-based correspondence; speech style; visually impaired people; voice-pause method; Audio databases; Availability; Books; Broadband communication; Computer science; Information systems; Motion pictures; Personal communication networks; Speech analysis; User-generated content;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Content-Based Multimedia Indexing, 2008. CBMI 2008. International Workshop on
  • Conference_Location
    London
  • Print_ISBN
    978-1-4244-2043-8
  • Electronic_ISBN
    978-1-4244-2044-5
  • Type

    conf

  • DOI
    10.1109/CBMI.2008.4564941
  • Filename
    4564941