• DocumentCode
    2260505
  • Title

    Automatic Speech Corpus Construction from Broadcasting Speech Databases

  • Author

    Zhang Wei ; Du Ranran ; Pang Minhui ; Wang Qiuhong

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Ocean Univ. of China, Qingdao, China
  • fYear
    2010
  • fDate
    11-14 Dec. 2010
  • Firstpage
    639
  • Lastpage
    643
  • Abstract
    Diversified speech synthesis often requires speech corpora to be constructed frequently. This paper discusses our efforts to construct a speech corpus automatically from broadcasting speech databases for a trainable Text-To-Speech (TTS) system. We present a new framework for automatic speech corpus construction from broadcasting speech databases. We select clean speech audio from the broadcast audio with a music detector based on speech/music discrimination. An automatic speech sentence segmentation system then generates the sentence database from the clean speech audio. Finally, a text corpus construction method selects the spoken sentences that maximize coverage of the sentence database's diphones. Experiments show that our method can generate a good speech corpus rapidly with minimal manual intervention.
  • Keywords
    database management systems; music; speech synthesis; TTS; automatic speech corpus construction; database diphones; music detector; music discrimination; speech audios; speech databases broadcasting; speech discrimination; text corpus construction method; text-to-speech system; automatic speech sentence segmentation; speech corpus; text selection
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computational Intelligence and Security (CIS), 2010 International Conference on
  • Conference_Location
    Nanning
  • Print_ISBN
    978-1-4244-9114-8
  • Electronic_ISBN
    978-0-7695-4297-3
  • Type

    conf

  • DOI
    10.1109/CIS.2010.145
  • Filename
    5696361