• DocumentCode
    394295
  • Title

    Auditive learning based Chinese F0 prediction

  • Author

    Jianhua, Tao ; Xing, Ni

  • Author_Institution
    Nat. Lab of Pattern Recognition, Acad. Sinica, Beijing, China
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    The paper describes a new F0 (fundamental frequency) model based on an auditive learning (AL) method. Being focused on the notion of prosody templates, we confirmed that F0 patterns for a syllable can be extracted from various anamorphoses of F0 contours in spontaneous speech. It is most suitable to use the F0 templates selection method for Chinese F0 prediction with prosody cost function (PCF). Furthermore, an AL method is used to adjust the weight of PCF dynamically in application. Unlike other methods, the approach may give feedback as to exactly what are the crucial parameters determining the successful choice of patterns. The paper also analyzes the error distribution of the F0 prediction results. Both smoothing testing and F0 range testing show that the synthesis results are very close to human speech.
  • Keywords
    error statistics; feedback; learning (artificial intelligence); natural languages; prediction theory; speech synthesis; Chinese F0 prediction; auditive learning method; error distribution; feedback; fundamental frequency prediction; prosody cost function; prosody templates; range testing; smoothing testing; speech synthesis; spontaneous speech; syllable pattern extraction; Clustering methods; Cost function; Databases; Error analysis; Humans; Pattern recognition; Smoothing methods; Speech synthesis; Stress; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198827
  • Filename
    1198827