• DocumentCode
    3614533
  • Title

    Forward masking phenomenon in concatenative speech synthesis

  • Author

    M. Cernak;G. Rozinaj

  • Author_Institution
    Fac. of Electr. Eng., Slovak Tech. Univ., Bratislava, Slovakia
  • Volume
    2
  • fYear
    2003
  • fDate
    6/25/1905 12:00:00 AM
  • Firstpage
    691
  • Abstract
    The approach described in the paper tries to get more knowledge to the concatenative text-to-speech system design. The knowledge is based on masking phenomenon of the inner ear, particularly of its temporal (forward) masking properties. Designing such knowledge-based system is suggested to use in the unit selection-based speech synthesis, as contemporary a prominent technique in concatenative synthesis, which utilizes a big speech corpus. The more prosodic variability the corpus captures, the more natural a synthetic voice sounds and there are more possibilities to occur a forward masking events during concatenation of selected candidate units from the corpus.
  • Keywords
    "Speech synthesis","Humans","Speech analysis","Speech coding","Frequency","Cost function","Knowledge based systems","Speech processing","Cepstral analysis","Ear"
  • Publisher
    ieee
  • Conference_Titel
    Video/Image Processing and Multimedia Communications, 2003. 4th EURASIP Conference focused on
  • Print_ISBN
    953-184-054-7
  • Type

    conf

  • DOI
    10.1109/VIPMC.2003.1220544
  • Filename
    1220544