• DocumentCode
    2040612
  • Title

    LSF and Phase Feature Combination for Join Cost Estimation in a TTS System

  • Author

    Hosseinpour, Mehdi ; Moin, M. Shahram ; Zargari, Farzad

  • Author_Institution
    Multimedia Res. Group, Iran Telecommun. Res. Center, Tehran
  • fYear
    2007
  • fDate
    24-27 Nov. 2007
  • Firstpage
    237
  • Lastpage
    240
  • Abstract
    In recent text-to-speech (TTS) synthesis systems, speech is generated by concatenating units that are selected from a large database. Selection of units can be based on two cost measures: join and target. Many methods have been developed for join cost estimation, but they all suffer from neglecting the phase feature. Our new approach for join cost estimation is based on combining the residual phase feature and LSF feature where Discrete All Pole modeling(DAP) has been used for phase and LSF feature extraction. Results of different experimentations show that this method has better performance than conventional methods like MFCC unit selection.
  • Keywords
    feature extraction; spectral analysis; speech synthesis; DAP; LSF feature extraction; TTS system; discrete all pole modeling; join cost estimation; line spectral frequency; phase feature combination; text-to-speech synthesis; Acoustic measurements; Costs; Digital audio players; Feature extraction; Frequency; Multimedia databases; Phase estimation; Polynomials; Speech processing; Speech synthesis; DAP; LSF; Speech processing; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications, 2007. ICSPC 2007. IEEE International Conference on
  • Conference_Location
    Dubai
  • Print_ISBN
    978-1-4244-1235-8
  • Electronic_ISBN
    978-1-4244-1236-5
  • Type

    conf

  • DOI
    10.1109/ICSPC.2007.4728299
  • Filename
    4728299