• DocumentCode
    639732
  • Title

    Synthesizing suprasegmental speech information using hybrid of GA-ACO and dynamic neural network

  • Author

    Sheikhan, Mansour

  • Author_Institution
    South Tehran Branch, Electr. Eng. Dept., Islamic Azad Univ., Tehran, Iran
  • fYear
    2013
  • fDate
    28-30 May 2013
  • Firstpage
    175
  • Lastpage
    180
  • Abstract
    In generating natural speech by machines, removing the suprasegmental information (such as stress, timing and pitch frequency) results in unpleasant speech. To provide this information for synthesizing natural speech in Farsi language, a dynamic neural network (DNN) is used in this study. The inputs of DNN are word-level and syllable-level features as part of speech tags, word length, and type of punctuation mark at the word-level, and type of vowel and consonants, and position indicator of syllable at the syllable-level. To reduce the number of inputs of DNN, hybrid of genetic algorithm (GA) and ant colony optimization (ACO) is used for feature selection. The output layer of DNN includes nine nodes which provide suprasegmental information at the syllable level including pitch contour, log-energy level, duration information and pause data. Simulation results show that suprasegmental information is predicted with low root mean square error by using this hybrid soft-computing model.
  • Keywords
    ant colony optimisation; genetic algorithms; natural language processing; neural nets; speech synthesis; DNN; Farsi language; GA-ACO; ant colony optimization; dynamic neural network; feature selection; genetic algorithm; hybrid soft-computing model; natural speech synthesis; punctuation mark; root mean square error; speech tags; suprasegmental speech information synthesis; syllable-level features; word length; word-level features; Computational modeling; Computers; Generators; Hidden Markov models; Pragmatics; Speech; Speech processing; ant colony optimization; feature selection; genetic algorithm; neural network; suprasegmental information;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information and Knowledge Technology (IKT), 2013 5th Conference on
  • Conference_Location
    Shiraz
  • Print_ISBN
    978-1-4673-6489-8
  • Type

    conf

  • DOI
    10.1109/IKT.2013.6620060
  • Filename
    6620060