Title :
Synthesizing suprasegmental speech information using hybrid of GA-ACO and dynamic neural network
Author :
Sheikhan, Mansour
Author_Institution :
South Tehran Branch, Electr. Eng. Dept., Islamic Azad Univ., Tehran, Iran
Abstract :
In generating natural speech by machines, removing the suprasegmental information (such as stress, timing and pitch frequency) results in unpleasant speech. To provide this information for synthesizing natural speech in Farsi language, a dynamic neural network (DNN) is used in this study. The inputs of DNN are word-level and syllable-level features as part of speech tags, word length, and type of punctuation mark at the word-level, and type of vowel and consonants, and position indicator of syllable at the syllable-level. To reduce the number of inputs of DNN, hybrid of genetic algorithm (GA) and ant colony optimization (ACO) is used for feature selection. The output layer of DNN includes nine nodes which provide suprasegmental information at the syllable level including pitch contour, log-energy level, duration information and pause data. Simulation results show that suprasegmental information is predicted with low root mean square error by using this hybrid soft-computing model.
Keywords :
ant colony optimisation; genetic algorithms; natural language processing; neural nets; speech synthesis; DNN; Farsi language; GA-ACO; ant colony optimization; dynamic neural network; feature selection; genetic algorithm; hybrid soft-computing model; natural speech synthesis; punctuation mark; root mean square error; speech tags; suprasegmental speech information synthesis; syllable-level features; word length; word-level features; Computational modeling; Computers; Generators; Hidden Markov models; Pragmatics; Speech; Speech processing; ant colony optimization; feature selection; genetic algorithm; neural network; suprasegmental information;
Conference_Titel :
Information and Knowledge Technology (IKT), 2013 5th Conference on
Conference_Location :
Shiraz
Print_ISBN :
978-1-4673-6489-8
DOI :
10.1109/IKT.2013.6620060