مرکز منطقه ای اطلاع رساني علوم و فناوري - Quantitative method for modeling context in concatenative synthesis using large speech database

DocumentCode :

3343712

Title :

Quantitative method for modeling context in concatenative synthesis using large speech database

Author :

Hamza, Wael ; Rashwan, Mohsen ; Afify, Moltanted

Author_Institution :

IBM Eur. Speech Res., IBM Egypt, Egypt

Volume :

fYear :

2001

fDate :

2001

Firstpage :

789

Abstract :

Modeling phonetic context is one of the key points to get natural sounding in concatenativc speech synthesis. In this paper, a new quantitative method to model context is proposed. In the proposed method, the context is measured as the distance between leafs of the top-down likelihood-based decision trees that have been grown during the construction of acoustic inventory. Unlike other context modeling methods, this method allows the unit selection algorithm to borrow unit occurrences from other contexts when their context distances are close. This is done by incorporating the measured distance as an element in the unit selection cost function. The motivation behind this method is that it reduces the required speech modification by using better unit occurrences from near context. This method also makes it easy to use long synthesis units, e.g. syllables or words, in the same unit selection framework

Keywords :

decision theory; maximum likelihood estimation; speech synthesis; acoustic inventory; concatenativc speech synthesis; concatenative synthesis; context distances; context modeling methods; large speech database; long synthesis units; phonetic context; speech modification; syllables; top-down likelihood-based decision trees; unit occurrences; unit selection algorithm; unit selection cost function; words; Acoustic measurements; Context modeling; Cost function; Decision trees; Hidden Markov models; Humans; Laboratories; Natural languages; Spatial databases; Speech synthesis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on

Conference_Location :

Salt Lake City, UT

ISSN :

1520-6149

Print_ISBN :

0-7803-7041-4

Type :

conf

DOI :

10.1109/ICASSP.2001.941033

Filename :

941033

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3343712