DocumentCode :
312282
Title :
A study on Japanese prosodic pattern and its modeling in restricted speech
Author :
Hamagami, Tomoki ; Magata, Ken-ichi ; Komura, Mitsuo
Author_Institution :
Graduate Sch. of Sci. & Technol., Chiba Univ., Japan
Volume :
3
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
1628
Abstract :
The report proposes a simple and practical model for generating relatively monotonous, but sufficiently natural, prosodic features by analyzing restricted natural speech. The basic assumption of this model is that a natural F0 pattern can be obtained without complicated linguistic analysis. To achieve this prosodic control, the authors analyzed and modeled recorded speech subjects as follows. First, they hypothesized that a Japanese major phrase (MP) can be modeled as a combination of a sequence of fewer than three minor phrases (mp). The number of combinations is determined by the accentual type of the minor phrase and the intrasentence position. There are 28 combination patterns. To confirm the hypothesis, restricted speech (RSP) subjects were collected by having the speaker utter the subject sentences without emotional effect or attention to prosodic features, and were then analyzed. Furthermore, to evaluate the performance of the model, a pattern-matching process (two-level DP) was applied between the synthesized F0 pattern and the restricted real F0 pattern. The authors thus confirmed that the model creates a synthesized F0 pattern sufficiently similar to the restricted-speech patterns. Speech synthesized with this model sounds relatively monotonous, but is sufficiently natural compared with general spontaneous speech.
Keywords :
natural languages; pattern matching; speech processing; Japanese major phrase; Japanese minor phrase; Japanese prosodic pattern; Japanese prosodic pattern modeling; accentual minor phrase; intrasentence position; monotonous prosodic feature generation; natural F0 pattern; natural prosodic feature generation; pattern-matching process; performance evaluation; prosodic control; restricted natural speech analysis; restricted real F0 pattern; restricted speech; synthesized F0 pattern; Dynamic range; Information analysis; Intelligent systems; Laboratories; Natural languages; Pattern analysis; Performance analysis; Spatial databases; Speech analysis; Speech synthesis;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607936
Filename :
607936