DocumentCode
730747
Title
Extraction of pitch register from expressive speech in Japanese
Author
Jinfu Ni ; Shiga, Yoshinori ; Hori, Chiori
Author_Institution
Universal Commun. Res. Inst., Spoken Language Commun. Lab., Nat. Inst. of Inf. & Commun. Technol., Kyoto, Japan
fYear
2015
fDate
19-24 April 2015
Firstpage
4764
Lastpage
4768
Abstract
Human uses intonation to make focal prominence to give emphasis that highlights the focus of speech. Automatic extraction of proper intonation features from a speech corpus is desirous for processing speech prosody, especially in the context of speech synthesis. This paper presents a method to extract pitch register from observed F0 contours for this purpose. The method utilizes a constrained tone transformation technique under an assumption that lexical accents are confined to parallel high and low tone lines with a limited constant span. Consequently, the extracted pitch register captures dynamic range variation of the pitch accents of an utterance. The method is evaluated by objective tests upon a large-scale expressive speech corpus. A finding is that proper intonation manifested in pitch register in Japanese is very comparable with English intonation in the sense of structural form.
Keywords
feature extraction; speech synthesis; English intonation; Japanese; automatic extraction; constrained tone transformation technique; dynamic range variation; focal prominence; large-scale expressive speech corpus; lexical accents; low tone lines; pitch accents; pitch register; proper intonation features; speech synthesis; Measurement; Registers; Fundamental frequency analysis; intonation proper; pitch decomposition; pitch register; speech prosody;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178875
Filename
7178875
Link To Document