DocumentCode :
3123871
Title :
A syllable-based prosody modeling for L1 and L2 English speeches
Author :
Wei-Fan Chen ; Chin-Kuan Kuo ; Yih-Ru Wang ; Sin-Horng Chen
Author_Institution :
Dept. of Electr. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
fYear :
2012
fDate :
5-8 Dec. 2012
Firstpage :
281
Lastpage :
285
Abstract :
This paper proposes a statistical prosody modeling approach for L1 and L2 English speech. The study models two prosodic-acoustic features: syllable duration and log-pitch contour. Several major affecting factors (AFs) that influence the variation of these two features are considered: lexical stress, word length, nearby break type, the phonemic constituents of the syllable, and prosodic state. A sequential optimization procedure is adopted to train the two models automatically from the TWNAESOP corpus recorded in Taiwan. Experimental results showed that most of the estimated AFs agreed well with our prior linguistic knowledge. The differences between the prosody of L1 and L2 speech were also explored.
Keywords :
acoustic signal processing; linguistics; natural language processing; optimisation; speech processing; statistical analysis; text analysis; AF; L1 English speech; L2 English speech; TWNAESOP corpus; Taiwan; affecting factor; lexical stress; linguistic knowledge; log-pitch contour; phonemic constituent; prosodic state; prosodic-acoustic feature; sequential optimization procedure; statistical prosody modeling; syllable duration; syllable-based prosody modeling; word length; Polynomials; Pragmatics; Scattering; Speech; Standards; Stress; Vectors; Prosody modeling; duration modeling; pitch contour modeling
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
Type :
conf
DOI :
10.1109/ISCSLP.2012.6423464
Filename :
6423464