DocumentCode
3123871
Title
A syllable-based prosody modeling for L1 and L2 English speeches
Author
Wei-Fan Chen ; Chin-Kuan Kuo ; Yih-Ru Wang ; Sin-Horng Chen
Author_Institution
Dept. of Electr. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
fYear
2012
fDate
5-8 Dec. 2012
Firstpage
281
Lastpage
285
Abstract
In this paper, a statistical prosody modeling approach for L1 and L2 English speeches is proposed. The study focuses on the modeling of two prosodic-acoustic features: syllable duration and log-pitch contour. Several major affecting factors (AFs) that influence the variations of these two features are considered. They include lexical stress, word length, nearby break type, phonemic constituent of syllable, and prosodic state. A sequential optimization procedure is adopted to automatically train the two models from the TWNAESOP corpus recorded in Taiwan. Experimental results showed that most AFs estimated agreed well with our prior linguistic knowledge. The differences in the prosody of L1 and L2 speeches were also explored.
Keywords
acoustic signal processing; linguistics; natural language processing; optimisation; speech processing; statistical analysis; text analysis; AF; L1 English speech; L2 English speech; TWNAESOP corpus; Taiwan; affecting factor; lexical stress; linguistic knowledge; log-pitch contour; phonemic constituent; prosodic state; prosodic-acoustic feature; sequential optimization procedure; statistical prosody modeling; syllable duration; syllable-based prosody modeling; word length; Polynomials; Pragmatics; Scattering; Speech; Standards; Stress; Vectors; L2 English speech; Prosody modeling; duration modeling; pitch contour modeling;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location
Kowloon
Print_ISBN
978-1-4673-2506-6
Electronic_ISBN
978-1-4673-2505-9
Type
conf
DOI
10.1109/ISCSLP.2012.6423464
Filename
6423464
Link To Document