DocumentCode
231585
Title
Automatic detection of contrastive word pairs using textual and acoustic features
Author
Xiao Zang ; Zhiyong Wu ; Yishuang Ning ; Meng, Hsiang-Yun ; Lianhong Cai
Author_Institution
Tsinghua-CUHK Joint Res. Center for Media Sci., Technol. & Syst., Tsinghua Univ., Shenzhen, China
fYear
2014
fDate
19-23 Oct. 2014
Firstpage
594
Lastpage
598
Abstract
Labeling emphatic words from speech recordings plays an important role in building speech corpus for expressive speech synthesis. People generally pronounce some words stronger than usual, making the speech more expressive and signaling the focus of the sentence. Contrastive word pairs are often pronounced with stronger prominences and their presence modifies the meaning of the utterance in subtle but important ways. We used a subset of Switchboard corpus to study the acoustic characteristics of contrastive word pairs and the differences between contrastive and non-contrastive words. To address the problem of automatic detection of contrastive word pairs, support vector machines (SVMs) are used to automatically detect contrastive word pairs. We report the results for automatic detection of contrastive word pairs based on textual and acoustic features. By adding acoustic features, a much better performance is achieved.
Keywords
speech synthesis; support vector machines; acoustic features; automatic detection; contrastive word pairs; speech recordings; speech synthesis; support vector machines; switchboard corpus; textual features; Abstracts; Acoustics; Legged locomotion; Automatic detection; acoustic features; contrastive word pair; support vector machines (SVMs);
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing (ICSP), 2014 12th International Conference on
Conference_Location
Hangzhou
ISSN
2164-5221
Print_ISBN
978-1-4799-2188-1
Type
conf
DOI
10.1109/ICOSP.2014.7015073
Filename
7015073
Link To Document