DocumentCode :
1997401
Title :
Pitch-synchronous time alignment of speech signals for prosody transplantation
Author :
Latsch, Vagner L. ; Netto, Sergio L.
Author_Institution :
DEL, Fed. Univ. of Rio de Janeiro, Rio de Janeiro, Brazil
fYear :
2011
fDate :
15-18 May 2011
Firstpage :
2405
Lastpage :
2408
Abstract :
Prosody transplantation is a speech signal modification procedure usually used for voice transformation or to evaluate the quality of speech synthesizers. In practice, the pitch contour is mapped onto a common segmental content, and the target signal is modified by adjusting the position and length of speech frames to achieve the desired pitch contour and time duration from a speech reference. A new algorithm for prosody transplantation is presented based on a pitch-synchronous feature extraction of the speech signal, unifying the time-alignment and pitch-modification stages. The result is a computationally efficient algorithm for prosody transplantation that maximizes the spectral similarity between the target and reference signals.
Keywords :
feature extraction; speech synthesis; pitch contour; pitch-synchronous feature extraction; pitch-synchronous time alignment; prosody transplantation; spectral similarity; speech frames; speech reference; speech signal modification procedure; speech synthesizers; time duration; voice transformation; Approximation algorithms; Heuristic algorithms; Interpolation; Labeling; Partitioning algorithms; Speech; Speech processing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Circuits and Systems (ISCAS), 2011 IEEE International Symposium on
Conference_Location :
Rio de Janeiro
ISSN :
0271-4302
Print_ISBN :
978-1-4244-9473-6
Electronic_ISBN :
0271-4302
Type :
conf
DOI :
10.1109/ISCAS.2011.5938088
Filename :
5938088