DocumentCode :
394340
Title :
Using phone and diphone based acoustic models for voice conversion: a step towards creating voice fonts
Author :
Kumar, Arun ; Verma, Ashish
Author_Institution :
Centre for Appl. Res. in Electron., Indian Inst. of Technol., New Delhi, India
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
Voice conversion techniques attempt to modify a speech signal so that it is perceived as if spoken by another speaker, different from the original speaker. In this paper, we present a novel approach to voice conversion. Our approach uses acoustic models based on units of speech, such as phones and diphones. These models can be computed and used independently for a given speaker, without regard to the source or target speaker. The approach therefore avoids the need for a parallel speech corpus in the voices of the source and target speakers. We show that, using the proposed approach, voice fonts can be created and stored that represent the individual characteristics of a particular speaker and can be used to customize synthetic speech. We also show, through objective and subjective tests, that the voice conversion quality is comparable to that of other approaches that require a parallel speech corpus.
Keywords :
cepstral analysis; speech processing; speech synthesis; diphone based acoustic models; objective tests; phone based acoustic models; source speakers; speech signal; subjective tests; synthetic speech customization; target speakers; voice conversion; voice fonts; Bit rate; Loudspeakers; Postal services; Quantization; Speech coding; Speech recognition; Speech synthesis; Stress; Target recognition; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198882
Filename :
1198882