Title :
Parrot-like speaking using optimal vector quantization
Author :
Nakano, Ryohei ; Ueda, Naonori ; Saito, Kazumi ; Yamada, Takeshi
Author_Institution :
NTT Commun. Sci. Labs., Kyoto, Japan
Abstract :
Parrot-like speaking can be considered as one of the most fundamental abilities of humans or robots. It is not a transformation of a target speech signal, but a perception-and-action process: recognizing the target speech and producing a mimic one using a voice obtained from a voice owner. This paper presents a connectionist parrot-like speaking system. Our approach employs the record-and-edit approach with an acoustic wave segment as the processing unit, and uses a vector quantizer for two purposes: to build a segment database as a natural voice of a robot, and to cluster the segment database to speed up the mimicking. The experimental parrot system works mostly well, mimicking any target speech and sounding like a voice owner
Keywords :
database management systems; neural nets; optimisation; speech recognition; speech synthesis; vector quantisation; acoustic wave segment; connectionist parrot-like speaking system; mimicry; optimal vector quantization; perception-and-action process; record-and-edit approach; segment database; speech recognition; speech synthesis; voice; Acoustic measurements; Acoustic waves; Cepstral analysis; Cepstrum; Robots; Speech analysis; Speech processing; Speech recognition; Speech synthesis; Vector quantization;
Conference_Titel :
Neural Networks, 1995. Proceedings., IEEE International Conference on
Conference_Location :
Perth, WA
Print_ISBN :
0-7803-2768-3
DOI :
10.1109/ICNN.1995.488190