Title :
Minimal Representation of Speech Signals for Generation of Emotion Speech and Human-Robot Interaction
Author :
Lee, Heyoung ; Bien, Z. Zenn
Author_Institution :
Seoul Nat. Univ. of Technol., Seoul
Abstract :
In this paper minimal representation of voiced speech based on decomposition into AM-FM components is proposed for generation of emotion speech. For the decomposition, firstly time-frequency boundaries of AM-FM components are estimated and secondary each AM-FM component is extracted by using the variable bandwidth filter adaptive to the estimated time-frequency boundaries. Finally, two parameters, that is, instantaneous frequency and instantaneous amplitude of each AM-FM component are estimated. The set composed of instantaneous amplitudes and instantaneous frequencies is the minimal representation of voiced speech signals. The minimal representation is optimal feature set since the set describes effectively the biomechanical characteristics of the vocal codes and the vocal track. Raw speech signals are modified by changing the parameters for generation of emotion speech.
Keywords :
adaptive filters; emotion recognition; man-machine systems; robots; signal representation; source separation; speech processing; AM-FM components; adaptive filter; biomechanical characteristics; emotion speech generation; human-robot interaction; signal decomposition; time-frequency boundary; variable bandwidth filter; vocal codes; vocal track; voiced speech signal representation; Electronic mail; Feature extraction; Frequency estimation; Independent component analysis; Larynx; Sensor arrays; Signal generators; Signal processing; Speech; Time frequency analysis;
Conference_Titel :
Robot and Human interactive Communication, 2007. RO-MAN 2007. The 16th IEEE International Symposium on
Conference_Location :
Jeju
Print_ISBN :
978-1-4244-1634-9
Electronic_ISBN :
978-1-4244-1635-6
DOI :
10.1109/ROMAN.2007.4415068