Title :
Vowel-based frequency alignment function design and recognition-based time alignment for automatic speech morphing
Author :
Onishi, Masato ; Takahashi, Toru ; Irino, Toshio ; Kawahara, Hideki
Author_Institution :
Fac. of Syst. Eng., Wakayama Univ., Wakayama
Abstract :
New design procedures of time-frequency alignment for automatic speech morphing are proposed. The frequency alignment function at a specific frame is represented as a weighted average of vowel alignment functions based on similarity to each vowel. Julian, an open source speech recognition system, was used to design a time alignment function. Objective and subjective tests were conducted to evaluate the proposed method, and test results indicated that the proposed method yields comparable naturalness to the manually morphed samples in terms of time alignment. The results also illustrated that the proposed frequency alignment provides significantly better naturalness than morphed samples without frequency alignment.
Keywords :
speech recognition; automatic speech morphing; open source speech recognition system; recognition-based time alignment; vowel-based frequency alignment function; Automatic speech recognition; Design engineering; Informatics; Labeling; Spectrogram; Speech analysis; Speech recognition; Systems engineering and theory; Testing; Time frequency analysis; Audio systems; Auditory system; Signal processing; Speech communication; Speech processing; Speech recognition; Speech synthesis; Vocal system;
Conference_Titel :
Spoken Language Technology Workshop, 2008. SLT 2008. IEEE
Conference_Location :
Goa
Print_ISBN :
978-1-4244-3471-8
Electronic_ISBN :
978-1-4244-3472-5
DOI :
10.1109/SLT.2008.4777831