Title :
Non-segmental analysis and synthesis based on a speech database
Author :
Slater, Andrew ; Coleman, John
Author_Institution :
Phonetics Lab., Oxford Univ., UK
Abstract :
The paper reports on experiments in non segmental speech analysis and synthesis using parameters derived from a speech database of British English monosyllables. The database includes almost every onset, nucleus and coda, and almost all onset nucleus and nucleus consonant combinations occurring in English. Acoustic parameters including f0, formant frequencies and bandwidths, and amplitude of voicing were determined for each token in the database. Fine duration differences within minimal pairs are analyzed using dynamic time warping techniques, avoiding the need for manual segmentation. For each parameter, a matrix of distances between all samples of the two words is calculated, together with a minimal path through the matrix (the warp path). The set of warp paths for all parameters identifies the nature and location of acoustic differences between the words, including locations of temporal expansion and compression. Preliminary experiments using dynamic time warping for non segmental synthesis are also discussed
Keywords :
database management systems; natural languages; speech processing; speech synthesis; British English monosyllables; acoustic differences; acoustic parameters; dynamic time warping; dynamic time warping techniques; f0; fine duration differences; formant frequencies; manual segmentation; minimal pairs; minimal path; non segmental speech analysis; non segmental speech synthesis; nucleus consonant combinations; onset nucleus; speech database; temporal expansion; warp path; Bandwidth; Databases; Frequency; Laboratories; Manuals; Speech analysis; Speech synthesis; Steady-state; Table lookup; Timing;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607287