DocumentCode :
3527781
Title :
Extraction of cochlear processed formants for prediction of temporally localized distortions in synthesized speech
Author :
Lu, Wenliang ; Sen, D.
Author_Institution :
Univ. of New South Wales, Sydney, NSW
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
3977
Lastpage :
3980
Abstract :
Temporally localized distortions account for the most variance in subjective evaluation of coded speech signals. The ability to discern and decompose perceptually relevant temporally localized coding noise from other types of distortions is both of theoretical importance as well as a valuable tool for deploying and designing speech synthesis systems. The work described within, uses a physiologically motivated cochlear model to provide a trackable analysis of formant trajectories as processed by the cochlea. Subsequent statistical analysis shows simple relationships between the jitter of these trajectories and temporal attributes of the diagnostic acceptability measure (DAM).
Keywords :
ear; speech coding; speech synthesis; cochlear processed formants; coded speech signals; diagnostic acceptability measure; physiologically motivated cochlear model; speech synthesis systems; subjective evaluation; synthesized speech; temporally localized coding noise; temporally localized distortions; Distortion measurement; Frequency; Nonlinear distortion; Psychoacoustic models; Signal processing; Speech analysis; Speech coding; Speech enhancement; Speech processing; Speech synthesis; Diagnostic Acceptability Measure; Objective measurement of speech quality;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960499
Filename :
4960499
Link To Document :
بازگشت