DocumentCode :
626519
Title :
Improved structural similarity measurement for vocal signals
Author :
Wei-Sheng Lai ; Chi-Jung Tseng ; Jian-Jiun Ding
Author_Institution :
Grad. Inst. of Commun., Nat. Taiwan Univ., Taipei, Taiwan
fYear :
2013
fDate :
19-23 May 2013
Firstpage :
301
Lastpage :
304
Abstract :
In recent years, the SSIM was proposed for image and vocal signal assessments to match human perception. The existing SSIMs for vocal signals are similar to those for images. However, the human perceptions for voices and images are different. If two vocal signals differ only by phase, delay, or logistic frequency shift, they are heard similarly. In this paper, we propose the non-uniform sampling frequency mean SSIM (NUS-FMSSIM) to highly match the human perception for voices. Simulations show that it is more robust to phase change, time shift, and logistic frequency shift than the existing SSIMs for vocal signals.
Keywords :
audio signal processing; signal sampling; human perception; image assessments; logistic frequency shift; nonuniform sampling frequency mean SSIM; phase change; structural similarity measurement; time shift; vocal signal assessments; vocal signals; Distortion; Indexes; Robustness; Time-domain analysis; Time-frequency analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems (ISCAS), 2013 IEEE International Symposium on
Conference_Location :
Beijing
ISSN :
0271-4302
Print_ISBN :
978-1-4673-5760-9
Type :
conf
DOI :
10.1109/ISCAS.2013.6571842
Filename :
6571842
Link To Document :
بازگشت