Title :
Improved structural similarity measurement for vocal signals
Author :
Wei-Sheng Lai ; Chi-Jung Tseng ; Jian-Jiun Ding
Author_Institution :
Grad. Inst. of Commun., Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
In recent years, the structural similarity measurement (SSIM) has been proposed for assessing images and vocal signals in a way that matches human perception. The existing SSIMs for vocal signals are similar to those for images. However, human perception of voices differs from human perception of images. If two vocal signals differ only in phase, delay, or logistic frequency shift, they sound similar to a listener. In this paper, we propose the non-uniform sampling frequency mean SSIM (NUS-FMSSIM), which closely matches human perception of voices. Simulations show that it is more robust to phase change, time shift, and logistic frequency shift than the existing SSIMs for vocal signals.
Keywords :
audio signal processing; signal sampling; human perception; image assessments; logistic frequency shift; nonuniform sampling frequency mean SSIM; phase change; structural similarity measurement; time shift; vocal signal assessments; vocal signals; Distortion; Indexes; Robustness; Time-domain analysis; Time-frequency analysis
Conference_Title :
Circuits and Systems (ISCAS), 2013 IEEE International Symposium on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-5760-9
DOI :
10.1109/ISCAS.2013.6571842