DocumentCode
626519
Title
Improved structural similarity measurement for vocal signals
Author
Wei-Sheng Lai ; Chi-Jung Tseng ; Jian-Jiun Ding
Author_Institution
Grad. Inst. of Commun., Nat. Taiwan Univ., Taipei, Taiwan
fYear
2013
fDate
19-23 May 2013
Firstpage
301
Lastpage
304
Abstract
In recent years, the structural similarity (SSIM) index has been proposed for assessing both images and vocal signals in a way that matches human perception. The existing SSIMs for vocal signals are similar to those for images. However, human perception of voices differs from that of images: if two vocal signals differ only by phase, delay, or logistic frequency shift, they sound similar. In this paper, we propose the non-uniform sampling frequency mean SSIM (NUS-FMSSIM) to closely match human perception of voices. Simulations show that it is more robust to phase change, time shift, and logistic frequency shift than the existing SSIMs for vocal signals.
Keywords
audio signal processing; signal sampling; human perception; image assessments; logistic frequency shift; nonuniform sampling frequency mean SSIM; phase change; structural similarity measurement; time shift; vocal signal assessments; vocal signals; Distortion; Indexes; Robustness; Time-domain analysis; Time-frequency analysis;
fLanguage
English
Publisher
IEEE
Conference_Title
Circuits and Systems (ISCAS), 2013 IEEE International Symposium on
Conference_Location
Beijing
ISSN
0271-4302
Print_ISBN
978-1-4673-5760-9
Type
conf
DOI
10.1109/ISCAS.2013.6571842
Filename
6571842