Regional Information Center for Science and Technology - Improved structural similarity measurement for vocal signals

DocumentCode :

626519

Title :

Improved structural similarity measurement for vocal signals

Author :

Wei-Sheng Lai ; Chi-Jung Tseng ; Jian-Jiun Ding

Author_Institution :

Grad. Inst. of Commun., Nat. Taiwan Univ., Taipei, Taiwan

fYear :

2013

fDate :

19-23 May 2013

Firstpage :

301

Lastpage :

304

Abstract :

In recent years, the structural similarity (SSIM) index has been proposed for assessing both images and vocal signals in a way that matches human perception. Existing SSIMs for vocal signals are similar to those for images. However, human perception of voices differs from that of images: if two vocal signals differ only by phase, delay, or logistic frequency shift, they sound similar. In this paper, we propose the non-uniform sampling frequency mean SSIM (NUS-FMSSIM) to closely match human perception of voices. Simulations show that it is more robust to phase change, time shift, and logistic frequency shift than the existing SSIMs for vocal signals.

Keywords :

audio signal processing; signal sampling; human perception; image assessments; logistic frequency shift; nonuniform sampling frequency mean SSIM; phase change; structural similarity measurement; time shift; vocal signal assessments; vocal signals; Distortion; Indexes; Robustness; Time-domain analysis; Time-frequency analysis;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Circuits and Systems (ISCAS), 2013 IEEE International Symposium on

Conference_Location :

Beijing

ISSN :

0271-4302

Print_ISBN :

978-1-4673-5760-9

Type :

conf

DOI :

10.1109/ISCAS.2013.6571842

Filename :

6571842

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=626519