• DocumentCode
    626519
  • Title

    Improved structural similarity measurement for vocal signals

  • Author

    Wei-Sheng Lai ; Chi-Jung Tseng ; Jian-Jiun Ding

  • Author_Institution
    Grad. Inst. of Commun., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2013
  • fDate
    19-23 May 2013
  • Firstpage
    301
  • Lastpage
    304
  • Abstract
    In recent years, the SSIM was proposed for image and vocal signal assessments to match human perception. The existing SSIMs for vocal signals are similar to those for images. However, the human perceptions for voices and images are different. If two vocal signals differ only by phase, delay, or logistic frequency shift, they are heard similarly. In this paper, we propose the non-uniform sampling frequency mean SSIM (NUS-FMSSIM) to highly match the human perception for voices. Simulations show that it is more robust to phase change, time shift, and logistic frequency shift than the existing SSIMs for vocal signals.
  • Keywords
    audio signal processing; signal sampling; human perception; image assessments; logistic frequency shift; nonuniform sampling frequency mean SSIM; phase change; structural similarity measurement; time shift; vocal signal assessments; vocal signals; Distortion; Indexes; Robustness; Time-domain analysis; Time-frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems (ISCAS), 2013 IEEE International Symposium on
  • Conference_Location
    Beijing
  • ISSN
    0271-4302
  • Print_ISBN
    978-1-4673-5760-9
  • Type

    conf

  • DOI
    10.1109/ISCAS.2013.6571842
  • Filename
    6571842