• DocumentCode
    417199
  • Title

    Predicting foreground SH, SL and BNH DAM scores for multidimensional objective measure of speech quality

  • Author

    Sen, D.

  • Author_Institution
    New South Wales Univ., Sydney, NSW, Australia
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Current objective measures of speech quality (Rix et al. (2000), Beerends et al. (1994)) attempt to evaluate degraded speech by calculating a single distance measure between the original signal and the synthesized signal being evaluated. The distance measure is usually computed after both the original and synthesized signals have been transformed to represent the effect of the auditory periphery. However, the fact that the subjective judgement of quality is based on a multidimensional perceptual space representation suggests that a measure based on predicting a multitude of independent perceptual characteristics would yield better results and be applicable to a wider range of distortions and speech synthesis systems. This paper presents such a multidimensional approach to objective evaluation of speech quality and is directly motivated by the work of Voiers (2001), from which the subjective evaluation procedure known as the diagnostic acceptability measure (DAM) was created. While the DAM is a subjective measure of the detectability of the distortions identified by Voiers, this work reports on the first baby steps taken toward objective evaluation of a subset of those same parametric distortions, determined to be the principal components of the quality space in a previous statistical analysis (Sen (2001)).
  • Keywords
    prediction theory; principal component analysis; signal representation; speech enhancement; speech synthesis; BNH DAM scores; PCA; SL; diagnostic acceptability measure; foreground SH prediction; independent perceptual characteristics; multidimensional objective measure; multidimensional perceptual space representation; objective evaluation; parametric distortions; quality space; speech quality; statistical analysis; Current measurement; Degradation; Distortion measurement; Extraterrestrial measurements; Multidimensional systems; Pediatrics; Signal synthesis; Speech analysis; Speech synthesis; Statistical analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326030
  • Filename
    1326030