• DocumentCode
    698461
  • Title

    Enhanced output-based perceptual measure for predicting subjective quality of speech

  • Author

    Mahdi, A.E. ; Picovici, D.

  • Author_Institution
    Dept. of Electron. & Comput. Eng., Univ. of Limerick, Limerick, Ireland
  • fYear
    2005
  • fDate
    4-8 Sept. 2005
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper presents an enhanced version of a non-intrusive measure for assessment of speech quality of voice communication systems and evaluates its performance. The new measure, which uses only the output of the system, is based on measuring perception-based objective auditory distances between voiced parts of the output (processed) speech whose quality is to be evaluated to appropriately matching references extracted from one of four pre-formulated codebooks, depending on their estimated pitch values. The codebooks are formed by optimally clustering large number of parametric speech vectors extracted from a database of clean speech records. The measured auditory distances are then mapped into equivalent subjective Mean Opinion Scores (MOS). The required clustering and matching process was effected by using an efficient data-mining tool known as the Self-Organizing Map (SOM). The short-time Bark Spectrum analysis is used in order to achieve perception-based, speaker-independent parametric representation of the speech. Reported evaluation results show that the proposed enhanced speech quality assessment method provides quality scores that are highly correlated with MOS obtained by formal subjective listening tests.
  • Keywords
    self-organising feature maps; speech coding; data-mining tool; formal subjective listening tests; mean opinion scores; output-based perceptual measure; parametric speech vectors; perception-based objective auditory distances; preformulated codebooks; self-organizing map; short-time bark spectrum analysis; speaker-independent parametric representation; speech quality assessment; speech records; Distortion measurement; Speech; Speech coding; Speech enhancement; Support vector machine classification; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2005 13th European
  • Conference_Location
    Antalya
  • Print_ISBN
    978-160-4238-21-1
  • Type

    conf

  • Filename
    7078046