DocumentCode :
698461
Title :
Enhanced output-based perceptual measure for predicting subjective quality of speech
Author :
Mahdi, A.E. ; Picovici, D.
Author_Institution :
Dept. of Electron. & Comput. Eng., Univ. of Limerick, Limerick, Ireland
fYear :
2005
fDate :
4-8 Sept. 2005
Firstpage :
1
Lastpage :
4
Abstract :
This paper presents an enhanced version of a non-intrusive measure for assessment of speech quality of voice communication systems and evaluates its performance. The new measure, which uses only the output of the system, is based on measuring perception-based objective auditory distances between voiced parts of the output (processed) speech whose quality is to be evaluated to appropriately matching references extracted from one of four pre-formulated codebooks, depending on their estimated pitch values. The codebooks are formed by optimally clustering large number of parametric speech vectors extracted from a database of clean speech records. The measured auditory distances are then mapped into equivalent subjective Mean Opinion Scores (MOS). The required clustering and matching process was effected by using an efficient data-mining tool known as the Self-Organizing Map (SOM). The short-time Bark Spectrum analysis is used in order to achieve perception-based, speaker-independent parametric representation of the speech. Reported evaluation results show that the proposed enhanced speech quality assessment method provides quality scores that are highly correlated with MOS obtained by formal subjective listening tests.
Keywords :
self-organising feature maps; speech coding; data-mining tool; formal subjective listening tests; mean opinion scores; output-based perceptual measure; parametric speech vectors; perception-based objective auditory distances; preformulated codebooks; self-organizing map; short-time bark spectrum analysis; speaker-independent parametric representation; speech quality assessment; speech records; Distortion measurement; Speech; Speech coding; Speech enhancement; Support vector machine classification; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2005 13th European
Conference_Location :
Antalya
Print_ISBN :
978-160-4238-21-1
Type :
conf
Filename :
7078046
Link To Document :
بازگشت