DocumentCode
940412
Title
An Objective Metric of Human Subjective Audio Quality Optimized for a Wide Range of Audio Fidelities
Author
Creusere, Charles D. ; Kallakuri, Kumar D. ; Vanam, Rahul
Author_Institution
Klipsch Sch. of Electr. & Comput. Eng., New Mexico State Univ., Las Cruces, NM
Volume
16
Issue
1
fYear
2008
Firstpage
129
Lastpage
136
Abstract
The goal of this paper is to develop an audio quality metric that can accurately quantify subjective quality over audio fidelities ranging from highly impaired to perceptually lossless. As one example of its utility, such a metric would allow scalable audio coding algorithms to be easily optimized over their entire operating ranges. We have found that the ITU-recommended objective quality metric, ITU-R BS.1387, does not accurately predict subjective audio quality over the wide range of fidelity levels of interest to us. In developing the desired universal metric, we use as a starting point the model output variables (MOVs) that make up BS.1387 as well as the energy equalization truncation threshold which has been found to be particularly useful for highly impaired audio. To combine these MOVs into a single quality measure that is both accurate and robust, we have developed a hybrid least-squares/minimax optimization procedure. Our test results show that the minimax-optimized metric is up to 36% lower in maximum absolute error compared to a similar metric designed using the conventional least-squares procedure.
Keywords
audio coding; least squares approximations; minimax techniques; ITU-R BS.1387; audio fidelity; energy equalization truncation threshold; human subjective audio quality metric; least-squares procedure; minimax optimization procedure; model output variable; scalable audio coding algorithm; Audio quality metrics; metric optimization; objective metrics; perceptual audio analysis; quality evaluation; universal quality metrics
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
IEEE
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2007.907571
Filename
4358089