DocumentCode :
788201
Title :
Single-Ended Speech Quality Measurement Using Machine Learning Methods
Author :
Falk, Tiago H. ; Chan, Wai-Yip
Author_Institution :
Dept. of Electr. & Comput. Eng., Queen´´s Univ., Kingston, Ont.
Volume :
14
Issue :
6
fYear :
2006
Firstpage :
1935
Lastpage :
1947
Abstract :
We describe a novel single-ended algorithm constructed from models of speech signals, including clean and degraded speech, and speech corrupted by multiplicative noise and temporal discontinuities. Machine learning methods are used to design the models, including Gaussian mixture models, support vector machines, and random forest classifiers. Estimates of the subjective mean opinion score (MOS) generated by the models are combined using hard or soft decisions generated by a classifier which has learned to match the input signal with the models. Test results show the algorithm outperforming ITU-T P.563, the current "state-of-art" standard single-ended algorithm. Employed in a distributed double-ended measurement configuration, the proposed algorithm is found to be more effective than P.563 in assessing the quality of noise reduction systems and can provide a functionality not available with P.862 PESQ, the current double-ended standard algorithm
Keywords :
Gaussian processes; learning (artificial intelligence); speech processing; support vector machines; Gaussian mixture models; machine learning methods; mean opinion score; multiplicative noise; noise reduction systems; random forest classifiers; single-ended speech quality measurement; speech signals; support vector machines; temporal discontinuities; Current measurement; Degradation; Impedance matching; Learning systems; Machine learning algorithms; Signal generators; Speech enhancement; Support vector machine classification; Support vector machines; Testing; Mean opinion score (MOS); objective quality measurement; quality model; single-ended measurement; speech communication; speech distortions; speech enhancement; speech quality; subjective quality;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2006.883253
Filename :
1709883
Link To Document :
بازگشت