Title of article :
Intelligibility Evaluation of Pathological Speech through Multigranularity Feature Extraction and Optimization
Author/Authors :
Fang, Chunying School of Computer Science and Technology - Harbin Institute of Technology - Harbin, China , Li, Haifeng School of Computer Science and Technology - Harbin Institute of Technology - Harbin, China , Ma, Lin School of Computer Science and Technology - Harbin Institute of Technology - Harbin, China , Zhang, Mancai School of Computer Science and Technology - Harbin Institute of Technology - Harbin, China
Abstract :
Pathological speech usually refers to speech distortion resulting from illness or other biological insults. The assessment of
pathological speech plays an important role in assisting the experts, while automatic evaluation of speech intelligibility is difficult
because it is usually nonstationary and mutational. In this paper, we carry out an independent innovation of feature extraction and
reduction, and we describe a multigranularity combined feature scheme which is optimized by the hierarchical visual method. A
novel method of generating feature set based on 𝑆-transform and chaotic analysis is proposed. There are BAFS (430, basic acoustics
feature), local spectral characteristics MSCC (84, Mel 𝑆-transform cepstrum coefficients), and chaotic features (12). Finally, radar
chart and 𝐹-score are proposed to optimize the features by the hierarchical visual fusion. The feature set could be optimized from
526 to 96 dimensions based on NKI-CCRT corpus and 104 dimensions based on SVD corpus. The experimental results denote that
new features by support vector machine (SVM) have the best performance, with a recognition rate of 84.4% on NKI-CCRT corpus
and 78.7% on SVD corpus. The proposed method is thus approved to be effective and reliable for pathological speech intelligibility
evaluation.
Keywords :
Optimization , Multigranularity , NKI-CCRT , MSCC
Journal title :
Computational and Mathematical Methods in Medicine