Title :
Automatic evaluation of hypernasality and speech intelligibility for children with cleft palate
Author :
Ling He ; Jing Zhang ; Qi Liu ; Heng Yin ; Lech, Margaret
Author_Institution :
Sch. of Electr. Eng. & Inf., Sichuan Univ., Chengdu, China
Abstract :
The speech of cleft palate (CP) patients has typical characteristics. Hypernasality and low speech intelligibility are the primary characteristics for CP speech. In this work, an automatic evaluation of different levels of hypernasality and speech intelligibility algorithm for CP speech was proposed, in order to provide an objective tool for speech therapist. To identify different levels of hypernasality, the short-time energy and Mel frequency cepstral coefficients were calculated as acoustic features, then Gaussian mixture model was applied as classifier. For the automatic speech intelligibility evaluation, the classical automatic isolated word recognition system was applied. The automatic speech recognition accuracy could be viewed as an indicator for various levels of speech intelligibility. The experiment results indicated that the proposed computer-based system achieved a good performance on the automatic classification of CP hypernasality and speech intelligibility levels. The average classification accuracy was over 79% for four types of hypernasality detection, and the automatic speech recognition accuracy decreased along with the drop of speech intelligibility.
Keywords :
cepstral analysis; feature extraction; medical disorders; medical signal processing; paediatrics; patient diagnosis; signal classification; speech intelligibility; speech processing; speech recognition; CP patient speech characteristics; Gaussian mixture model; Mel frequency cepstral coefficient calculation; acoustic feature calculation; automatic CP hypernasality classification; automatic child hypernasality evaluation; automatic child speech intelligibility evaluation; automatic isolated word recognition system; automatic speech intelligibility evaluation; automatic speech intelligibility level classification; automatic speech recognition accuracy; average classification accuracy; cleft palate; computer-based system performance; hypernasality detection; hypernasality level; objective speech therapist tool; short-time energy calculation; speech intelligibility algorithm; speech intelligibility level indicator; Accuracy; Feature extraction; Filter banks; Mel frequency cepstral coefficient; Speech; Speech recognition; automatic speech recognition; cleft palate; hypernasality; speech intelligibility;
Conference_Titel :
Industrial Electronics and Applications (ICIEA), 2013 8th IEEE Conference on
Conference_Location :
Melbourne, VIC
Print_ISBN :
978-1-4673-6320-4
DOI :
10.1109/ICIEA.2013.6566369