Title :
A computer-aided speech disorders correction system for Arabic language
Author :
Seddik, Ahmed Farag ; El Adawy, Mohamed ; Shahin, Ahmed Ismail
Author_Institution :
Biomed. Eng. Dept., Helwan Univ., Helwan, Egypt
Abstract :
In this paper, we propose a hybrid technique that integrates both the audio visual analysis techniques to automate speech disorders treatment for Arabic language that can be used in many developing countries. We proposed a technique that is based on audio visual analysis of the patient´s speech. For patient´s audio analysis, we used the mel frequency cepstrum coefficients (MFCC´s) and linear predictive cepstrum coefficients (LPCC´s) as the key features to classify the audio. In addition, we used visual features for the analysis of the patient´s video based on the patient´s appearance. Audio visual features techniques are combined for increasing the efficiency of our recognition system. We present a comparative evaluation for both audio and video features. We perform the features evaluation using Dynamic time warping (DTW) for speech features and histogram based approach analysis for visual feature. Finally, we used a neural network based classifier to differentiate between normal and abnormal speech. This research presents an expert system with an interactive computerized environment that has the ability to treat patients with speech disorders problems.
Keywords :
bioacoustics; biomedical optical imaging; classification; feature extraction; interactive systems; medical disorders; medical expert systems; medical image processing; neural nets; patient treatment; speech processing; Arabic language; DTW; LPCC feature; MFCC feature; abnormal speech differentiation; audio classification; audio visual feature technique; comparative audio feature evaluation; comparative video feature evaluation; computer-aided speech disorder correction system; dynamic time warping; expert system; histogram based method; hybrid audio-visual analysis; interactive computerized environment; linear predictive cepstrum coefficient; mel frequency cepstrum coefficient; neural network based classifier; patient appearance; patient audio analysis; patient speech audio-visual analysis; patient video analysis; recognition system efficiency; speech disorder patient treatment; speech disorder treatment automation; speech feature analysis; visual feature analysis; Feature extraction; Image color analysis; Lips; Mel frequency cepstral coefficient; Speech; Speech recognition; Visualization; Arabic speech recognition; Dynamic time wrapping; Lips segmentation; Neural Network; Speech Disorders Classification; Visual speech recognition;
Conference_Titel :
Advances in Biomedical Engineering (ICABME), 2013 2nd International Conference on
Conference_Location :
Tripoli
Print_ISBN :
978-1-4799-0249-1
DOI :
10.1109/ICABME.2013.6648836