Title :
Dysfluent speech detection by image forensics techniques
Author :
Palfy, Juraj ; Darjaa, Sakhia ; Pospichal, Jiri
Author_Institution :
Inst. of Inf., Bratislava, Slovakia
Abstract :
As speech recognition has become popular, the importance of dysfluency detection increased considerably. Once a dysfluent event in spontaneous speech is identified, the speech recognition performance could be enhanced by eliminating its negative effect. Most existing techniques to detect such dysfluent events are based on statistical models. Sparse regularity of dysfluent events and complexity to describe such events in a speech recognition system makes its recognition rigorous. These problems are addressed by our algorithm inspired by image forensics. This paper suggests our algorithm developed to extract novel features of complex dysfluencies. The common steps of classifier design were used to statistically evaluate the proposed features of complex dysfluencies in spectral and cepstral domains. Support vector machines perform objective assessment of MFCC features, MFCC based derived features, PCA based derived features and kernel PCA based derived features of complex dysfluencies, where our derived features increased the performance by 46% opposite to MFCC.
Keywords :
image forensics; principal component analysis; speech recognition; support vector machines; MFCC features; dysfluent speech detection; image forensics techniques; kernel PCA; speech recognition system; statistical models; support vector machines; Feature extraction; Kernel; Mel frequency cepstral coefficient; Principal component analysis; Speech; Speech recognition; Vectors; Dysfluency detection; image forensics; speech;
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
Conference_Location :
Olomouc
DOI :
10.1109/ASRU.2013.6707712