DocumentCode
672335
Title
Dysfluent speech detection by image forensics techniques
Author
Palfy, Juraj ; Darjaa, Sakhia ; Pospichal, Jiri
Author_Institution
Inst. of Inf., Bratislava, Slovakia
fYear
2013
fDate
8-12 Dec. 2013
Firstpage
96
Lastpage
101
Abstract
As speech recognition has become popular, the importance of dysfluency detection increased considerably. Once a dysfluent event in spontaneous speech is identified, the speech recognition performance could be enhanced by eliminating its negative effect. Most existing techniques to detect such dysfluent events are based on statistical models. Sparse regularity of dysfluent events and complexity to describe such events in a speech recognition system makes its recognition rigorous. These problems are addressed by our algorithm inspired by image forensics. This paper suggests our algorithm developed to extract novel features of complex dysfluencies. The common steps of classifier design were used to statistically evaluate the proposed features of complex dysfluencies in spectral and cepstral domains. Support vector machines perform objective assessment of MFCC features, MFCC based derived features, PCA based derived features and kernel PCA based derived features of complex dysfluencies, where our derived features increased the performance by 46% opposite to MFCC.
Keywords
image forensics; principal component analysis; speech recognition; support vector machines; MFCC features; dysfluent speech detection; image forensics techniques; kernel PCA; speech recognition system; statistical models; support vector machines; Feature extraction; Kernel; Mel frequency cepstral coefficient; Principal component analysis; Speech; Speech recognition; Vectors; Dysfluency detection; image forensics; speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
Conference_Location
Olomouc
Type
conf
DOI
10.1109/ASRU.2013.6707712
Filename
6707712
Link To Document