• DocumentCode
    672335
  • Title

    Dysfluent speech detection by image forensics techniques

  • Author

    Palfy, Juraj ; Darjaa, Sakhia ; Pospichal, Jiri

  • Author_Institution
    Inst. of Inf., Bratislava, Slovakia
  • fYear
    2013
  • fDate
    8-12 Dec. 2013
  • Firstpage
    96
  • Lastpage
    101
  • Abstract
    As speech recognition has become popular, the importance of dysfluency detection increased considerably. Once a dysfluent event in spontaneous speech is identified, the speech recognition performance could be enhanced by eliminating its negative effect. Most existing techniques to detect such dysfluent events are based on statistical models. Sparse regularity of dysfluent events and complexity to describe such events in a speech recognition system makes its recognition rigorous. These problems are addressed by our algorithm inspired by image forensics. This paper suggests our algorithm developed to extract novel features of complex dysfluencies. The common steps of classifier design were used to statistically evaluate the proposed features of complex dysfluencies in spectral and cepstral domains. Support vector machines perform objective assessment of MFCC features, MFCC based derived features, PCA based derived features and kernel PCA based derived features of complex dysfluencies, where our derived features increased the performance by 46% opposite to MFCC.
  • Keywords
    image forensics; principal component analysis; speech recognition; support vector machines; MFCC features; dysfluent speech detection; image forensics techniques; kernel PCA; speech recognition system; statistical models; support vector machines; Feature extraction; Kernel; Mel frequency cepstral coefficient; Principal component analysis; Speech; Speech recognition; Vectors; Dysfluency detection; image forensics; speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
  • Conference_Location
    Olomouc
  • Type

    conf

  • DOI
    10.1109/ASRU.2013.6707712
  • Filename
    6707712