• DocumentCode
    2290377
  • Title

    A Fractal-Based Approach for Speech Segmentation

  • Author

    Fantinato, Paulo César ; Guido, Rodrigo Capobianco ; Chen, Shi-Huang ; Santos, Bruno Leonardo Silveira ; Vieira, Lucimar Sasso ; Jonior, S.B. ; Rodrigues, Luciene Cavalcanti ; Sanchez, Fabrício Lopes ; Escola, Josão Paulo Lemos ; Souza, Leonardo Mendes ;

  • fYear
    2008
  • fDate
    15-17 Dec. 2008
  • Firstpage
    551
  • Lastpage
    555
  • Abstract
    Nowadays, fractal analysis has been successfully applied to digital speech processing, particularly for word and phoneme segmentation, which represents one of the fundamental steps in automatic speech recognition systems. The practical use of fractal analysis for this purpose should match two principles: low computational cost, to allow the use in real-time, and accuracy in the results, in order to produce a satisfactory segmentation, sending the correct data to the classifier. Aiming at meeting these two requirements, this work proposes a technique for speech segmentation based on the fractal dimension, which is obtained by using the discrete wavelet transform that avoids the use of 1/k pre-filtering. Many families of wavelets are presented and compared, and the results assure the efficacy of the proposed method.
  • Keywords
    discrete wavelet transforms; filtering theory; fractals; speech processing; speech recognition; word processing; 1/k prefiltering; automatic speech recognition system; digital speech processing; discrete wavelet transform; fractal analysis; fractal-based approach; phoneme segmentation; speech segmentation; word segmentation; Automatic speech recognition; Computational efficiency; Computer science; Discrete wavelet transforms; Educational institutions; Fractals; Hidden Markov models; Physics; Speech analysis; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia, 2008. ISM 2008. Tenth IEEE International Symposium on
  • Conference_Location
    Berkeley, CA
  • Print_ISBN
    978-0-7695-3454-1
  • Electronic_ISBN
    978-0-7695-3454-1
  • Type

    conf

  • DOI
    10.1109/ISM.2008.123
  • Filename
    4741225