• DocumentCode
    341870
  • Title

    An investigation of audio-visual speech recognition as applied to multimedia speech therapy applications

  • Author

    Georgopoulos, Voula C.

  • Author_Institution
    Dept. of Speech Therapy, Technol. Educ. Inst. of Patras, Greece
  • Volume
    1
  • fYear
    1999
  • fDate
    36342
  • Firstpage
    481
  • Abstract
    A multimedia speech therapy system should be able to be used for customized speech therapy for different problems and for different ages. The speech recognition must be designed to work with high inter- and intra-speaker variability. In addition to displaying text on a screen, recording the voice reading the text, analyzing the recorded spoken signal and performing speech recognition which includes identification of speech irregularities and tracking of patient progress, it should be capable of analyzing visual signal of the patients´ speech and provide visual as well as audio feedback. This implies that the synchronization of different media is important in realizing effective multimedia speech therapy applications. In order to perform speech recognition and identification tasks, time-frequency analysis and neural networks are proposed with integration of visual information
  • Keywords
    audio-visual systems; medical computing; multimedia systems; neural nets; speech recognition; audio feedback; audio-visual speech recognition; customized speech therapy; identification tasks; intra-speaker variability; multimedia speech therapy applications; multimedia speech therapy system; neural networks; patient progress; recorded spoken signal; speech irregularities; synchronization; time-frequency analysis; visual information; visual signal; Audio recording; Medical treatment; Multimedia systems; Neurofeedback; Performance analysis; Signal analysis; Signal processing; Speech analysis; Speech recognition; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Computing and Systems, 1999. IEEE International Conference on
  • Conference_Location
    Florence
  • Print_ISBN
    0-7695-0253-9
  • Type

    conf

  • DOI
    10.1109/MMCS.1999.779249
  • Filename
    779249