• DocumentCode
    1713291
  • Title

    Dynamical spectrograms that can be perceived as visual gestures

  • Author

    Farani, A. A Soltani ; Chilton, E. H S ; Shirley, Robin

  • Author_Institution
    Centre for Vision, Speech & Signal Process., Surrey Univ., Guildford, UK
  • fYear
    1998
  • Firstpage
    537
  • Lastpage
    540
  • Abstract
    A new system for speech visualisation, has been implemented to allow deaf and hearing-impaired people to understand verbal information over channels such as the ordinary public telephone system. Incorporating a computational model of the human ear, the system converts incoming sounds into a sequence of animated images, which show the temporal variations of the spectral pattern of the input sound in real-time and which are perceived like visual gestures. Preliminary results from forced-choice tests with 28 human subjects are reported, using a sequence of 2- to 4-word sets. To demonstrate the language independence of this approach, some of these were taken from 4 very different languages-English, Persian, French and Czech. The results show high levels of recognition after only 10 learning trials (typical mean scores of 50-85%, where zero represents chance expectation), and encourage further investigation
  • Keywords
    computer animation; gesture recognition; handicapped aids; hearing; image sequences; natural languages; spectral analysis; speech processing; visual communication; visual perception; Czech; English; French; Persian; animated images; computational model; deaf people; dynamical spectrograms; forced-choice tests; hearing-impaired people; human ear; image recognition; input sound spectral pattern; language independence; learning trials; public telephone system; speech visualisation; temporal variations; verbal information; visual gestures; Computational modeling; Deafness; Ear; Humans; Image converters; Natural languages; Spectrogram; Speech; Telephony; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Time-Frequency and Time-Scale Analysis, 1998. Proceedings of the IEEE-SP International Symposium on
  • Conference_Location
    Pittsburgh, PA
  • Print_ISBN
    0-7803-5073-1
  • Type

    conf

  • DOI
    10.1109/TFSA.1998.721480
  • Filename
    721480