• DocumentCode
    698776
  • Title

    Real-time speech visualization system: Kannon - Applying auditory characteristics

  • Author

    Nakamuro, Ken ; Haruki, Katsuhiro ; Sugimoto, Sueo

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Ritsumeikan Univ., Kusatsu, Japan
  • fYear
    2005
  • fDate
    4-8 Sept. 2005
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    We have been developing a real time speech-displaying system called “KanNon” which helps deaf person to understand speaker´s speech contents. We designed the KanNon system to display a sound spectrogram, pitch frequency and loudness of speech as well as characters by speech-recognition system as real-time scrolling image. For the purpose of displaying formant patterns clearly with high accuracy, we applied Burg method combining with the minimum cross-entropy (Burg-MCE) method, and human auditory characteristics such as an equal loudness preemphasis and mel-scale frequency to the sound spectrogram. Finally, we show more effective display for the spectrogram reading in the KanNon system.
  • Keywords
    minimum entropy methods; speech recognition; Burg-MCE method; KanNon system; human auditory characteristics; mel-scale frequency; minimum cross-entropy method; pitch frequency; real time speech-displaying system; real-time scrolling imaging; real-time speech visualization system; sound spectrogram; speaker speech content; speech loudness; speech-recognition system; Estimation; Frequency estimation; History; Predictive models; Spectrogram; Speech; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2005 13th European
  • Conference_Location
    Antalya
  • Print_ISBN
    978-160-4238-21-1
  • Type

    conf

  • Filename
    7078370