Title :
Real-time speech visualization system: KanNon - Applying auditory characteristics
Author :
Nakamuro, Ken ; Haruki, Katsuhiro ; Sugimoto, Sueo
Author_Institution :
Dept. of Electr. & Electron. Eng., Ritsumeikan Univ., Kusatsu, Japan
Abstract :
We have been developing a real-time speech-displaying system called “KanNon”, which helps deaf people understand the content of a speaker's speech. We designed the KanNon system to display a sound spectrogram, the pitch frequency, and the loudness of speech, together with characters produced by a speech-recognition system, as a real-time scrolling image. To display formant patterns clearly and with high accuracy, we applied the Burg method combined with the minimum cross-entropy method (Burg-MCE), together with human auditory characteristics such as equal-loudness pre-emphasis and mel-scale frequency, to the sound spectrogram. Finally, we show a more effective display for spectrogram reading in the KanNon system.
Keywords :
minimum entropy methods; speech recognition; Burg-MCE method; KanNon system; human auditory characteristics; mel-scale frequency; minimum cross-entropy method; pitch frequency; real time speech-displaying system; real-time scrolling imaging; real-time speech visualization system; sound spectrogram; speaker speech content; speech loudness; speech-recognition system; Estimation; Frequency estimation; History; Predictive models; Spectrogram; Speech; Speech recognition;
Conference_Title :
Signal Processing Conference, 2005 13th European
Conference_Location :
Antalya
Print_ISBN :
978-160-4238-21-1