DocumentCode
698776
Title
Real-time speech visualization system: Kannon - Applying auditory characteristics
Author
Nakamuro, Ken ; Haruki, Katsuhiro ; Sugimoto, Sueo
Author_Institution
Dept. of Electr. & Electron. Eng., Ritsumeikan Univ., Kusatsu, Japan
fYear
2005
fDate
4-8 Sept. 2005
Firstpage
1
Lastpage
4
Abstract
We have been developing a real time speech-displaying system called “KanNon” which helps deaf person to understand speaker´s speech contents. We designed the KanNon system to display a sound spectrogram, pitch frequency and loudness of speech as well as characters by speech-recognition system as real-time scrolling image. For the purpose of displaying formant patterns clearly with high accuracy, we applied Burg method combining with the minimum cross-entropy (Burg-MCE) method, and human auditory characteristics such as an equal loudness preemphasis and mel-scale frequency to the sound spectrogram. Finally, we show more effective display for the spectrogram reading in the KanNon system.
Keywords
minimum entropy methods; speech recognition; Burg-MCE method; KanNon system; human auditory characteristics; mel-scale frequency; minimum cross-entropy method; pitch frequency; real time speech-displaying system; real-time scrolling imaging; real-time speech visualization system; sound spectrogram; speaker speech content; speech loudness; speech-recognition system; Estimation; Frequency estimation; History; Predictive models; Spectrogram; Speech; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2005 13th European
Conference_Location
Antalya
Print_ISBN
978-160-4238-21-1
Type
conf
Filename
7078370
Link To Document