DocumentCode :
2651838
Title :
Text-based vs. vowel-based automatic evaluation of tracheoesophageal substitute voice
Author :
Haderlein, Tino ; Bocklet, Tobias ; Nöth, Elmar ; Rosanowski, Frank
Author_Institution :
Dept. of Phoniatrics & Pedaudiology, Univ. of Erlangen-Nuremberg, Erlangen
fYear :
2008
fDate :
25-28 June 2008
Firstpage :
295
Lastpage :
298
Abstract :
The Hoarseness Diagram, a program for voice quality analysis using recordings of sustained vowels, was compared to an automatic speech recognition system with a module for prosodic analysis. The latter computed prosodic features on a text recording. We examined whether the voice analysis of sustained vowels and the text analysis correlate on a group of 24 male laryngectomees (average age: 60.6 ± 8.9 years) using tracheoesophageal substitute speech. Each person read the German version of the text "The North Wind and the Sun", which consists of 108 words. Additionally, 5 sustained vowels were recorded from each patient. The correlation between the measures obtained by the Hoarseness Diagram and the prosodic features from the prosody module was determined. Parameters like jitter, shimmer, F0 and irregularity computed by the Hoarseness Diagram on vowel recordings show correlations of about -0.8 with prosodic features obtained from the text recordings. Hence, voice properties can reliably be evaluated on both a vowel and a text recording. The text analysis, however, also offers possibilities for automatic speech evaluation, since it better represents a real communication situation.
Keywords :
patient rehabilitation; speech recognition; automatic speech evaluation; automatic speech recognition system; hoarseness diagram; jitter; prosodic analysis; shimmer; sustained vowel recording; text recording; tracheoesophageal substitute voice; voice quality analysis; voice rehabilitation; Audio recording; Automatic speech recognition; Electronic mail; Esophagus; High definition video; Humans; Jitter; Pattern recognition; Speech analysis; Text analysis; Hoarseness Diagram; Prosodic features; Substitute voice
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Systems, Signals and Image Processing, 2008. IWSSIP 2008. 15th International Conference on
Conference_Location :
Bratislava
Print_ISBN :
978-80-227-2856-0
Electronic_ISBN :
978-80-227-2880-5
Type :
conf
DOI :
10.1109/IWSSIP.2008.4604425
Filename :
4604425