DocumentCode :
3530639
Title :
Genre effects on automatic sentence segmentation of speech: A comparison of broadcast news and broadcast conversations
Author :
Kolar, Jáchym ; Liu, Yang ; Shriberg, Elizabeth
Author_Institution :
Dept. of Cybern., Univ. of West Bohemia, Pilsen
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4701
Lastpage :
4704
Abstract :
We investigate genre effects on the task of automatic sentence segmentation, focusing on two important domains - broadcast news (BN) and broadcast conversation (BC). We employ an HMM model based on textual and prosodic information and analyze differences in segmentation accuracy and feature usage between the two genres using both manual and automatic speech transcripts. Experiments are evaluated using Czech broadcast corpora annotated for sentence-like units (SUs). Prosodic features capture information about pause, duration, pitch, and energy patterns. Textual knowledge sources include words, part-of-speech, and automatically induced classes. We also analyze effects of using additional textual data that is not annotated for SUs. Feature analysis reveals significant differences in both textual and prosodic feature usage patterns between the two genres. The analysis is important for building automatic understanding systems when limited matched-genre data are available, or for designing eventual genre-independent systems.
Keywords :
hidden Markov models; speech recognition; automatic speech sentence segmentation; broadcast conversation; broadcast news; hidden Markov model; prosodic information; speech recognition; speech transcript; textual information; Automatic speech recognition; Broadcasting; Buildings; Computer science; Cybernetics; Hidden Markov models; Natural languages; Speech analysis; Speech recognition; Testing; Spoken language understanding; broadcast conversations; broadcast news; prosody; sentence segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960680
Filename :
4960680
Link To Document :
بازگشت