DocumentCode
1133487
Title
Speech segmentation and spoken document processing
Author
Ostendorf, Mari ; Favre, Benoit ; Grishman, Ralph ; Hakkani-Tur, Dilek ; Harper, Mary ; Hillard, Dustin ; Hirschberg, Julia ; Ji, Heng ; Kahn, Jeremy G. ; Liu, Yang ; Maskey, Sameer ; Matusov, Evgeny ; Ney, Hermann ; Rosenberg, Andrew ; Shriberg, Elizabet
Author_Institution
Univ. of Washington, Seattle
Volume
25
Issue
3
fYear
2008
fDate
5/1/2008
Firstpage
59
Lastpage
69
Abstract
Progress in both speech and language processing has spurred efforts to support applications that rely on spoken rather than written language input. A key challenge in moving from text-based documents to such spoken documents is that spoken language lacks explicit punctuation and formatting, which can be crucial for good performance. This article describes different levels of speech segmentation, approaches to automatically recovering segment boundary locations, and experimental results demonstrating impact on several language processing tasks. The results also show a need for optimizing segmentation for the end task rather than independently.
Keywords
natural language processing; speech processing; text analysis; language processing; segment boundary location recovery; speech segmentation; spoken document processing; text-based document; Auditory system; Automatic speech recognition; Broadcasting; Digital recording; History; Humans; Natural languages; Process design; Speech coding
fLanguage
English
Journal_Title
Signal Processing Magazine, IEEE
Publisher
IEEE
ISSN
1053-5888
Type
jour
DOI
10.1109/MSP.2008.918023
Filename
4490202