Speech segmentation and spoken document processing

Author

Ostendorf, Mari ; Favre, Benoit ; Grishman, Ralph ; Hakkani-Tur, Dilek ; Harper, Mary ; Hillard, Dustin ; Hirschberg, Julia ; Ji, Heng ; Kahn, Jeremy G. ; Liu, Yang ; Maskey, Sameer ; Matusov, Evgeny ; Ney, Hermann ; Rosenberg, Andrew ; Shriberg, Elizabeth

Author_Institution

Univ. of Washington, Seattle

Volume

25

Issue

3

fYear

2008

fDate

5/1/2008

Firstpage

59

Lastpage

69

Abstract

Progress in both speech and language processing has spurred efforts to support applications that rely on spoken rather than written language input. A key challenge in moving from text-based documents to such spoken documents is that spoken language lacks explicit punctuation and formatting, which can be crucial for good performance. This article describes different levels of speech segmentation, approaches to automatically recovering segment boundary locations, and experimental results demonstrating impact on several language processing tasks. The results also show a need for optimizing segmentation for the end task rather than independently.

Keywords

natural language processing; speech processing; text analysis; language processing; segment boundary location recovery; speech segmentation; spoken document processing; text-based document; Auditory system; Automatic speech recognition; Broadcasting; Digital recording; History; Humans; Natural languages; Process design; Speech coding

fLanguage

English

Journal_Title

Signal Processing Magazine, IEEE

Publisher

IEEE

ISSN

1053-5888

Type

jour

DOI

10.1109/MSP.2008.918023

Filename

4490202