DocumentCode
323763
Title
Sub-sentence discourse models for conversational speech recognition
Author
Ma, Kristine W. ; Zavaliagkos, George ; Meteer, Marie
Author_Institution
GTE/BBN Technol., Cambridge, MA, USA
Volume
2
fYear
1998
fDate
12-15 May 1998
Firstpage
693
Abstract
According to discourse theories in linguistics, conversational utterances possess an informational structure that partitions each sentence into two portions: a “given” and “new”. We explore this idea by building sub-sentence discourse language models for conversational speech recognition. The internal sentence structure is captured in statistical language modeling by training multiple n-gram models using the expectation-maximization algorithm on the Switchboard corpus. The resulting model contributes to a 30% reduction in language model perplexity and a small gain in word error rate
Keywords
grammars; natural languages; speech processing; speech recognition; statistical analysis; Switchboard corpus; conversational speech recognition; conversational utterances; expectation-maximization algorithm; given-new language model; informational structure; internal sentence structure; language model perplexity reduction; linguistics; multiple n-gram models training; statistical language modeling; sub-sentence discourse models; word error rate; Acoustic testing; Acoustic waves; Automatic testing; Data analysis; Data mining; Error analysis; Performance gain; Speech recognition; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.675359
Filename
675359
Link To Document