Title :
Modeling disfluencies in conversational speech
Author :
Siu, Man-Hung ; Ostendorf, Mari
Abstract :
Conversational speech is notably different from read speech in several ways, particularly in the presence of disfluencies but also in the frequent use of a small set of words that mark the flow of the discourse. Disfluencies are sometimes viewed as a “problem” in language modeling, where most previous work has focused on written text. In this paper, we take the view that disfluencies provide information themselves. In particular, we give evidence that filled pauses serve different functions, including marking linguistic unit and restart boundaries, and signaling hesitation where the speaker wants to hold the floor. The different functions can be connected to similar functions of other words common in spontaneous but not written speech, and the particular function affects the word conditioning choices in a variable n-gram model. Thus, at least some of the idiosyncrasies of spontaneous speech can be viewed as a source of information for language modeling rather than an interruption in the linguistic structure
Keywords :
linguistics; nomograms; conversational speech; discourse flow; disfluencies; filled pauses; hesitation signals; idiosyncrasies; information content; interruptions; language modeling; linguistic structure; linguistic unit boundary marking; restart boundary marking; spontaneous speech; variable n-gram model; word conditioning choices; words; History; Information resources; Natural languages; Speech recognition; Telephony; Testing;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607135