DocumentCode :
3166322
Title :
Corpus-independent history compression for stochastic turn-taking models
Author :
Laskowski, Kornel ; Shriberg, Elizabeth
Author_Institution :
Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4937
Lastpage :
4940
Abstract :
Stochastic turn-taking models use a truncated representation of past speech activity to specify how likely a speaker is to talk at the next instant. An unanswered question in such modeling is how far back to extend the conditioning context. We study this question using Switchboard (English, telephone) and Spontal (Swedish, face-to-face) conversations. We also explore whether to trade off precision with range when moving backward in the history. We find that (1) a nearly logarithmic compression of history is optimal, for both speaker and interlocutor; (2) the absolute duration of the conditioning context is at least 7 seconds; and (3) the compression scheme generalizes remarkably well across the two different corpora.
Keywords :
speech processing; Spontal conversations; Switchboard conversations; corpus-independent history compression; history logarithmic compression; speech activity representation; stochastic turn-taking models; Context; Context modeling; Data models; Entropy; History; Speech; Switches; Turn-taking; conversational speech; dialogue; diarization; speech activity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6289027
Filename :
6289027
Link To Document :
بازگشت