DocumentCode :
1051929
Title :
Temporal Compression of Speech: An Evaluation
Author :
Tucker, Simon ; Whittaker, Steve
Author_Institution :
Dept. of Inf. Studies, Univ. of Sheffield, Sheffield
Volume :
16
Issue :
4
fYear :
2008
fDate :
5/1/2008
Firstpage :
790
Lastpage :
796
Abstract :
Efficient browsing of speech recordings is problematic. The linear nature of speech, coupled with the lack of abstraction that the medium affords, means that listeners have to listen to long segments of a recording to locate points of interest. We explore temporal compression algorithms that attempt to reduce the amount of time users require to listen to speech recordings, while retaining the important content. This paper implements two main approaches to temporal compression: artificial speech rate alteration (speed-up) and unimportant segment removal (excision). We evaluate the effectiveness of these approaches by having listeners rate comprehension and listening effort for different types of temporal compression. For different compression levels, we compare performance of various implementations of speed-up and excision as well as techniques based on semantic features and acoustic features. Our results indicate that listeners prefer low compression levels, excision over speed-up, and algorithms based on semantic rather than acoustic features. Finally, listeners were negative about hybrid algorithms that used speed-up to indicate missing regions in an excised recording.
Keywords :
audio recording; data compression; information retrieval; speech coding; acoustic feature; artificial speech rate alteration; information browsing; semantic feature; speech recordings; temporal speech compression; unimportant segment removal; speech processing; text processing; user interfaces
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2008.916527
Filename :
4443890