DocumentCode
417206
Title
A multi-pass linear fold algorithm for sentence boundary detection using prosodic cues
Author
Wang, Dagen ; Narayanan, Shrikanth S.
Author_Institution
Electr. Eng. Dept., Univ. of Southern California, Los Angeles, CA, USA
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
We propose a multi-pass linear fold algorithm for sentence boundary detection in spontaneous speech. It uses only prosodic cues and does not rely on segmentation information from a speech recognition decoder. We focus on features based on pitch breaks and pitch durations, study their local and global structural properties and find their relationship with sentence boundaries. In the first step, the algorithm, which requires no training, automatically finds a set of candidate pitch breaks by simple curve fitting. In the next step, by exploiting statistical properties of sentence boundaries and disfluency, the algorithm finds the sentence boundaries within these candidate pitch breaks. With this simple method without any explicit segmentation information from an ASR, a 25% error rate was achieved on a randomly selected portion of the switchboard corpus. The result from this method is comparable with those that include word segmentation information and can be used in conjunction to improve the overall performance and confidence.
Keywords
curve fitting; speech processing; speech recognition; statistical analysis; curve fitting; disfluency; multi-pass linear fold algorithm; performance; pitch breaks; pitch durations; prosodic cues; sentence boundary detection; speech recognition; spontaneous speech; statistical properties; switchboard corpus; Automatic speech recognition; Data mining; Decision trees; Error analysis; Feature extraction; Image processing; Image segmentation; Signal processing; Speech processing; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326038
Filename
1326038
Link To Document