DocumentCode
2309888
Title
A Sinusoidal Model Approach to Acoustic Landmark Detection and Segmentation for Robust Segment-Based Speech Recognition
Author
Sainath, Tara N. ; Hazen, Timothy J.
Author_Institution
Comput. Sci. & Artificial Intelligence Lab., MIT
Volume
1
fYear
2006
fDate
14-19 May 2006
Abstract
In this paper, we present a noise robust landmark detection and segmentation algorithm using a sinusoidal model representation of speech. We compare the performance of our approach under noisy conditions against two segmentation methods used in the SUMMIT segment-based speech recognizer, a full segmentation approach and an approach that detects segment boundaries based on spectral change. The word error rate of the spectral change segmentation method degrades rapidly in the presence of noise, while the sinusoidal and full segmentation models degrade more gracefully. However, the full segmentation method requires the largest computation time of the three approaches. We find that our new algorithm provides the best tradeoff between word accuracy and computation time of the three methods. Furthermore, we find that our model is robust when speech is contaminated by various noise types
Keywords
acoustic signal detection; speech recognition; SUMMIT segment-based speech recognizer; acoustic landmark detection; acoustic landmark segmentation; noise; sinusoidal model approach; spectral change segmentation method; word error rate; Acoustic noise; Acoustic signal detection; Change detection algorithms; Error analysis; Hidden Markov models; Mathematical model; Noise robustness; Speech analysis; Speech recognition; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1660073
Filename
1660073
Link To Document