DocumentCode
3530268
Title
Detecting bandlimited audio in broadcast television shows
Author
Fuhs, Mark C. ; Jin, Qin ; Schultz, Tanja
Author_Institution
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA
fYear
2009
fDate
19-24 April 2009
Firstpage
4589
Lastpage
4592
Abstract
For TV and radio shows containing narrowband speech, Speech-to-text (STT) accuracy on the narrowband audio can be improved by using an acoustic model trained on acoustically matched data. To selectively apply it, one must first be able to accurately detect which audio segments are narrowband. The present paper explores two different bandwidth classification approaches: a traditional Gaussian mixture model (GMM) approach and a spline-based classifier that categorizes audio segments based on their power spectra. We focus on shows found in the DARPA GALE Mandarin training and test sets, where the ratio of wideband to narrowband shows is very large. In this setting, the spline-based classifier reduces the number of misclassified wideband segments by up to 95% relative to the GMM-based classifier for the same number of misclassified narrowband segments.
Keywords
pattern classification; speech recognition; speech synthesis; splines (mathematics); Gaussian mixture model; TV shows; acoustically matched data; audio segments; bandlimited audio detection; bandwidth classification; broadcast television shows; misclassified narrowband segments; narrowband audio; narrowband speech; radio shows; speech-to-text accuracy; spline-based classifier; Acoustic signal detection; Acoustic testing; Bandwidth; Decoding; Narrowband; Speech; Spline; TV broadcasting; Telephony; Wideband; Speech processing; pattern classification; speech recognition; telephony;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960652
Filename
4960652
Link To Document