DocumentCode
167144
Title
A multistage algorithm for fricative spotting
Author
Ruinskiy, Dima ; Lavner, Yizhar
Author_Institution
Dept. of Comput. Sci., Tel-Hai Coll., Galilee, Israel
fYear
2014
fDate
11-13 April 2014
Firstpage
1
Lastpage
5
Abstract
We present an algorithm for spotting fricative consonants in continuous speech. Fricative spotting can be useful in professional audio applications, where excessive accentuation of these phonemes can degrade the aesthetics of voice recordings, or in applications for the hearing-impaired, where certain manipulations can increase their perception. All stages of our algorithm rely only on features extracted directly from the audio signal and on common classification techniques, making it simple to implement and language-invariant. In the first stage, a linear classifier, pre-trained using the Fisher´s Linear Discriminant Analysis (LDA) method, is used to detect fricatives inside speech sentences. In the second stage, the detected phonemes are further analyzed using a decision-tree classifier, attempting to reject false detections. Tested on the full corpus of the TIMIT audio database the algorithm achieved very good detection rates across the entire range of fricative phonemes.
Keywords
audio signal processing; decision trees; signal classification; speech processing; TIMIT audio database; continuous speech; decision-tree classifier; fricative spotting; linear classifier; linear discriminant analysis; multistage algorithm; professional audio applications; speech sentences; Algorithm design and analysis; Classification algorithms; Decision trees; Speech; Speech recognition; Support vector machine classification; Training; Fricatives; Linear Discriminant Analysis; decision tree; phoneme spotting;
fLanguage
English
Publisher
ieee
Conference_Titel
Pacific Voice Conference (PVC), 2014 XXII Annual
Conference_Location
Krakow
Print_ISBN
978-1-4799-3699-1
Type
conf
DOI
10.1109/PVC.2014.6845421
Filename
6845421
Link To Document