Abstract:
Voiced-unvoiced-silence (V/UV/S) classification of speech sounds is important in automatic speech/speaker recognition, speech segmentation, speech signal compression, and speech analysis. Training-based classifiers suffer from a lack of training databases, or degrade when training and test statistics mismatch because of variations in speakers, languages, speaking styles, noise, transmission channels, etc. This paper proposes a novel voiced-unvoiced-silence classification method based on unsupervised learning. The class-dependent statistics needed for classification (the feature means, covariance matrices, and occurrence frequencies of the voiced, unvoiced, and silence classes) are estimated directly from the signal to be classified using Gaussian mixture models and the expectation-maximization algorithm. The method is evaluated on the NTIMIT database, and the results are encouraging: V/UV/S classification accuracy exceeds 91.15%, and voice activity detection accuracy exceeds 97.45%.
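The pipeline the abstract describes can be sketched as follows: extract frame-level features from the unlabeled signal, fit a three-component Gaussian mixture model with EM, and read the class-dependent statistics (means, variances, occurrence frequencies) off the fitted components. The numpy sketch below uses log-energy and zero-crossing rate as stand-in features and a hand-rolled diagonal-covariance EM; the actual features and model details in the paper may differ.

```python
import numpy as np

def frame_features(signal, frame_len=256):
    """Per-frame log-energy and zero-crossing rate, two simple V/UV/S cues."""
    n = len(signal) // frame_len
    frames = signal[:n * frame_len].reshape(n, frame_len)
    log_energy = np.log(np.sum(frames ** 2, axis=1) + 1e-10)
    zcr = np.mean(np.abs(np.diff(np.sign(frames), axis=1)) > 0, axis=1)
    return np.column_stack([log_energy, zcr])

def em_gmm(X, k=3, iters=50):
    """Fit a diagonal-covariance GMM with EM.

    Returns component means, variances, weights (occurrence frequencies),
    and per-frame responsibilities; argmax responsibility gives the class.
    """
    n, d = X.shape
    # Deterministic farthest-point initialization of the means.
    mu = [X[0]]
    for _ in range(k - 1):
        d2 = np.min(((X[:, None, :] - np.array(mu)) ** 2).sum(-1), axis=1)
        mu.append(X[d2.argmax()])
    mu = np.array(mu)
    var = np.tile(X.var(axis=0), (k, 1)) + 1e-6
    w = np.full(k, 1.0 / k)
    for _ in range(iters):
        # E-step: posterior responsibility of each component for each frame.
        logp = (-0.5 * (((X[:, None, :] - mu) ** 2) / var
                        + np.log(2 * np.pi * var)).sum(-1) + np.log(w))
        logp -= logp.max(axis=1, keepdims=True)
        r = np.exp(logp)
        r /= r.sum(axis=1, keepdims=True)
        # M-step: re-estimate weights, means, and variances.
        nk = r.sum(axis=0) + 1e-10
        w = nk / n
        mu = (r.T @ X) / nk[:, None]
        var = (r.T @ (X ** 2)) / nk[:, None] - mu ** 2 + 1e-6
    return mu, var, w, r
```

After fitting, one plausible labeling is to take the component with the lowest mean energy as silence and, of the remaining two, the one with the higher mean zero-crossing rate as unvoiced; that heuristic mapping from components to classes is an assumption here, not something the abstract specifies.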
Keywords:
Gaussian mixture models; expectation-maximization algorithm; unsupervised learning; voiced-unvoiced-silence classification; signal classification; automatic speech/speaker recognition; speaker recognition; speech analysis; speech segmentation; speech coding; speech signal compression; data compression; statistical testing; audio databases