Title :
Speaker and language independent voice quality classification applied to unlabelled corpora of expressive speech
Author :
Kane, John ; Scherer, Stefan ; Aylett, Matthew ; Morency, Louis-Philippe ; Gobl, Christer
Author_Institution :
Phonetics & Speech Lab., Trinity Coll. Dublin, Dublin, Ireland
Abstract :
Voice quality plays a pivotal role in speech style variation. Therefore, control and analysis of voice quality is critical for many areas of speech technology. Until now, most work has focused on small purpose built corpora. In this paper we apply state-of-the-art voice quality analysis to large speech corpora built for expressive speech synthesis. A fuzzy-input fuzzy-output support vector machine classifier is trained and validated using features extracted from these corpora. We then apply this classifier to freely available audiobook data and demonstrate a clustering of the voice qualities that approximates the performance of human perceptual ratings. The ability to detect voice quality variation in these widely available unlabelled audiobook corpora means that the proposed method may be used as a valuable resource in expressive speech synthesis.
Keywords :
fuzzy set theory; pattern classification; speech processing; speech synthesis; support vector machines; audiobook corpora; expressive speech; fuzzy input fuzzy output support vector machine classifier; human perceptual ratings; language independent voice quality classification; speaker independent voice quality classification; speech corpora; speech style variation; speech synthesis; speech technology; state-of-the-art voice quality analysis; unlabelled corpora; Analysis of variance; Educational institutions; Feature extraction; Hidden Markov models; Speech; Speech synthesis; Support vector machines; Voice quality; audiobooks; expressive speech; glottal source; speech synthesis;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639219