DocumentCode :
1255607
Title :
Exploring the Predictability of Non-Unique Acoustic-to-Articulatory Mappings
Author :
Ananthakrishnan, G. ; Engwall, Olov ; Neiberg, Daniel
Author_Institution :
Centre for Speech Technology, KTH (Royal Institute of Technology), Stockholm, Sweden
Volume :
20
Issue :
10
fYear :
2012
Firstpage :
2672
Lastpage :
2682
Abstract :
This paper explores statistical tools that help analyze the predictability in the acoustic-to-articulatory inversion of speech, using an Electromagnetic Articulography database of simultaneously recorded acoustic and articulatory data. Since it has been shown that speech acoustics can be mapped to non-unique articulatory modes, the variance of the articulatory parameters is not sufficient to understand the predictability of the inverse mapping. We, therefore, estimate an upper bound to the conditional entropy of the articulatory distribution. This provides a probabilistic estimate of the range of articulatory values (either over a continuum or over discrete non-unique regions) for a given acoustic vector in the database. The analysis is performed for different British/Scottish English consonants with respect to which articulators (lips, jaws or the tongue) are important for producing the phoneme. The paper shows that acoustic-articulatory mappings for the important articulators have a low upper bound on the entropy, but can still have discrete non-unique configurations.
Keywords :
Acoustics; Entropy; Lips; Speech; Statistical analysis; Tongue; Upper bound; Acoustic-to-articulatory inversion; entropy of GMM (Gaussian mixture model); many-to-one-mapping;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2012.2210876
Filename :
6255765
Link To Document :
بازگشت