Title :
Cross-entropic comparison of the effects of accent, speaker and database recording on spectral features of English accents
Author :
Ghorshi, Seyed ; Vaseghi, Saeed ; Qin Yan
Author_Institution :
Sch. of Eng. & Design, Brunel Univ., Uxbridge, UK
Abstract :
This paper investigates the use of cross-entropy information measure for quantification and comparison of the impact of the variations of accents, speaker groups and recordings on the probability models of spectral features of phonetic units of speech. Cross-entropy measure can be used in applications such as accent identification, improved speech recognition, cross-accent phonetic-tree analysis and analysis of the influence of accents on different sets of speech parameters and models. For the purpose of this study the focus is on British English, Australian English and two different databases of American English accents (namely WSJ and TIMIT). Comparison of the cross entropies of formants and cepstrum features indicate that cepstrum features are less indicative of accents compared to formants. In particular it appears that the measurements of differences in formants across accents are less sensitive to different recording or databases. It is found that the cross entropies of the same phonemes across different accents (inter-accent distances) are significantly greater than the cross entropies of the same phonemes across different speaker groups of the same accent (intra-accent distances). The cross entropy measure is also used to construct cross-accent phonetic trees, which serve to show the structural similarities and differences of the phonetic systems across accents.
Keywords :
entropy; feature extraction; natural language processing; probability; speech processing; trees (mathematics); American English accents; Australian English; British English; TIMIT database; WSJ database; accent identification; accent recording; accents influence analysis; cepstrum features; cross-accent phonetic-tree analysis; cross-entropic comparison; cross-entropy information measure; database recording; inter accent distances; intra accent distances; probability models; speaker recording; spectral features; speech models; speech parameters; speech phonetic units; speech recognition; Cepstrum; Databases; Entropy; Hidden Markov models; Signal processing; Speech; accent; cepstrum; cross entropy; formant; phonetic-tree clustering;
Conference_Titel :
Signal Processing Conference, 2007 15th European
Conference_Location :
Poznan
Print_ISBN :
978-839-2134-04-6