Cross-entropic comparison of the effects of accent, speaker and database recording on spectral features of English accents

Author

Ghorshi, Seyed; Vaseghi, Saeed; Yan, Qin

Author_Institution

Sch. of Eng. & Design, Brunel Univ., Uxbridge, UK

fYear

2007

fDate

3-7 Sept. 2007

Firstpage

2365

Lastpage

2369

Abstract

This paper investigates the use of the cross-entropy information measure to quantify and compare the impact of variations in accent, speaker group and recording on the probability models of spectral features of phonetic units of speech. The cross-entropy measure can be used in applications such as accent identification, improved speech recognition, cross-accent phonetic-tree analysis and analysis of the influence of accents on different sets of speech parameters and models. This study focuses on British English, Australian English and two different databases of American English accents (namely WSJ and TIMIT). Comparison of the cross entropies of formant and cepstrum features indicates that cepstrum features are less indicative of accent than formants. In particular, it appears that measurements of formant differences across accents are less sensitive to differences in recording or database. It is found that the cross entropies of the same phonemes across different accents (inter-accent distances) are significantly greater than the cross entropies of the same phonemes across different speaker groups of the same accent (intra-accent distances). The cross-entropy measure is also used to construct cross-accent phonetic trees, which show the structural similarities and differences of the phonetic systems across accents.
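
To make the notion of a cross-entropy distance between probability models concrete, the following is a minimal sketch only. It assumes each phoneme feature (for example a single formant) is modelled by one univariate Gaussian, which is a simplification chosen here for illustration and not the model described in the abstract. The function names and all numeric values are hypothetical, not results from the paper.

```python
import math


def gaussian_cross_entropy(mu_p, var_p, mu_q, var_q):
    """Cross entropy H(p, q) in nats between univariate Gaussians p and q.

    Closed form: 0.5*ln(2*pi*var_q) + (var_p + (mu_p - mu_q)**2) / (2*var_q).
    """
    return 0.5 * math.log(2 * math.pi * var_q) + (
        var_p + (mu_p - mu_q) ** 2
    ) / (2 * var_q)


def symmetric_distance(mu_p, var_p, mu_q, var_q):
    """Symmetrised divergence built from cross entropies.

    Each term H(p, q) - H(p, p) is the KL divergence from p to q, so the
    sum is zero for identical models and grows as the models move apart.
    This symmetrisation is an assumption of the sketch.
    """
    kl_pq = gaussian_cross_entropy(mu_p, var_p, mu_q, var_q) - gaussian_cross_entropy(
        mu_p, var_p, mu_p, var_p
    )
    kl_qp = gaussian_cross_entropy(mu_q, var_q, mu_p, var_p) - gaussian_cross_entropy(
        mu_q, var_q, mu_q, var_q
    )
    return kl_pq + kl_qp


if __name__ == "__main__":
    # Hypothetical (mean Hz, variance Hz^2) pairs for one formant of one vowel.
    # These numbers are made up for illustration only.
    accent_a_group_1 = (700.0, 60.0 ** 2)
    accent_a_group_2 = (710.0, 62.0 ** 2)
    accent_b_group_1 = (780.0, 58.0 ** 2)

    intra = symmetric_distance(*accent_a_group_1, *accent_a_group_2)
    inter = symmetric_distance(*accent_a_group_1, *accent_b_group_1)
    print(f"intra-accent distance: {intra:.4f}")
    print(f"inter-accent distance: {inter:.4f}")
```

With a distance of this kind, an inter-accent versus intra-accent comparison amounts to computing it for model pairs drawn from different accents and from different speaker groups of the same accent, then comparing the two sets of values.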

Keywords

entropy; feature extraction; natural language processing; probability; speech processing; trees (mathematics); American English accents; Australian English; British English; TIMIT database; WSJ database; accent identification; accent recording; accents influence analysis; cepstrum features; cross-accent phonetic-tree analysis; cross-entropic comparison; cross-entropy information measure; database recording; inter accent distances; intra accent distances; probability models; speaker recording; spectral features; speech models; speech parameters; speech phonetic units; speech recognition; Cepstrum; Databases; Entropy; Hidden Markov models; Signal processing; Speech; accent; cepstrum; cross entropy; formant; phonetic-tree clustering;

fLanguage

English

Publisher

IEEE

Conference_Titel

Signal Processing Conference, 2007 15th European

Conference_Location

Poznan

Print_ISBN

978-839-2134-04-6

Type

conf

Filename

7099231