DocumentCode
706294
Title
Cross-entropic comparison of the effects of accent, speaker and database recording on spectral features of English accents
Author
Ghorshi, Seyed ; Vaseghi, Saeed ; Yan, Qin
Author_Institution
Sch. of Eng. & Design, Brunel Univ., Uxbridge, UK
fYear
2007
fDate
3-7 Sept. 2007
Firstpage
2365
Lastpage
2369
Abstract
This paper investigates the use of the cross-entropy information measure for quantifying and comparing the impact of variations in accents, speaker groups and recordings on the probability models of spectral features of phonetic units of speech. The cross-entropy measure can be used in applications such as accent identification, improved speech recognition, cross-accent phonetic-tree analysis and analysis of the influence of accents on different sets of speech parameters and models. For the purpose of this study the focus is on British English, Australian English and two different databases of American English accents (namely WSJ and TIMIT). Comparison of the cross entropies of formant and cepstrum features indicates that cepstrum features are less indicative of accents than formants. In particular, it appears that the measurements of differences in formants across accents are less sensitive to differences in recordings or databases. It is found that the cross entropies of the same phonemes across different accents (inter-accent distances) are significantly greater than the cross entropies of the same phonemes across different speaker groups of the same accent (intra-accent distances). The cross-entropy measure is also used to construct cross-accent phonetic trees, which serve to show the structural similarities and differences of the phonetic systems across accents.
Keywords
entropy; feature extraction; natural language processing; probability; speech processing; trees (mathematics); American English accents; Australian English; British English; TIMIT database; WSJ database; accent identification; accent recording; accents influence analysis; cepstrum features; cross-accent phonetic-tree analysis; cross-entropic comparison; cross-entropy information measure; database recording; inter accent distances; intra accent distances; probability models; speaker recording; spectral features; speech models; speech parameters; speech phonetic units; speech recognition; Cepstrum; Databases; Entropy; Hidden Markov models; Signal processing; Speech; accent; cepstrum; cross entropy; formant; phonetic-tree clustering
fLanguage
English
Publisher
IEEE
Conference_Titel
Signal Processing Conference, 2007 15th European
Conference_Location
Poznan
Print_ISBN
978-839-2134-04-6
Type
conf
Filename
7099231