• DocumentCode
    3245170
  • Title

    Analysis of acoustic correlates of British, Australian and American accents

  • Author

    Yan, Qin ; Vaseghi, Saeed ; Rentzos, Dimitrios ; Ho, Ching-Hsiang ; Turajlic, Emir

  • Author_Institution
    Dept. of Electron. & Comput. Eng., Brunel Univ., Uxbridge, UK
  • fYear
    2003
  • fDate
    30 Nov.-3 Dec. 2003
  • Firstpage
    345
  • Lastpage
    350
  • Abstract
    This paper presents an analysis of the acoustic correlates of the differences of British, Australian and American English accents. The structures of the differences that characterise accents in speech can be divided into two parts: (a) phonetic differences; and (b) acoustic differences. The focus of this paper is on the analysis of acoustic correlates of accents including formants and their trajectories, pitch trajectory, pitch accent, pitch nucleus, duration and speaking rate. The acoustics of accents are modelled and estimated using 2D HMMs of formants and a model of pitch such as the rise/fall/connect (RFC) model. The differences between the accents are discussed. The Australian accent has a lower 1st formant (F1) but higher 2nd formant (F2) compared to British and American. The 2nd formant in speech is considered as the most sensitive to accent identity. British speakers have the largest pitch frequency range and the largest initial pitch rise and final pitch fall rates in utterances. Australian accent exhibits significant elongation of vowels and the lowest speaking rate compared to other two accents. The differences in acoustic correlates across accents are used to morph the accent of a source speaker towards a target accent.
  • Keywords
    correlation methods; hidden Markov models; speech processing; 2D HMM; American English accents; Australian accents; British accents; accent acoustic correlates analysis; accent identity; acoustic differences; formant trajectories; phonetic differences; pitch accent; pitch frequency range; pitch nucleus; pitch trajectory; rise/fall/connect pitch model; speaking rate; speech characteristics; speech duration; vowel elongation; Acoustics; Australia; Automatic speech recognition; Frequency; Hidden Markov models; Loudspeakers; Natural languages; Spatial databases; Speech recognition; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
  • Print_ISBN
    0-7803-7980-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2003.1318465
  • Filename
    1318465