• DocumentCode
    167136
  • Title

    Is phoneme length and phoneme energy useful in automatic speaker recognition?

  • Author

    Igras, Magdalena ; Ziolko, Bartosz ; Ziolko, Mariusz

  • Author_Institution
    Dept. of Electron., AGH Univ. of Sci. & Technol., Kraków, Poland
  • fYear
    2014
  • fDate
    11-13 April 2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    The paper presents analysis of prosodic parameters of speech (energy, phoneme duration) as features characteristic for speaker. The most significant parameters of the features were investigated using CORPORA speech database and described statistically. We observed that phoneme duration depends on a speaker, as well as the preboundary lengthening of the phonemes in sentences. An average phoneme energy and an amount of energy per time are speaker-specific values also. These features can be used as complementary to a standard feature vectors of time-energy distributions for speaker recognition systems. The results of the investigation can be also applied to speech modeling for automatic speech recognition.
  • Keywords
    speaker recognition; speech processing; statistical analysis; CORPORA speech database; automatic speaker recognition; average phoneme energy; phoneme duration; phoneme length; phoneme preboundary lengthening; speaker feature characteristic; speaker-specific values; speech modeling; speech prosodic parameters; standard feature vectors; statistical analysis; time-energy distributions; Databases; Educational institutions; Speaker recognition; Speech; Speech recognition; Standards; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pacific Voice Conference (PVC), 2014 XXII Annual
  • Conference_Location
    Krakow
  • Print_ISBN
    978-1-4799-3699-1
  • Type

    conf

  • DOI
    10.1109/PVC.2014.6845416
  • Filename
    6845416