• DocumentCode
    987423
  • Title

    A Method for Automatic Detection of Vocal Fry

  • Author

    Ishi, Carlos Toshinori ; Sakakibara, Ken-Ichi ; Ishiguro, Hiroshi ; Hagita, Norihiro

  • Author_Institution
    Adv. Telecommun. Res. Inst. Int., Intell. Robot. & Commun. Labs., Kyoto
  • Volume
    16
  • Issue
    1
  • fYear
    2008
  • Firstpage
    47
  • Lastpage
    56
  • Abstract
    Vocal fry (also called creak, creaky voice, and pulse register phonation) is a voice quality that carries important linguistic or paralinguistic information, depending on the language. We propose a set of acoustic measures and a method for automatically detecting vocal fry segments in speech utterances. A glottal pulse-synchronized method is proposed to deal with the very low fundamental frequency properties of vocal fry segments, which cause problems in the classic short-term analysis methods. The proposed acoustic measures characterize power, aperiodicity, and similarity properties of vocal fry signals. The basic idea of the proposed method is to scan for local power peaks in a "very short-term" power contour for obtaining glottal pulse candidates, check for periodicity properties, and evaluate a similarity measure between neighboring glottal pulse candidates for deciding the possibility of being vocal fry pulses. In the periodicity analysis, autocorrelation peak properties are taken into account for avoiding misdetection of periodicity in vocal fry segments. Evaluation of the proposed acoustic measures in the automatic detection resulted in 74% correct detection, with an insertion error rate of 13%.
  • Keywords
    acoustic correlation; acoustic signal detection; speech processing; synchronisation; voice communication; acoustic measurement; autocorrelation peak properties; glottal pulse-synchronized method; paralinguistic information; speech utterances; vocal fry automatic detection; voice quality; Acoustic measurements; Acoustic pulses; Acoustic signal detection; Autocorrelation; Damping; Frequency; Intelligent robots; Power measurement; Pulse measurements; Speech analysis; Automatic detection; creaky voice; vocal fry
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2007.910791
  • Filename
    4389059
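
The pipeline outlined in the abstract (local peaks in a very short-term power contour as glottal pulse candidates, then a similarity check between neighboring candidates) can be illustrated with a minimal sketch. This is not the authors' implementation: every function name, the frame and hop sizes, the dB rise threshold, and the use of normalized cross-correlation as the similarity measure are assumptions chosen for illustration, and the autocorrelation-based periodicity check mentioned in the abstract is omitted.

```python
import math


def short_term_power(signal, frame_len, hop):
    """Power contour in dB over very short frames (assumed sizes)."""
    powers = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        mean_sq = sum(x * x for x in frame) / frame_len
        # Small floor avoids log10(0) on silent frames.
        powers.append(10.0 * math.log10(mean_sq + 1e-12))
    return powers


def power_peaks(powers, span, min_rise_db):
    """Local maxima rising at least min_rise_db above both sides."""
    peaks = []
    for i in range(1, len(powers) - 1):
        if powers[i] < powers[i - 1] or powers[i] <= powers[i + 1]:
            continue
        left = min(powers[max(0, i - span):i])
        right = min(powers[i + 1:i + 1 + span])
        if powers[i] - left >= min_rise_db and powers[i] - right >= min_rise_db:
            peaks.append(i)
    return peaks


def pulse_similarity(signal, c1, c2, half_width):
    """Normalized cross-correlation of windows centered on c1 and c2."""
    if c1 - half_width < 0 or c2 + half_width + 1 > len(signal):
        return 0.0
    a = signal[c1 - half_width:c1 + half_width + 1]
    b = signal[c2 - half_width:c2 + half_width + 1]
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return num / den if den > 0.0 else 0.0


def detect_fry_candidates(signal, frame_len=20, hop=10, span=5,
                          min_rise_db=6.0, half_width=15,
                          min_similarity=0.8):
    """Return pairs of neighboring pulse centers that look alike.

    All default values are illustrative assumptions, not values
    taken from the paper.
    """
    powers = short_term_power(signal, frame_len, hop)
    peaks = power_peaks(powers, span, min_rise_db)
    # Map frame indices back to sample positions (frame centers).
    centers = [p * hop + frame_len // 2 for p in peaks]
    pairs = []
    for c1, c2 in zip(centers, centers[1:]):
        if pulse_similarity(signal, c1, c2, half_width) >= min_similarity:
            pairs.append((c1, c2))
    return pairs
```

Calling `detect_fry_candidates(samples)` on a list of float samples returns pairs of sample positions for neighboring pulse candidates that pass the similarity threshold. A practical detector would also need the periodicity analysis and tuned thresholds described in the paper itself.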