• DocumentCode
    47570
  • Title

    Epoch Extraction Based on Integrated Linear Prediction Residual Using Plosion Index

  • Author

    Prathosh, A.P. ; Ananthapadmanabha, T.V. ; Ramakrishnan, A.G.

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
  • Volume
    21
  • Issue
    12
  • fYear
    2013
  • fDate
    Dec. 2013
  • Firstpage
    2471
  • Lastpage
    2480
  • Abstract
    Epoch is defined as the instant of significant excitation within a pitch period of voiced speech. Epoch extraction continues to attract the interest of researchers because of its significance in speech analysis. Existing high performance epoch extraction algorithms require either dynamic programming techniques or a priori information of the average pitch period. An algorithm without such requirements is proposed based on integrated linear prediction residual (ILPR) which resembles the voice source signal. Half wave rectified and negated ILPR (or Hilbert transform of ILPR) is used as the pre-processed signal. A new non-linear temporal measure named the plosion index (PI) has been proposed for detecting `transients´ in speech signal. An extension of PI, called the dynamic plosion index (DPI) is applied on pre-processed signal to estimate the epochs. The proposed DPI algorithm is validated using six large databases which provide simultaneous EGG recordings. Creaky and singing voice samples are also analyzed. The algorithm has been tested for its robustness in the presence of additive white and babble noise and on simulated telephone quality speech. The performance of the DPI algorithm is found to be comparable or better than five state-of-the-art techniques for the experiments considered.
  • Keywords
    Hilbert transforms; speech processing; EGG recordings; Hilbert transform; dynamic plosion index; dynamic programming techniques; epoch extraction; integrated linear prediction residual; nonlinear temporal measure; singing voice samples; speech analysis; speech signal; telephone quality speech; voice source signal; voiced speech; Heuristic algorithms; Linear systems; Predictive analysis; Transient analysis; Epoch extraction; GCI detection; glottal closure instant; integrated linear prediction residual; plosion index;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2273717
  • Filename
    6562799