• DocumentCode
    2789853
  • Title

    Advancements in whisper-island detection using the linear predictive residual

  • Author

    Zhang, Chi ; Hansen, John H L

  • Author_Institution
    Center for Robust Speech Syst.(CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    5170
  • Lastpage
    5173
  • Abstract
    In this study, we consider the use of a new entropy-based feature extracted from linear predictive residual for whisper-island detection within normally phonated audio streams. The proposed feature, which is sensitive to vocal effort changes between whisper and neutral speech, is integrated within a BIC/T2-BIC segmentation for vocal effort change point(VECP) detection and utilized for vocal effort classification. Evaluation is based on the proposed multi-error score(MES), where the improved feature is shown to improve performance in VECP detection with the lowest MES of 20.70. Furthermore, more accurate whisper-island detection was obtained using the proposed feature and algorithm. Finally, the experimental detection rate results of 97.37% represents the best whisper-island detection performance available in the literature to date.
  • Keywords
    audio signal processing; feature extraction; maximum entropy methods; speech recognition; entropy-based feature extraction; linear predictive residual; multierror score; neutral speech; phonated audio streams; vocal effort change point detection; whisper speech; whisper-island detection; Change detection algorithms; Computer science; Feature extraction; Information retrieval; Robustness; Signal analysis; Signal processing; Speech analysis; Speech processing; Streaming media; Τ2-BIC; detection; feature; segmentation; vocal effort; whisper;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495022
  • Filename
    5495022