• DocumentCode
    2949749
  • Title

    Automated Detection of Transition Segments for Intensity and Time-Scale Modification for Speech Intelligibility Enhancement

  • Author

    Jayan, A.R. ; Pandey, P.C. ; Lehana, P.K.

  • Author_Institution
    Indian Inst. of Technol., Mumbai
  • fYear
    2008
  • fDate
    4-6 Jan. 2008
  • Firstpage
    63
  • Lastpage
    68
  • Abstract
    Spectral transition segments serve as landmarks for the perception of consonants. In "clear speech" mode adopted by speakers to improve intelligibility in difficult communication environments, transition segments are of increased duration and intensity. Modification of conversational speech to have acoustic properties of clear speech has been reported to improve its intelligibility. This paper presents an automated method for locating spectral transition segments in speech, and to produce natural quality resynthesized speech with intensity and time-scale modified spectral transition segments. The boundaries of spectral transition segments are located using an index derived from the rate of variation of energy and centroid frequency in five non-overlapping spectral bands. Time-scale modification is performed using harmonic plus noise model (HNM) based analysis-synthesis. The overall speech duration is kept unaltered by appropriately compressing the steady state segments. Transition segments are intensity scaled by 6 dB. The effectiveness of the method was evaluated by conducting listening tests on normal hearing subjects using VCV syllables as the test material.
  • Keywords
    speech enhancement; speech intelligibility; speech recognition; clear speech; harmonic plus noise model; listening tests; spectral bands; spectral transition segments; speech duration; speech intelligibility enhancement; steady state segments; time-scale modification; transition segment detection; Auditory system; Envelope detectors; Frequency; Humans; Loudspeakers; Materials testing; Signal processing; Speech analysis; Speech enhancement; Speech processing; CVR modification; Clear speech; Harmonic plus noise model; Time-scale modification; Transition segment detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing, Communications and Networking, 2008. ICSCN '08. International Conference on
  • Conference_Location
    Chennai
  • Print_ISBN
    978-1-4244-1924-1
  • Electronic_ISBN
    978-1-4244-1924-1
  • Type

    conf

  • DOI
    10.1109/ICSCN.2008.4447162
  • Filename
    4447162