DocumentCode
2949749
Title
Automated Detection of Transition Segments for Intensity and Time-Scale Modification for Speech Intelligibility Enhancement
Author
Jayan, A.R. ; Pandey, P.C. ; Lehana, P.K.
Author_Institution
Indian Inst. of Technol., Mumbai
fYear
2008
fDate
4-6 Jan. 2008
Firstpage
63
Lastpage
68
Abstract
Spectral transition segments serve as landmarks for the perception of consonants. In "clear speech" mode adopted by speakers to improve intelligibility in difficult communication environments, transition segments are of increased duration and intensity. Modification of conversational speech to have acoustic properties of clear speech has been reported to improve its intelligibility. This paper presents an automated method for locating spectral transition segments in speech, and to produce natural quality resynthesized speech with intensity and time-scale modified spectral transition segments. The boundaries of spectral transition segments are located using an index derived from the rate of variation of energy and centroid frequency in five non-overlapping spectral bands. Time-scale modification is performed using harmonic plus noise model (HNM) based analysis-synthesis. The overall speech duration is kept unaltered by appropriately compressing the steady state segments. Transition segments are intensity scaled by 6 dB. The effectiveness of the method was evaluated by conducting listening tests on normal hearing subjects using VCV syllables as the test material.
Keywords
speech enhancement; speech intelligibility; speech recognition; clear speech; harmonic plus noise model; listening tests; spectral bands; spectral transition segments; speech duration; speech intelligibility enhancement; steady state segments; time-scale modification; transition segment detection; Auditory system; Envelope detectors; Frequency; Humans; Loudspeakers; Materials testing; Signal processing; Speech analysis; Speech enhancement; Speech processing; CVR modification; Clear speech; Harmonic plus noise model; Time-scale modification; Transition segment detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing, Communications and Networking, 2008. ICSCN '08. International Conference on
Conference_Location
Chennai
Print_ISBN
978-1-4244-1924-1
Electronic_ISBN
978-1-4244-1924-1
Type
conf
DOI
10.1109/ICSCN.2008.4447162
Filename
4447162
Link To Document