DocumentCode
2961937
Title
High-quality digital speech at 4 kb/s
Author
Granzow, Wolfgang ; Atal, Bishnu S.
Author_Institution
AT&T Bell Lab., Murray Hill, NJ, USA
fYear
1990
fDate
2-5 Dec 1990
Firstpage
941
Abstract
A speech coder based on a single-pulse excitation code-excited linear predictive coding (SPE-CELP) model of linear-predictive coding (LPC) is proposed. An algorithm for determining the time instants of pitch periods within a short interval of periodic speech, which results in a time sequence of marker points that indicate the beginning of the pitch periods in the analyzed speech interval, is described. The LPC excitation is generated by a stochastic codebook for nonperiodic speech and by a single pulse per pitch period for periodic speech. The proper alignment of the excitation pulse is efficiently computed using dynamic programming. It is concluded that, at overall bit rates of around 3 kb/s, the coder produces significantly better speech quality than LPC10E, though the synthesized speech still sounds slightly buzzy for certain speakers
Keywords
dynamic programming; encoding; speech analysis and processing; speech synthesis; 4 kbit/s; SPE-CELP model; algorithm; dynamic programming; excitation pulse alignment; high-quality digital speech; nonperiodic speech; periodic speech; pitch periods; single pulse; single-pulse excitation code-excited linear predictive coding; speech coder; stochastic codebook; time instants; Algorithm design and analysis; Bit rate; Dynamic programming; Linear predictive coding; Predictive models; Pulse generation; Speech analysis; Speech coding; Speech synthesis; Stochastic processes;
fLanguage
English
Publisher
ieee
Conference_Titel
Global Telecommunications Conference, 1990, and Exhibition. 'Communications: Connecting the Future', GLOBECOM '90., IEEE
Conference_Location
San Diego, CA
Print_ISBN
0-87942-632-2
Type
conf
DOI
10.1109/GLOCOM.1990.116641
Filename
116641
Link To Document