DocumentCode
310680
Title
A new 2-kbit/s speech coder based on normalized pitch waveform
Author
Hiwasaki, Yusuke ; Mano, Kazunori
Author_Institution
NTT Human Interface Labs., Tokyo, Japan
Volume
2
fYear
1997
fDate
21-24 Apr 1997
Firstpage
1583
Abstract
Speech coding at very low bit rates is useful for applications such as voice communication over computer networks. However, it is difficult for CELP coders to maintain high quality at around 2.0 kbit/s. This paper presents a speech coding model called 'normalized pitch waveform' and its quantization scheme, aimed at efficient compression of 'voiced' speech. Listening tests show that efficient, high-quality coding is achieved at 2.0 kbit/s, less than half the bit rate of FS1016. The paper also discusses the disadvantage of the normalized pitch waveform and presents an alternative method that uses a non-normalized pitch waveform. Encoding of a transitional 'mixed' state between the 'voiced' and 'unvoiced' states is discussed as a further improvement.
Keywords
linear predictive coding; quantisation (signal); speech coding; vocoders; voice communication; 2 kbit/s; LPC parameters; computer networks; effective compression coding; high quality coding; listening tests; non-normalized pitch waveform; normalized pitch waveform; quantization scheme; speech coder; speech coding; transitional mixed state; unvoiced speech; very low bit-rate coding; voice communication; voiced speech; Bit rate; Computer networks; Filters; Humans; Interpolation; Linear predictive coding; Quantization; Signal processing; Speech analysis; Speech coding;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location
Munich
ISSN
1520-6149
Print_ISBN
0-8186-7919-0
Type
conf
DOI
10.1109/ICASSP.1997.596255
Filename
596255