Title :
Efficient spectrum coding for super-wideband speech and its application to 7/10/15 kHz bandwidth scalable coders
Author :
Oshikiri, Masahiro ; Ehara, Hiroyuki ; Yoshida, Koji
Author_Institution :
Next-Generation Mobile Commun. Dev. Center, Matsushita Electr. Ind. Co. Ltd, Yokosuka, Japan
Abstract :
The paper presents an efficient spectrum coding method for super-wideband (beyond 7 kHz, e.g. 10 kHz or 15 kHz bandwidth) speech signals based on a bandwidth expansion technique. Starting from a 7 kHz bandwidth speech signal, the expansion technique generates the frequency band above 7 kHz without violating the harmonic structure of the speech signal. The bandwidth expansion is performed by pitch filtering in the frequency domain: the 7 kHz bandwidth spectrum is used as the pitch filter state, and pitch filtering is performed toward the frequency band above 7 kHz. We adopted this pitch filtering based spectrum coding (PFSC) in our proposed 7/10/15 kHz bandwidth scalable coder. The scalable coder consists of an existing standard wideband coder as the base layer and two PFSC coders as enhancement layers. One PFSC coder encodes the 7-10 kHz band spectrum at 4.4 kbit/s and the other encodes the 10-15 kHz band spectrum at 2.2 kbit/s. When the AMR-WB coder at 15.85 kbit/s is used as the base layer, the total bitrate of the scalable coder is 22.45 kbit/s (15.85 + 4.4 + 2.2) and the total algorithmic delay is 30 ms. We conducted degradation category rating (DCR) tests for both 10 kHz and 15 kHz bandwidth signals. The results show that the DMOS score of the proposed coder is better than that of the 7 kHz bandwidth original signals under clean speech conditions at both bandwidths. In addition, when G.722 at 56 kbit/s is used as the base layer instead of the AMR-WB coder, the DMOS score of the scalable coder is close to that of the 7 kHz bandwidth original signals under audio conditions at both bandwidths.
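The frequency-domain pitch filtering described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single-tap filter, a real-valued spectrum (for example transform coefficients), and an exhaustive lag search with one least-squares gain per band. The function names `extend_by_pitch_filter` and `search_lag` and their parameters are hypothetical.

```python
import numpy as np

def extend_by_pitch_filter(low_band, n_high, lag):
    """Generate n_high bins above low_band by single-tap spectral pitch filtering.

    Assumption: each new bin copies the bin `lag` positions below it, so the
    low-band spectrum acts as the filter state and its harmonic spacing
    carries over into the generated band.
    """
    low_band = np.asarray(low_band, dtype=float)
    n_low = len(low_band)
    if not 0 < lag <= n_low:
        raise ValueError("lag must be between 1 and len(low_band)")
    spec = np.concatenate([low_band, np.zeros(n_high)])
    for k in range(n_low, n_low + n_high):
        spec[k] = spec[k - lag]  # recursive: may reuse bins generated earlier
    return spec[n_low:]

def search_lag(low_band, target_high, lags):
    """Return the (lag, gain) pair that best matches the target high band.

    Assumption: squared-error criterion with one scalar gain per band.
    """
    target_high = np.asarray(target_high, dtype=float)
    best_lag, best_gain, best_err = None, 0.0, np.inf
    for lag in lags:
        cand = extend_by_pitch_filter(low_band, len(target_high), lag)
        energy = float(np.dot(cand, cand))
        if energy == 0.0:
            continue  # an all-zero candidate cannot be scaled to fit
        gain = float(np.dot(target_high, cand)) / energy
        err = float(np.sum((target_high - gain * cand) ** 2))
        if err < best_err:
            best_lag, best_gain, best_err = lag, gain, err
    if best_lag is None:
        raise ValueError("no usable lag in the search range")
    return best_lag, best_gain
```

In a coder built along these lines, the encoder would transmit only the chosen lag and gain for each enhancement band, and the decoder would call `extend_by_pitch_filter` on the decoded 7 kHz spectrum and scale the result by the gain.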
Keywords :
audio coding; filtering theory; frequency-domain analysis; harmonics; speech coding; vocoders; 4.4 to 56 kbit/s; 7 to 15 kHz; AMR-WB coder; DMOS score; G.722; algorithmic delay; audio coding technique; bandwidth expansion; bandwidth scalable coders; degradation category rating tests; frequency domain; pitch filtering based spectrum coding; super-wideband speech coding; Bandwidth; Bit rate; Code standards; Delay; Filtering; Frequency domain analysis; Power harmonic filters; Signal generators; Speech coding; Wideband;
Conference_Title :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326027