DocumentCode :
2535426
Title :
Compression of navigable speech soundfield zones
Author :
Zheng, Xiguang ; Ritz, Christian
Author_Institution :
ICT Res. Inst., Univ. of Wollongong, Wollongong, NSW, Australia
fYear :
2011
fDate :
17-19 Oct. 2011
Firstpage :
1
Lastpage :
6
Abstract :
This paper presents a new coding architecture for the compression of navigable speech soundfield zones. The proposed coding scheme encodes multiple speech soundfields, each representing different spatial zones, into a mono or stereo sound-field mixture signal that can be compressed with an existing speech or audio coder. The resulting compressed signals can be decoded back to individual soundfield zones. Objective and subjective testing results show that the approach successfully compresses up to 3 speech soundfields (each consisting of 4 individual speakers) at a bit rate of 48 kbps whilst maintaining the perceptual quality of each decoded soundfield zone.
Keywords :
speech coding; compressed signals; navigable speech soundfield zones; speech coding architecture; Azimuth; Bit rate; Microphones; Navigation; Speech; Speech coding; Time frequency analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Signal Processing (MMSP), 2011 IEEE 13th International Workshop on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4577-1432-0
Electronic_ISBN :
978-1-4577-1433-7
Type :
conf
DOI :
10.1109/MMSP.2011.6093795
Filename :
6093795
Link To Document :
بازگشت