Regional Information Center for Science and Technology - Compression of navigable speech soundfield zones

DocumentCode :

2535426

Title :

Compression of navigable speech soundfield zones

Author :

Zheng, Xiguang ; Ritz, Christian

Author_Institution :

ICT Res. Inst., Univ. of Wollongong, Wollongong, NSW, Australia

fYear :

2011

fDate :

17-19 Oct. 2011

Firstpage :

Lastpage :

Abstract :

This paper presents a new coding architecture for the compression of navigable speech soundfield zones. The proposed coding scheme encodes multiple speech soundfields, each representing a different spatial zone, into a mono or stereo soundfield mixture signal that can be compressed with an existing speech or audio coder. The resulting compressed signals can be decoded back to individual soundfield zones. Objective and subjective testing results show that the approach successfully compresses up to 3 speech soundfields (each consisting of 4 individual speakers) at a bit rate of 48 kbps whilst maintaining the perceptual quality of each decoded soundfield zone.
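
To put the reported figures in perspective, the following is a back-of-the-envelope sketch only. It assumes that 48 kbps is the total rate of the compressed mixture signal and that the bits are attributed evenly across zones and speakers; the abstract states neither, and the function name `per_unit_rates` is hypothetical.

```python
def per_unit_rates(total_kbps: float, zones: int, speakers_per_zone: int) -> dict:
    """Evenly attribute a total bit rate to zones and speakers.

    Illustrative arithmetic only: an even split is an assumption,
    not something the abstract claims about the coder.
    """
    if zones <= 0 or speakers_per_zone <= 0:
        raise ValueError("zones and speakers_per_zone must be positive")
    per_zone = total_kbps / zones
    per_speaker = per_zone / speakers_per_zone
    return {
        "speakers": zones * speakers_per_zone,
        "per_zone_kbps": per_zone,
        "per_speaker_kbps": per_speaker,
    }


# Figures from the abstract: 3 soundfields, 4 speakers each, 48 kbps.
rates = per_unit_rates(48, 3, 4)
print(rates)
```

Under these assumptions the figures correspond to 12 speakers in total, roughly 16 kbps per soundfield zone and 4 kbps per speaker.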

Keywords :

speech coding; compressed signals; navigable speech soundfield zones; speech coding architecture; Azimuth; Bit rate; Microphones; Navigation; Speech; Speech coding; Time frequency analysis;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Multimedia Signal Processing (MMSP), 2011 IEEE 13th International Workshop on

Conference_Location :

Hangzhou

Print_ISBN :

978-1-4577-1432-0

Electronic_ISBN :

978-1-4577-1433-7

Type :

conf

DOI :

10.1109/MMSP.2011.6093795

Filename :

6093795

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2535426