مرکز منطقه ای اطلاع رساني علوم و فناوري - Robust Learning of 2-D Separable Transforms for Next-Generation Video Coding

DocumentCode :

2944667

Title :

Robust Learning of 2-D Separable Transforms for Next-Generation Video Coding

Author :

Sezer, Osman G. ; Cohen, Robert ; Vetro, Anthony

Author_Institution :

Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA

fYear :

2011

fDate :

29-31 March 2011

Firstpage :

Lastpage :

Abstract :

With the simplicity of its application together with compression efficiency, the Discrete Cosine Transform(DCT) plays a vital role in the development of video compression standards. For next-generation video coding, a new set of 2-D separable transforms has emerged as a candidate to replace the DCT. These separable transforms are learned from residuals of each intra prediction mode, hence termed as Mode dependent-directional transforms (MDDT). MDDT uses the Karhunen-Loeve Transform (KLT) to create sets of separable transforms from training data. Since the residuals after intra prediction have some structural similarities, transforms utilizing these correlations improve coding efficiency. However, the KLT is the optimal approach only if the data has a Gaussian distribution without outliers. Due to the nature of the least-square norm, outliers can arbitrarily affect the directions of the KLT components. In this paper, we will address robust learning of separable transforms by enforcing sparsity on the coefficients of the representations. With this new approach, it is possible to improve upon the video coding performance of H.264/AVC by up to 10.2% BD-rate for intra coding. At no additional cost, the proposed techniques can also provide up to 3.9% improvement in BD-rate for intra coding compared to existing MDDT schemes.

Keywords :

Gaussian distribution; Karhunen-Loeve transforms; data compression; discrete cosine transforms; learning (artificial intelligence); video coding; 2D separable transform; BD-rate; DCT; Gaussian distribution; H.264-AVC; KLT; Karhunen-Loeve transform; MDDT; discrete cosine transform; intra coding; least-square norm; mode dependent-directional transform; next-generation video coding; robust learning; video compression; Automatic voltage control; Cost function; Discrete cosine transforms; Image coding; Robustness; Video coding; mode dependent directional transforms; next generation; orthonormal transforms; robust learning; spare representation; sparse orthonormal transforms; video coding;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Data Compression Conference (DCC), 2011

Conference_Location :

Snowbird, UT

ISSN :

1068-0314

Print_ISBN :

978-1-61284-279-0

Type :

conf

DOI :

10.1109/DCC.2011.14

Filename :

5749464

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2944667