DocumentCode :
1459969
Title :
Multi-core platform for an efficient H.264 and VC-1 video decoding based on macroblock row-level parallelism
Author :
Lee, Joon-Yeong ; Lee, Jae-Jun ; Park, Sung Min
Volume :
4
Issue :
2
fYear :
2010
fDate :
3/1/2010 12:00:00 AM
Firstpage :
147
Lastpage :
158
Abstract :
In order for the video decoding processing such as H.264 and VC-1 to be effective in multi-core environments, several kinds of parallelisms must be utilised. Here, a novel parallelisation methodology, macroblock row-level parallelism (MBRLP), of video decoding is presented. The ETRI multimedia processing core (EMC) and the ETRI multi-core platform (EMP) are proposed for adopting MBRLP. In terms of the scalability and utilisation of processing cores, MBRLP has advantages over other parallelisation strategies such as frame, slice and macroblock (MB)-level parallelism. The scalability can be easily achieved by just increasing the number of processing cores and applying homogeneous software design/optimisation techniques to each EMC. Instead of employing a dynamic MB-level scheduler, a hybrid approach is used, which is a two-stage functional pipelining combined with MBRLP. The hybrid approach of combining MBRLP and de-blocking pipelining can relieve the synchronisation and inter-processor communication overheads incurred by multicore decoding systems as well as run-time scheduler´s overheads. As a result, the proposed parallelisation method and architectures can boost the performance with the efficiency of 83%. The proposed architecture consisting of six EMC clusters has the capability to process Dl (720 ?? 480) 30 fps real-time decoding at around 200 MHz. The same concept can be applied to full-HD (1920 ?? 1088) video decoding in this work. It can be found that as the number of processing cores increase, the performance improvement is enhanced almost linearly. The EMP consisting of four EMC clusters (eight cores), memories and other peripherals are prototyped on Xilinx Virtex4 XC4VL200 FPGA which is operating at 60 MHz.
Keywords :
decoding; image coding; multimedia computing; multiprocessing systems; parallel processing; scheduling; EMC clusters; ETRI multicore platform; ETRI multimedia processing core; H.264 video decoding; VC-1 video decoding; Xilinx Virtex4 XC4VL200 FPGA; deblocking pipelining; dynamic MB-level scheduler; homogeneous software design; inter-processor communication; macroblock row-level parallelism; optimisation techniques; parallelisation methodology; run-time scheduler; two-stage functional pipelining;
fLanguage :
English
Journal_Title :
Circuits, Devices & Systems, IET
Publisher :
iet
ISSN :
1751-858X
Type :
jour
DOI :
10.1049/iet-cds.2009.0038
Filename :
5441030
Link To Document :
بازگشت