DocumentCode :
1936829
Title :
A fine-grained parallel implementation of a H.264/AVC encoder on a 167-processor computational platform
Author :
Xiao, Zhibin ; Le, Stephen ; Baas, Bevan
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, Davis, CA, USA
fYear :
2011
fDate :
6-9 Nov. 2011
Firstpage :
2067
Lastpage :
2071
Abstract :
The emerging many-core architecture provides a flexible solution for the rapid evolving multimedia applications demanding both high performance and high energy-efficiency. However, developing parallel multimedia applications that can efficiently harness and utilize many-core architectures is the key challenge for scalable computing. We contribute to this challenge by presenting a fully-parallel H.264/AVC baseline encoder on a 167-core asynchronous array of simple processors (AsAP) computation platform. By exploiting fine-grained data and task level parallelism in the algorithms, we partition and map the dataflow of the H.264/AVC encoder to an array of 115 small processors coupled with two shared memories and a hardware accelerator for motion estimation. The proposed parallel H.264/AVC encoder is capable of encoding video sequences with variable frame sizes. The encoder presented is capable of encoding VGA (640 × 480) video at 21 frames per second (fps) with 931 mW average power consumption by adjusting each processor to workload-based optimal clock frequencies and dual supply voltages with less than 1dB loss in resolution.
Keywords :
image sequences; motion estimation; multimedia computing; multiprocessing systems; parallel processing; shared memory systems; video coding; 167-core asynchronous array of simple processors; 167-processor computational platform; AsAP computation platform; VGA video; dual supply voltages; encoder fine-grained parallel implementation; energy-efficiency; fine-grained data; fully-parallel H.264/AVC baseline encoder; hardware accelerator; many-core architecture; motion estimation; parallel multimedia applications; power 931 mW; power consumption; scalable computing; shared memories; task level parallelism; video sequence encoding; workload-based optimal clock frequencies; Encoding; Memory management; Motion estimation; Program processors; Streaming media; Transforms; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signals, Systems and Computers (ASILOMAR), 2011 Conference Record of the Forty Fifth Asilomar Conference on
Conference_Location :
Pacific Grove, CA
ISSN :
1058-6393
Print_ISBN :
978-1-4673-0321-7
Type :
conf
DOI :
10.1109/ACSSC.2011.6190391
Filename :
6190391
Link To Document :
بازگشت