Title :
Automatic region-of-interest detection and prioritisation for visually optimised coding of low bit rate videos
Author :
Himawan, Ivan ; Wei Song ; Tjondronegoro, Dian
Author_Institution :
Sci. & Eng. Fac, Queensland Univ. of Technol., Brisbane, QLD, Australia
Abstract :
The increasing popularity of video consumption from mobile devices requires an effective video coding strategy. To overcome diverse communication networks, video services often need to maintain sustainable quality when the available bandwidth is limited. One of the strategy for a visually-optimised video adaptation is by implementing a region-of-interest (ROI) based scalability, whereby important regions can be encoded at a higher quality while maintaining sufficient quality for the rest of the frame. The result is an improved perceived quality at the same bit rate as normal encoding, which is particularly obvious at the range of lower bit rate. However, because of the difficulties of predicting region-of-interest (ROI) accurately, there is a limited research and development of ROI-based video coding for general videos. In this paper, the phase spectrum quaternion of Fourier Transform (PQFT) method is adopted to determine the ROI. To improve the results of ROI detection, the saliency map from the PQFT is augmented with maps created from high level knowledge of factors that are known to attract human attention. Hence, maps that locate faces and emphasise the centre of the screen are used in combination with the saliency map to determine the ROI. The contribution of this paper lies on the automatic ROI detection technique for coding a low bit rate videos which include the ROI prioritisation technique to give different level of encoding qualities for multiple ROIs, and the evaluation of the proposed automatic ROI detection that is shown to have a close performance to human ROI, based on the eye fixation data.
Keywords :
Fourier transforms; mobile computing; mobile handsets; object detection; video coding; PQFT; ROI prioritisation technique; ROI-based video coding; automatic region-of-interest detection; automatic region-of-interest prioritisation; diverse communication networks; eye fixation data; human ROI; low bit rate videos; mobile devices; phase spectrum quaternion of Fourier transform method; region-of-interest based scalability; sustainable quality; video coding strategy; video consumption; video services; visually optimised coding; visually-optimised video adaptation; Bit rate; Encoding; Fourier transforms; Humans; Quaternions; Video coding; Videos;
Conference_Titel :
Applications of Computer Vision (WACV), 2013 IEEE Workshop on
Conference_Location :
Tampa, FL
Print_ISBN :
978-1-4673-5053-2
Electronic_ISBN :
1550-5790
DOI :
10.1109/WACV.2013.6475002