ieee transactions on consumer electronics, vol. 58, no. 2, may 2012 kyungmin lim, seongwan kim,...

31
FAST BLOCK SIZE AND MODE DECISION ALGORITHM FOR INTRA PREDICTION IN H.264/AVC IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報報報 報報報 1

Upload: evangeline-haynes

Post on 20-Jan-2016

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

1

FAST BLOCK SIZE AND MODE DECISION ALGORITHM

FOR INTRA PREDICTION IN H.264/AVC

IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012

Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member,

IEEE

報告者:劉冠宇

Page 2: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

2

Outline

Introduction Overview Method

A. Block Size Decision using Inner-Block Variance B. MD-based Intra Mode Skip C. Candidate Prediction Modes Reduction in Intra4x4

Experimental Results Conclusion Q & A

Page 3: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

3

Introduction(1/2)

There are various techniques to improve the performance Spatial Intra Prediction Improved Inter Prediction Motion Estimation Multiple Reference Frame Context-Based Adaptive Binary Arithmetic Coding(CABAC) Rate Distortion Optimization (RDO) Most Probable Mode (MPM) The sum of absolute hadamard transformed differences (SATD)

Page 4: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

4

Introduction(2/2)

RDO technique compares all possible combinations of modes to find the minimum rate distortion cost.

MPM is selected as the best mode which provides effective mode coding by addressing the spatial correlation of neighboring blocks.

SATD is calculated for each of four chroma prediction modes. The mode which has the minimum SATD is set as the best mode.

Page 5: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

5

Overview(1/6)

In H.264 intra prediction, intra4x4 and intra16x16 prediction are provided for luma, and 8x8 prediction is supported for chroma.

There are nine prediction modes for intra 4x4 prediction that use reference pixels which are block boundary pixels such as X, A-L.

Page 6: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

6

Overview(2/6)

Page 7: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

7

Overview(3/6)

Intra16x16 has three directional prediction modes and one DC mode.

Chroma prediction also has four prediction modes similar to intra16x16.

Page 8: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

8

Overview(4/6)

The total combination with RDO computation in the MB will be Chr8x8 (16 Intra4x4 + Intra16x16) = 592 RDO ⅹ ⅹcomputations for every MB and identifies the mode with the minimum value as the best prediction mode. [1]

This full search process requires high computational cost.

[1] F. Pan, X. Lin, S. Rahardja, K. P. Lim, Z. G. Li, D. Wu, and S. Wu, “Fast mode decision algorithm for intraprediction in H.264/AVC video coding,” IEEETrans.Circuits Syst. Video Technol., vol. 15, no. 7, pp. 813–822, Jul. 2005.

Page 9: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

9

Overview(5/6)

Most Probable Mode (MPM) which exploits the correlations between intra modes of neighboring 4x4 blocks.

If the block A and B are coded as intra 4x4 as depicted, the MPM of the current coding block C is set as the minimum prediction modes of A and B.

Page 10: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

10

Overview(6/6)

Otherwise, MPM is set to DC mode. For each 4x4 block, one flag bit is used for signaling MPM. If the flag is signaled as “1”, the best mode is selected to be MPM. If the flag is “0”, a different mode is set as the best mode.

Therefore, if MPM is selected as the best mode, only one flag bit is used to represent the mode. Otherwise, three additional bits are needed. [2]

[2] I. E. G. Richardson, H.264/MPEG-4 Part 10: Intra Prediction White Paper 2002.

Page 11: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

11

Page 12: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

12

Method A. Block Size Decision using Inner-Block Variance

B. MD-based Intra Mode Skip

C. Candidate Prediction Modes Reduction in Intra4x4

Page 13: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

13

Method.A(1/5)

The intra4x4 prediction which is predicted based on each 4x4 block is well suited for coding parts of significant details(complex). On the other hand, the intra16x16 which is predicted for the whole 16x16 block is well suited for smooth regions.

Page 14: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

14

Method.A(2/5)

In many cases, they found that intra16x16 is selected as the best mode when the border of the MB lies on an edge of the homogeneous region.

We assume that it is well guaranteed whether intra16x16 can be selected as the best block size or not, by observing the homogeneity of the inner-8x8 sub-block.

Page 15: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

15

Method.A(3/5)

The variance of the inner-8x8 sub-block defined is used as the measurement of texture homogeneity.

Where Y (i, j) is the luminance value of the pixel at (i, j).

Page 16: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

16

Method.A(4/5)

A simulation result of the cumulative hit ratio of intra16x16 and intra4x4 corresponding to the variance using different QPs.

Page 17: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

17

Method.A(5/5)

The resulted threshold formula is as follows 456 2

If the sub Var is higher than Th , the examination of intra16x16 block is skipped.

Page 18: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

18

Page 19: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

19

Method A. Block Size Decision using Inner-Block Variance

B. MD-based Intra Mode Skip

C. Candidate Prediction Modes Reduction in Intra4x4

Page 20: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

20

Method.B(1/3)

In intra4x4 prediction, we use the mean deviations (MD) to measure the similarities of boundary pixels. It can be obtained as follows

1

2

Where is the upper block boundary pixel, is the left block boundary pixel, and m is the average of the boundary block pixels.

Page 21: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

21

Method.B(2/3)

The threshold Th , which is depending on QP, is defined as 1

we propose that MPM is the best mode when the

is lower than . RDO selects more intra4x4 blocks with MPM than any other modes. It always needs only one bit for mode coding.

Page 22: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

22

Method.B(3/3)

This algorithm also can be applied to intra16x16. In this case, DC mode is selected as the best mode when the MD of intra16x16 block boundary pixels is lower than the threshold Th , which is defined as

0

Page 23: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

23

Page 24: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

24

Method A. Block Size Decision using Inner-Block Variance

B. MD-based Intra Mode Skip

C. Candidate Prediction Modes Reduction in Intra4x4

Page 25: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

25

Method.C(1/3)

As described above, the MD-based intra mode skip can effectively reduce RDO calculations. But if the MD of block boundary pixels is higher than Th , nine prediction modes will still be calculated using the RDO technique for intra4x4.

If the MD in the upper pixels, which are denoted as pixels A, B, C, and D, is small enough, then prediction mode 0, 3, 4, 5, and 7 will produce similar prediction blocks. The MD of upper pixels is defined as follows

123

where is the upper block boundary pixel.

Page 26: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

26

Method.C(2/3)

Therefore, when the upper pixels are similar to each other, we only search mode 0 instead of adjacent five modes (mode 0, 3, 4, 5, 7). In the same manner, when the left pixels are similar to each other, mode 1 is searched and mode 4, 6, and 8 are skipped..

Page 27: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

27

Method.C(3/3)

If the MD value of upper and left is lower than Th , we skip the adjacent directional modes which produce similar prediction block. Below shows the skipped modes in both cases.

Page 28: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

28

Experimental Results Algorithm in [4]

MPM DC Mode skip

Algorithm in [7]

[4] F. Pan, X. Lin, S. Rahardja, K. P. Lim, Z. G. Li, D. Wu, and S. Wu, “Fast mode decision algorithm for intraprediction in H.264/AVC video coding,” IEEETrans.Circuits Syst. Video Technol., vol. 15, no. 7, pp. 813–822, Jul. 2005.

[7] A.C. Tsai, A. Paul, J.C. Wang, and J.F. Wang, “Intensity gradient technique for efficient intra-prediction in H.264/AVC,” IEEE Transactions on Circuits and Systems for Video Technology, vol.18, no.5, pp.694-698, May 2008.

Page 29: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

29

Experimental Results

Page 30: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

30

Conclusions This paper presents a fast block size and prediction

mode decision methods for intra prediction. The basic idea which uses the variance of the inner-8x8

sub-block is motivated by the observation. The future work for this research is that effectively

extends our algorithm to the new video coding scheme known as high-efficiency video coding (HEVC).

Page 31: IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇

31

Q & A