e38443a1b957c33306c13ec9e2ba95c1

7/30/2019 e38443a1b957c33306c13ec9e2ba95c1

1/2

Grid adaptive interpolation filters

Chee Sun Won

A general framework for image interpolation in a uniformly spaced

image grid is presented. The proposed formulation is suitable for repre-

senting fractional (or sub-pixel) pixels as well as integer pixel interpo-

lations. Also, by fitting an interpolation kernel to the grid formulation,

finite impulse response filter coefficients can be readily determined for

a given sampling interval and filter length. As an example, a four-tap

lowpass filter is derived by fitting the Lagrange interpolation kernel

to a row (or column) expansion by 2 on integer pixel locations.

Introduction: Image interpolation is a popular technique for various

image processing problems including colour restoration for missing

pixels [1] and motion compensation for video compression [2, 3].

Image interpolations are performed on integer pixel locations as in [ 1]

or on fractional sub-pixels to increase the accuracy of the motion esti-

mation as in [2, 3]. Noting thatfinite impulse response (FIR) filter coef-

ficients can be determined by sampling (or approximating) a desirable

sampling kernel such as the continuous-time sinc function, it would

be useful if we have a formula to adjust a sampling kernel to the

layout and the scale of the image grid. Motivated by this requirement,

this Letter: (i) provides a general form of uniform interpolations for

both integer and sub-pixel locations in terms of the sampling interval

andfilter length, (ii) derives a grid adaptive four-tap filter from the pro-

posed formulation.

General formula for uniform interpolations: As shown in Fig. 1a, the

original image has a uniform grid with KvKh pixels. Here, the ver-

tical and horizontal distances between the two nearest known (exist-

ing) pixels are Tv and Th, respectively. Then, our goal is to insert

and interpolate Nv and Nh pixels within the intervals of Tv and Th,

respectively. So, after the interpolation we will have a total of

[Kv(1 +Nv) Nv] [Kh(1 +Nh) Nh] pixels and the sampling inter-

vals for horizontal and vertical directions will be changed to

Dh = Th/(Nh + 1) and Dv = Tv/(Nv + 1), respectively. For example,

Fig. 1b shows the case of Nv = Nh = 1 with Dv = Tv/2 and

Dh = Th/2.

Tv

Th

Th

h

v

Tv

a b

Fig. 1 Uniformly spaced interpolation

a Before interpolation with known (bold square) pixelsb Interpolation for missing (dotted square) pixels

All necessary interpolation parameters include {Kv,Kh,Nv,Nh,Dv,

Dh, Tv, Th} . Having de

fined all the interpolation parameters, a generalformula for interpolating the missing value y(kvTv + nvDv, khTh +

nhDh) with a 2D FIRfilter can be written as follows

y(kvTv + nvDv, khTh + nhDh) =

Mv

mv=Mv+1

Mh

mh=Mh+1

y((kv + mv)Tv, (kh + mh)Th)

P(mvTv nvDv, mhTh nhDh)

(1)

where 0 kv Kv 1 , 0 kh Kh 1 , 1 nv Nv , 1 nh Nh ,

and the support area of the 2D filter covers 2Mv 2Mh.

Owing to the computational complexity, a separable filter is fre-

quently adopted such that P(mvTv nvDv, mhTh nhDh)=Pv(mvTv

nvDv)Ph(mhTh nhDh) . Then (1) is equivalent to applying a 1D

FIR filter Pv with 2Mv-tap for vertical direction and then a 1D FIRfilter Ph with 2Mh-tap for horizontal direction. In this case, the

expression in (1) can be simplified as a 1D FIR filter for each

image row and column. So, for notational convenience, if we drop

subscript v and h, we then have a 1D FIR interpolation formula

as follows:

y(kT+ nD) =M

m=M+1

y(kT+ mT)P(mT nD) (2)

where n denotes the pixel location to be interpolated within two refer-

ence (known) samples, y(kT) and y(kT+ T). So we need to evaluate

(2) for all n = 1, ,N.

The merit of expression (2) is that it covers both fractional and integer

pixel interpolations. That is, pixels to be interpolated can be located

either in the fractional pixel positions (i.e. D , 1) or in the integer

pixel locations.In H.264/AVC a six-tap filter (i.e. M= 3) is used for the interpolation

of half pixels in motion compensation [2]. This corresponds to the

form of (2) with N= 1 and its symmetric filter coefficients are

P(D) = P(T D) = 20/32 , P(T D) = P(2T D) = 5/32 ,

P(2T D) = P(3T D) = 1/32 . Similarly, the four-tap filter of

the AVS (audio video system) [3] has the filter coefficients as

P(D) = P(T D) = 5/8 and P(T D) = P(2T D) = 1/8 .

Grid adaptive four-tap filter from Lagrange interpolation kernel: From

a mathematical point of view, the Lagrange interpolation is to generalise

the linear interpolation by approximating the sinc function [4]. The

Lagrange interpolation kernel is an Lth-order polynomial function deter-

mined by L + 1 sample values as follows:

P(t+ mT nD) = QmTnD(t)QmTnD(mT nD)

(3)

where QmTnD(t) =

k

(t (kT nD))/(t (mT nD)).

For any sampling grid layout and scale we can calculate the filter coef-

ficients by fitting (3) to the grid. For example, by setting the parameters

in (2) as T= 2, N= 1, M= 2 andD = 1, we have

y(2k+ 1) =2

m=1

y(2k+ 2m)P(2m 1)

= y(2k 2)P(3) +y(2k)P(1) +y(2k+ 2)P(1)

+y(2k+ 4)P(3)

(4)

Here, the filter coefficients P( 3), P( 1), P(1), and P(3) are samples

of (3) when thefi

lter function P is located in the middle of the interpo-lating pixel (i.e. t= 0). This leads us to a four-tap filter by fitting the

Lagrange interpolation filter kernel to ourfilter formulation with T= 2

andD = 1, and we have the coefficients as follows:

P(3) =Q3(0)

Q3(3)=

1

16= P(3), P(1) =

Q1(0)

Q1(1)=

9

16

= P(1) (5)

Experiments: We compared the performance for the linear averaging

filter (denoted as Lin-2 or linear-2), the H.264/AVC six-tap filter [2]

(denoted as H.264-6) with coefficients (1, 5 20, 20, 5, 1)/32, the

AVS four-tap filter [3] (denoted as AVS-4) with coefficients ( 1, 5,

5, 1)/8 and the proposed four-tap filter (denoted as proposed-4) with

coeffi

cients (

1, 9, 9,

1)/16. First, the magnitude responses of thefilters are shown in Fig. 2. For the pass zone performance, linear-2

has the largest attenuation, while AVS-4 has the largest amplification,

both of which may cause excessive smoothing and distortions. On the

other hand, the H.264-6 and the proposed-4 filters have relatively flat

responses in the pass zone. For the transition band characteristics,

H.264-6 and the AVS-4 have the most rapid transitions from the pass

zone to the stop zone and their pass zones extend beyond the normalised

frequency of 0.5 (see Fig. 2). This extended pass zone can cause an

aliasing artefact.

Peak-signal-to-noise ratios (PSNRs) are compared with well-known

test images in Table 1. All test images are expanded as the same

sampling layout and scale of the proposed four-tap filter (i.e. doubling

the number of rows and columns with T= 2, N= 1, M= 2, and

D = 1). As shown in the Table, although the filter length is shorter

than H.264-6, the proposed four-tap filter yields the highest averagePSNRs. This is because our filter coefficients (prop-4) are adaptively

determined such that the sampling layout and scale in the experiments

performed in Table 1 match exactly with those of the Lagrange inter-

polation kernel.

ELECTRONICS LETTERS 31st January 2013 Vol. 49 No. 3

7/30/2019 e38443a1b957c33306c13ec9e2ba95c1

2/2

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.00

0.2

0.4

0.6

0.8

1.0

1.2

1.4

normalised frequency

magnituderesponse

linear-2

H.264/AVC-6

AVS-4

proposed-4

Fig. 2 Comparison of magnitude responses

Table 1: PSNR comparisons (first 12 images are from http://sipi.usc.edu/DATABASE/ and remaining 15 images arefrom http://www.imagecompression.info/)

F il e n ame Li n- 2 H 2 64 -6 AV S- 4 P ro p- 4 F il e n ame L in -2 H 2 64 -6 AVS -4 P ro p- 4

airplane 32.62 33.37 33.26 33.46 big_tree 38.33 39.16 39.01 39.37

akiyal #1 34.15 34.24 34.32 34.62 bridge 35.96 36.65 36.53 36.84

baboon 23.16 22.39 22.45 22.93 cathedral 37.92 38.61 38.5 38.85

barbara 26.19 25.01 25.17 25.85 deer 34.14 33.11 33.27 33.84

f or em an 3 3.2 2 3 3. 25 3 3. 25 3 3. 6 fireworks 36.56 39.79 39.45 38.79

f ru it _m ix ed 3 1. 4 3 1. 41 3 1. 42 3 1. 76 flow er _f ove on 4 6.1 4 7. 48 4 7. 33 4 7. 58

goldhill 31.65 31.11 31.15 31.62 hdr 44.36 45.79 45.63 45.82

h ou se 3 0.4 4 3 0. 53 3 0. 54 3 0. 83 l eave s_ is o_ 20 0 3 2.5 1 3 4.7 1 3 4. 53 3 4. 19

l ena 3 3.3 1 3 3. 72 3 3. 67 3 3. 93 l eaves _i so _1 60 0 3 1.8 9 3 3.4 2 3 3. 33 3 3. 24

ma n 3 1.3 8 3 1. 45 3 1. 44 3 1. 72 n ig ht sh ot _i so _1 00 4 1.5 8 4 4.2 3 4 3.8 9 4 3. 86

peppers 32.6 32.45 32.41 32.79 nihgtshot_iso_1600 37.12 37.05 37.08 37.42

tree 27.56 27.67 27.67 27.97 spider_web 47.26 51.08 50.66 50.57

artficial 36.32 36.92 36.88 37.02 zone_palte 8.36 7.34 7.44 7.91

big_building 35.08 36.82 36.62 36.37 AVERAGE 33.75 34.399 34.33 34.55

Conclusions: We have established a general form of uniform inter-

polation formula in this Letter. The formulation is useful to determine

the filter coefficients adaptively for a given image sampling layout

and scale. To demonstrate the power of our grid adaptive filter,

four-tap FIRfilter coefficients are derived by fitting the Lagrange inter-

polation kernel to the sampling scale of doubling the number of rows

and columns at integer pixel locations. In our experiments the grid adap-

tive four-tap filter yields the highest average PSNR values (even higher

than the six-tap filter).

Acknowledgments: This work was supported by the Basic Science

Research Program through the National Research Foundation of

Korea (NRF) funded by the Ministry of Education, Science and

Technology (20110025770).

The Institution of Engineering and Technology 2013

17 July 2012

doi: 10.1049/el.2012.2481

One or more of the Figures in this Letter are available in colour online.

Chee Sun Won (Department of Electronic and Electrical Engineering,

Dongguk University-Seoul, Seoul, 100-715, Republic of Korea)

E-mail: [email protected]

References

1 Su, C.-Y., Chang, M.-K., and Hong, C.-M.: Optimal integer FIRfilter-ing for colour interpolation, Electron. Lett., 2010, 46, (20),

pp. 1376

13772 Wiegand, T., Sullivan, G.J., Gjontegaard, G., and Luthra, A., Overview

of the H.264/AVC video coding standard, IEEE Trans. Circuits Syst.Video Technol., 2003, 13, pp. 560576

3 Wang, R., Huang, C., Li, J., and Shen, Y., Sub-pixel motion compen-sation interpolation filter in AVS. Proc. of IEEE ICME, Taipei,Taiwan, 2004, pp. 9396

4 Lehmann, T.M., Gonner, C., and Spitzer, K., Survey: interpolationmethods in medical image processing, IEEE Trans. Med. Imag., 1999,18, (11), 1999, pp. 10491075

ELECTRONICS LETTERS 31st January 2013 Vol. 49 No. 3

e38443a1b957c33306c13ec9e2ba95c1

Documents