Professional Documents
Culture Documents
R.KANAGAVALLI
ECE-3rd yr
when (7)
where is the MSE of any element in the vector .
By substituting(5)into(6)
TheabovederivationshowsthattheMSEmeasuredi
nthe pixel domain is the weighted sum of the
MSE of subbands in the waveletdomain. Note
thatthe weightingfactors aredetermined by the
lters. The derivations from (4)(8) for the onelevel decomposition case can be used to derive
the MSE of a multilevel decomposition case, as
described in [17].
B. Temporal Subband Weighting
thleveloftemporal decomposition,weusethe
subscript (starting from 0) to predict and update
matrices and respectively. The error at level 2
that causes the error at level 1 can be derived by
extending the analysis of the following subbands
(see Fig. 2)
From (19)(21), the quantization errors of
subbands , , and can be derived, respectively, as
follows:
Followingthesamederivationsasthosein(14)and(1
5),wecan obtain: Substituting (22) and (24) into
(23) for and , and applying the derivations in
(14) and (15), we now have Here, we omit the
derivations of the other subbands at level 1
because they can be obtained by a similar
derivation to (25) or (26). We should also point
out that the errors at level 2 only affect the lowfrequency subbands at level 1. To sum up, the
matrix that relates errors between levels note
that the rows in corresponding to high-frequency
subbandsatlevel1(e.g., )arezero.Thereconstructed
MSE of the frames (at level 0) obtained by
quantizing the subbands at level 2 can be derived
by substituting (27)
Accordingly,thesummationofthevaluesinacolum
nof used to measure the error caused by
quantizing a subband becomes the reconstructed
MSE of the GOP. As in the one-level temporal
decomposition case, we call the summation the
temporal weighting factor of the subband and
denote it as . The temporal weighting factors of
the subbands at level 2 are calculated as
follows:The above derivation can also be used to
generate the temporal error propagation model
for temporal decomposition of more than two
levels; thus, we omit the derivation of that case
here. We have analyzed the error propagation
models of spatial wavelet subbands and
temporal wavelet subbands individually. Next,
we combine the analyses to construct the error
propagation model of spatial-temporal subbands.
distortionfunctionofeachsubbandcan be derived
independently. Therefore, the optimal bitallocation of a GOP can be formulated as follow
where denotes the set of spatial-temporal
subbands of the GOP, is the rate of motion
vectors, and is the spatial-temporal weighting of
subband , whose value is calculated according to
(32). The formulation implies that we should
modify the encoding phase of 2-D , shown in the
top subgure of Fig. 3, to incorporate the
weighting factors. This can bedone
bymultiplyingthe spatialweightingfactors bythe
spatial wavelet subbands, followed by
multiplying the temporal weighting matrix by
the result of the MCTF process, as shown in the
bottom subgure of Fig. 3. Althoughthe solution
ofthe aboveoptimizationproblem can be obtained
by using a Lagrangian optimization approach
[15] or a dynamic programming algorithm [11],
we propose a sim Fig. 3. Top: Bit-allocation of a
2-D structure without weighting. Bottom:
Proposed bit-allocation method, where spatial
and temporal weighting are applied to a 2-D
structure.
Fig. 4. Top: Schematic lifting structure of the 5-3
temporal lter. Bottom: Lifting structure of the
9-7 temporal lter. Conceptually, the lifting
structure of the 9-7 lter can be regarded as the
concatenation of two 5-3 lifting structures, but
with different coefcients on the prediction and
update matrices.
pler, more efcient algorithm to derive a suboptimal solution.
Conceptually,ouralgorithmisbasedonthepremiset
hatthe the magnitudes of the slopes of the
optimal rate-distortion function
indicatetheamountofdistortionreduction.Therefor
e,thealgorithm
assignsbitstoencodesubbandsaccording
toeachslopes magnitude. First, we compute the
rate-distortion function of
eachsubband.Then,wecalculatetheslopesofallrate
-distortion functions, and arrange the slopes in
EZBC[6]toencodeindividualspatialsubbandsthatr
esultfrom the temporal MCTF process. We
demonstrate and compare the results on three
different parameter settings. Scheme 1 does not
use spatial weighting or temporal weighting in
the wavelet
domain.Scheme2usesthespatialandtemporalweig
htingfactors, which are derived without
considering the pixel connectivity in the MCTF
process. Scheme 3 applies the proposed method
in the wavelet domain.
Figs.5and6show,respectively,theabsoluteMSEdif
ference between the reconstruction error and the
quantization error in the wavelet domain by
using the 5-3 and 9-7 temporal lters versus the
bit rate for various CIF sequences. We observe
that both Scheme 2 and Scheme 3 reduce the
absolute MSE differencemore effectivelythan
Scheme 1, which does notapply any
weightingfactors.Withregardtotheweightingfacto
rs,Scheme 2 reduces the absolute value of the
MSE difference; however, Scheme 3, which
incorporates the pixel connectivity into the
weightingfactors,reducestheabsolutevalueofthe
MSEdifferencemoreeffectivelythanScheme2athi
ghbitrates.Scheme3
yieldsasubstantialabsoluteMSEreductioninallbitr
atescompared to Scheme 2 when the 9-7
temporal lter is used. We also compare the
coding performances of the three schemes for
bit-allocation in a 2-D structure. Figs. 7 and 8
illustrate the PSNR coding gain of Scheme 2 and
Scheme 3 over that of Scheme 1. The average
coding performance of each scheme is measured
on the CIF resolution of a GOP with different
spatial and temporal lter combinations. As
shown in Fig. 7, for the 5-3 temporal lter, the
coding gains of Scheme 2
andScheme3arealmostthesameforallbitrates.Ifwe
average the coding gains over all the bit rates,
both schemes achieve
CHENG et al.: SUBBAND WEIGHTING
WITH PIXEL CONNECTIVITY FOR 3-D
WAVELET CODING 59
R.Ohm,Transitionlteringandoptimized
quantizationininterframewaveletvideocoding,Pr
oc.SPIEVis.Communi. Image Process., vol.
5150, pp. 682694, 2003. [14] A. Secker and D.
Taubman, Highly scalable video compression
with scalable motion coding, IEEE Trans.
Image Process., vol. 13, no. 8, pp. 10291041,
Aug. 2004. [15] G. J. Sullivan and T. Wiegand,
Rate-distortion optimization for video
compression, IEEE Signal Process. Mag., vol.
15, no. 6, pp. 7490, Nov. 1998. [16] D. S.
Taubman and M. W. Marcellin, JPEG2000.
Norwell, MA: Kluwer, 2002. [17] B. Usevitch,
Optimal bit allocation for biorthogonal wavelet
coding, in Proc. Data Compression Conf.,
1996, pp. 387395. [18] B. Usevitch, A tutorial
on modern lossy wavelet image compression:
Foundations of JPEG 2000, IEEE Signal