perceptually optimized coding of color images for band‐limited information networks

Upload: sep-publisher

Post on 02-Apr-2018

216 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    1/14

    InternationalJournalofCommunications(IJC)Volume2Issue2,June2013 www.seipub.org/ijc

    49

    PerceptuallyOptimizedCodingofColor

    ImagesforBandLimitedInformation

    Networks

    EvgenyGershikov

    DepartmentofElectricalEngineering,OrtBraudeAcademicCollegeofEngineering,Karmiel,Israel

    andDepartmentofElectricalEngineering,TechnionIIT,Haifa,Israel

    [email protected]

    Abstract

    The Mean Square Error (MSE) or the Peak Signal to Noise

    Ratio (PSNR) are common distortion measures used toassess imagequality. Nevertheless, theyare usually chosen

    duetotheirsimplicityandnottheirperformanceastheyare

    notalwayssuitablecomparedtothehumanobserver.Inthis

    workwepresentaRateDistortionapproach tocolor image

    compressionbasedonsubbandtransformsusingperceptual

    optimization of the compression quality. This approach is

    basedonminimizationof theWeightedMeanSquareError

    (WMSE)oftheencoded image,whichbettercorrespondsto

    thequalityassessmentofthehumaneye.TheWMSEcanbe

    measuredintheYCbCrcolorspace,forwhichvisualweights

    are relatively easily derived. Based on the new approach,

    new

    optimized

    compression

    algorithms

    are

    introduced

    using the Discrete Cosine Transform and the Discrete

    Wavelet Transform. We compare the new algorithms to

    presentlyavailablealgorithmssuchasJPEGandJPEG2000.

    Our conclusion is that the new WMSE optimization

    approach outperforms presently available compression

    systemswhenahumanobserver isconsidered.

    Keywords

    ColorImageCompression; WeightedMeanSquareError;Discrete

    CosineTransform;DiscreteWaveletTransform;PerceptualRate

    DistortionModel;OptimalColorComponentTransform;Optimal

    Rate

    Allocation

    Introduction

    Many color image coding algorithms are based on

    subband transforms for thecompressionprocess.The

    complexity of such algorithms varies from systems

    based on elementaryblock transforms like the DCT

    (DiscreteCosineTransform)[14]used,forexample,in

    JPEG [21] to more complicated algorithmsbased on

    the Lapped Biorthogonal Transform (LBT), the

    Discrete Wavelet Transform (DWT), wavelet packets

    and filterbanks, such as EZW (Embedded ZerotreeWavelet)[19], JPEG2000 [13][15], JPEG XR [2][16] or

    uniform DFT filterbanks [18]. Still it is not always

    clear that the added complexity also improves the

    compression results. The recently introduced Rate

    Distortion (RD)model forsubband transformcoders

    [5] can be used in such applications to assess the

    performance of the compression algorithm. This RD

    model, however, approximates the MSE distortion of

    the compression results, which is not always well

    correlatedwithsubjectiveimagequalityasseenbythe

    humaneye.Morecomplicateddistortionmeasurescan

    be proposed, such as calculating the MSE distortion

    between two imagesafteran intensity transformation

    and filtering for grayscale images [11] or a similar

    process using a nonlinear transformation of the

    primarycolorcomponents,followedbyfiltering,fora

    colorimage[1].Abasicmeasurethatissimilartothe

    MSE,but can incorporate perceptual weights is the

    Weighted MSE (WMSE). This measure assigns a

    different weight to the MSE of each subband of the

    image, thus simulating the varying sensitivity of the

    Human Visual System (HVS) to different horizontal

    and vertical spatial frequencies. As a more realistic

    tool,itcanimprovetheassessmentofthemodel.

    ThegoalofthisworkistodevelopaperceptualRate

    Distortion (RD) model of subband transform coders

    based on the WMSE as the distortion measure. We

    demonstrate the efficiency of the new model for

    subbandtransformcodingbypresentinganewtypeof

    compression algorithms based on perceptual

    optimization of the preprocessing stage and of the

    subbandratesallocation.

    ObjectiveRateDistortionTheoryofSubband

    TransformCoders

    Considerageneralsubbandtransformcoderforcolor

    images. Typically, the image samples are first pre

    processed,

    then

    subband

    transformed

    and

    quantized

    and finally postprocessed losslessly. A detailed

    descriptionofthesestagesisgivenbelow.

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    2/14

    www.seipub.org/ijc International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

    50

    1) PreprocessingHereaCCT(ColorComponentsTransform)isapplied

    totheRGBcolorcomponentsoftheimage.Wedenote

    the RGB components in vector form as

    and thenewcolorcomponentsas .The

    sizeCCTmatrixisdenotedasM.Thisstagecanbewrittenas:

    (1)

    ThegoalofusingtheCCTtransformisusuallytode

    correlate the highly correlated RGB components

    [7][10][15][23].TheCCTtransformisoftenfollowedby

    level shifting as for example is the case inJPEG2000

    [13] so that the sample range of values becomes

    symmetricaroundzero.

    2) SubbandTransformingandQuantizingAsubbandtransform,suchastheDCTortheDWTis

    applied to each color component. The subband

    coefficients of each color component are then

    quantized. An independent uniform scalar quantizer

    foreachsubbandisused.

    3) PostprocessingThequantizedcoefficientsareencodedlosslessly.The

    goal is to reduce the number ofbits required for the

    coefficients without loss of information. Techniques

    such

    as

    runlength

    encoding,

    zero

    trees,

    delta

    modulation and entropy coding are used here. This

    stagehastobeadaptedtothesubbandtransformused.

    Toderive theRDbehaviorof thealgorithm, first the

    RD of a scalar uniform quantizer needs to be

    considered. Assuming that a random signal with

    variance isquantizedbysuchaquantizer,itsRate

    Distortionbehaviorhasbeenapproximatedby[6][20]:

    (2)

    where R is the rate and is a constant dependent

    upon thedistributionof .Thenbasedon (2) theRD

    model of a general monochromatic subband coder

    with subbandscanbeexpressedas:

    (3)

    Here istheMSEofsubband ,

    is its variance, is its energy gain [20] and is the

    rate allocated to it. Also is its sample rate, i.e., the

    relativepartofthenumberofcoefficientsinitfromthe

    total number of samples in the signal. is a constant

    equalto .

    Consider now a color image. The coding algorithm

    described in the beginning of this section may be

    regardedasapplyingaCCTtotheimage,followedby

    monochromaticly subband coding each of the new

    color components. The RateDistortion model of this

    algorithmis

    [2]:

    (4)

    where and have the same meaning asbefore,

    but for subband of color component ( ).

    Optimalratesallocationforthesubbandscanbefound

    by minimizing the expression of Equation (4) under

    therateconstraint:

    (5)

    for some total image rate . Here downsampling

    factors havebeen used. denotes how much the

    number of samples of color component has been

    reducedbydownsampling.Forexample,ifthedown

    samplingisbyafactorof2horizontallyandvertically

    then:

    The optimal rates under the rate constraint of (5) as

    wellasnonnegativityconstraintsare:

    3 1

    3

    1

    2 2 1

    1 23

    1

    ( )

    1,

    ( )k k

    j jj

    bi

    j jj

    T

    i b biii

    i

    T Act

    k kkk

    kk

    RR

    G

    lna

    GM

    MM

    MM

    (6)

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    3/14

    InternationalJournalofCommunications(IJC)Volume2Issue2,June2013 www.seipub.org/ijc

    51

    Wherei

    b Act with iAct denoting the

    setofnonzero(oractive)ratesinthecolorcomponent

    ,i.e.,

    (7)

    Also

    (8)

    The structure of this work is as follows. In the next

    section the perceptual RateDistortion model is

    introduced. Section Perceptually Optimized

    Compression presents new color compression

    algorithms optimized according to this modelbased

    on the DCT subband transform and on the DWT.

    Simulationsofthenewalgorithmsandcomparisonto

    presently available algorithms are provided in this

    section.

    Our

    conclusions

    and

    summary

    are

    given

    in

    thefinalsection.

    The Perceptual R-D Model

    We assume here that we are given the visual

    perception weightscorresponding to thesubbandsof

    a certain subband transform (SBT) in a color space.

    Sucha space canbe, for example,YCbCr aswe have

    chosen in this work. We now wish to derive an

    expression for the WMSEdistortion of a coderbased

    on thesubband transform.Thesame coderdescribed

    in Subsection Objective RateDistortion theory ofsubbandtransformcodersisassumed,sothataCCT

    isapplied to the RGBcolorcomponentsof the image

    priortocodingandtheactualimagedatacompression

    isperformed inanothercolorspacedenotedC1C2C3.

    We denote as the vector of theSBT

    coefficientsat some index in subband in the YCbCr

    color space. Similarly, the vector of subband

    coefficients in the C1C2C3 color space is denoted

    .DuetothelinearityoftheSBT,the

    followingrelationshipholds:

    (9)

    where stands for the CCT matrix from the YCbCr

    color space to the C1C2C3 space. If is the CCT

    matrixfromRGBtoC1C2C3and istheRGBto

    YCbCrmatrix,then:

    (10)

    SincetheSBTcoefficientsarelossyencoded,errorsare

    introducedbetweenthereconstructedcoefficients

    in the YCbCr color space and the original ones. The

    error

    covariance

    matrix

    for

    the

    subband

    in

    the

    YCbCrdomainis:

    (11)

    where denotes statistic mean. Similarly in the

    C1C2C3domain:

    (12)

    andusing(9)wecanexpress by as:

    (13)

    The MSE distortions of the YCbCr color

    componentsinsubband arethediagonalelementsof

    andthus:

    (14)

    where is the row of in column form. In a

    similar fashion, the diagonal elements of canbe

    recognized as the MSE distortions of the , ,

    colorcomponents,givenby (2)andslightlyrewritten

    tobecome:

    (15)

    where andbi

    R and , denote the

    rateandvarianceofsubband ofcolorcomponent ,

    respectively. Note that we continue here with the

    consistent notation of a tilde for the variables related

    to the C1C2C3 color space. Assuming that the

    quantization errors of the three color components in

    eachsubbandintheC1C2C3domainareuncorrelated,

    becomesadiagonalmatrixand(14)becomes

    (16)

    once(15)issubstitutedfor .Nowif,forthesakeof

    convenience,wedenote theYCbCrcolorcomponents

    at each pixel as a vector , then the

    WMSEofthe colorcomponent is:

    (17)

    Ascanbeseen,thisexpressionincorporatestheenergy

    gainsofthesubbands aswellastheirsamplerates

    .

    Also

    the

    visual

    weights

    are

    part

    of

    the

    expression,providingvaryingsignificancetodifferent

    subbands of the same color component as well as

    between color components. Defining the total WMSE

    astheaverageWMSEoftheYCbCrcolorcomponents,

    weget:

    3

    1

    3 1

    1 0

    1( )

    3

    1

    3

    i

    i

    B

    b b bi b i

    i b

    WMSE WMSE x

    G w d

    YCbCr

    (18)

    and after substituting (16) for the expression

    becomes:

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    4/14

    www.seipub.org/ijc International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

    52

    21 23 1 3

    2

    1 0 1

    2121 3 3 _

    2

    0 1 1

    _

    1

    3

    1.

    3

    bk

    bk

    BaR

    bkb b bi k i b k ik

    BaR

    bkb b k bib k i

    ik

    WMSE G w e

    G e w

    M

    M

    (19)

    Tosimplify(19)wedenote:

    (20)

    sothattheWMSEexpressionbecomes:

    (21)

    Clearly,ifthevisualweights areallequalto1,the

    WMSE expression of (21) should become the

    expression for the MSE in the YCbCr domain. This

    expression is given exactlyby (4) with the differencethat there is tobe replacedby in our case. From

    thecomparisonofequations(21)and(4)weconclude

    that in that case, which means

    accordingto(20)that

    (22)

    Astraightforwardcheckprovesthatthisisindeedthe

    case.

    Decorrelation

    of

    the

    Quantization

    Errors

    In the derivation of the WMSE expression of (21) we

    haveassumedthatthequantizationerrorsofthe , ,

    colorcomponentsareuncorrelatedineachsubband.

    It is of interest to note that the assumption in the

    derivationoftheMSEexpressionof(4)wasthelackof

    correlation of the quantization errors in the image

    domain [2], i.e. that and have zero

    correlation for , . Note that

    denotes here the reconstructed component. The

    questionthat

    rises

    is

    whether

    the

    assumption

    of

    zero

    correlation in each subband means also zero

    correlationintheimagedomain.

    Using the vector space interpretation of subband

    transforms[20],wecanwriteforthecolorcomponent

    :

    (23)

    here is the color component in vector form after

    lexicographic ordering. are the SBT synthesis

    vectors. Also the sum on is on all the coefficient

    indices in the subband, denotes the subband

    coefficient of at index and is the same

    coefficientafterquantizationandreconstruction.Now

    consider the expression for

    , where is the number of the image pixels.

    UsingEquation(23),itcanbewrittenas:

    (24)

    where . Assuming zero

    correlation of the quantization errors of the different

    color components in each subband and between

    subbands means that , hence

    immediately follows

    according to (24). But this means exactly zerocorrelationofthequantizationerrorsinimagedomain.

    ThusthederivationoftheWMSEexpressionof(21)is

    once again consistent with the derivation of the MSE

    expressionof(4).

    BasicOptimizationUsingtheWMSEModel

    AfterderivingtheWMSEexpression,thenaturalnext

    step is touse it to find theoptimalratesandoptimal

    CCT in the WMSE sense. First we wish to minimize

    the WMSE of (21), subject to the rate constraint

    , resulting in the followingLagrangian( istheLagrangemultiplier):

    (25)

    whichisminimizedbytheoptimalratesgivenby:

    (26)

    Here

    (27)

    Notethatnoconstraintsfornonnegativityoftherates

    are used here, which means that high rates are

    assumed.As for theoptimal CCTmatrix : it canbe

    foundbyminimizingthetargetfunction ,that is

    actuallythedenominatorofthe in(26)aftersome

    straightforwardmanipulations:

    (28)

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    5/14

    InternationalJournalofCommunications(IJC)Volume2Issue2,June2013 www.seipub.org/ijc

    53

    Weshouldremindhere that is a functionof as

    given in (20). Also the variances depend on , or

    more specifically on . These variances are the

    diagonalelementsofthesubband covariancematrix

    intheC1C2C3imagedomain:

    (29)

    andcanalsobeexpressedusingthe matrixandthe

    subband b covariance matrix in the RGB image

    domain:

    ,T

    Y YE Y Y

    b b

    RGB RGB RGB RGB

    b b b

    Y E Y bRGB RGB

    b

    (30)

    accordingto:

    (31)

    Here is defined similarly to the

    definitions in thebeginning of the section of . Also

    denotesthe rowofthe matrixinvectorform.

    Thusthetargetfunction canberewrittenas:

    3 1_

    1 0

    213 _

    1

    ( ) ( ) ,

    .

    bB

    T

    b bk

    k b

    bk bi

    i ik

    f m m G

    w

    k b kM

    M

    (32)

    OptimalRateswithDownSampling

    When considering potential downsampling of some

    ofthecolorcomponents,therateconstraintbecomes(5)

    and the Lagrangian that incorporates this constraint,

    as well as constraints for the nonnegativity of the

    subbandrates,is:

    21 3

    2

    0 1

    3 1 3 1

    1 0 1 0

    1{ }, , ,{ }

    3

    ,

    bk

    BaR

    bkbi bi b b k bk

    b k

    B B

    i b bi bi bii b i b

    L R G e

    R R R

    M

    (33)

    where are the Lagrange multipliers for the new

    constraints.Theratesthatminimize(33)are:

    3

    1

    3

    1

    2 2

    2

    3

    1

    1,

    k k

    j jj

    bi

    j jj

    i b bi bk

    i

    Act Act

    k k k

    kk

    RR

    G

    lna

    GM

    (34)

    wherei

    b Act with iAct defined in (7)while

    andi

    areasin(8).Asfor ,itisgivenby:

    (35)

    Perceptually Optimized Compression

    Inthissectionwepresentageneralapproachtocolor

    image compression using a subband transform with

    perceptual optimization of the CCT and of the

    subbandratesallocation.Thisapproachconsistsofthe

    stagesdescribedinthebeginningofSectionObjective

    RateDistortion theoryofsubband transform coders.

    Thedifferenceshereisthatinthepreprocessingstage,

    theperceptuallyoptimalCCT transform isapplied to

    thecolorcomponentsandinthequantizationstagethe

    perceptually optimal rates allocation is used. Wedemonstrate this approachboth for the DCT and the

    DWT in the following subsections. Note that a

    probabilitydistributionmodelcanbeusedfortheSBT

    coefficients to improve the performance of the

    algorithms withrespect toruntime and compression

    quality [4]. For example, the Laplacian probability

    modelcanbeassumedforDCTcoefficients[9].

    DCTBasedCompressionAlgorithm

    Since the DCT is a subband transform, the Rate

    Distortion theory of Section The Perceptual RDModel canbe applied to it. To find the DCT visual

    weights we use the HVS CSF (Contrast Sensitivity

    Function)curvesfortheYCbCrcolorspacethatcanbe

    found, forexample, in [20].Toconvert thecpd (cycle

    perdegree)unitsof thesegraphs tospatialfrequency

    unitsfortheDCT, theequationsproposed in [22]can

    be used. We consider, for example,

    512 512 images displayed as

    onadisplaywithdotpitchof0.25mm.

    Theviewingdistance isassumed tobe four times the

    imageheight[12],i.e.,inthisexample50cm.Similarlywe can consider images displayed as

    onabigscreenataviewingdistance

    of100cm.

    Thestagesoftheproposedalgorithmareasfollows:

    1. FindtheoptimalCCT byminimizing(32).2. Apply the CCT to the RGB colorcomponentsoftheimagetoreceivethenewcolor

    components , , .

    3. Apply theDCTblock transform toeachcolorcomponent , .

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    6/14

    www.seipub.org/ijc International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

    54

    4. Calculate the optimal rates according to (34)substituting there the used CCT matrix and the

    variancesoftheDCTsubbands.Tofindtheactive

    subbands, the algorithm presented in [5] canbe

    used.

    5. QuantizetheDCTcoefficientsusingauniformscalar quantizer in each subband. The (optimal)

    quantization steps are found using an iterative

    algorithm[5].

    6. Use postquantization coding similar to theone used inJPEG. Adaptive Huffman coding is

    employed and the codes are sent with the image

    data.Thisstage is losslessanddoesnotaffectthe

    imagedistortion.

    It is of interest to compare the performance of this

    algorithm to the other DCTbased compressionalgorithms, such as the MSE optimized algorithm

    proposed in [5] andJPEG. A comparison for several

    imagesisgiveninTable1.Weconsiderheretheabove

    algorithm with WMSE RD optimization of the rates

    allocation and CCT, as well as another version that

    usesoptimalrates,but in theYCbCrcolorspace.The

    PSPNRmeasureusedhereis:

    (36)

    where for each color component is calculated

    in the DWT domain in the YCbCr color space

    accordingtothevisualweightssuggestedin[20].Then

    theaveragePSPNRonthe3colorcomponentsistaken.

    Based on our experience and results, this is a good

    measureofsubjectiveimagequality.

    Note that each image in the table wascompressedat

    the same compression rate (given in the last column

    fromtheleft)foreachofthealgorithms,buttherateis

    not the same for different images. The reason is that

    thethe

    rate

    was

    chosen

    to

    achieve

    the

    same

    objective

    performance (PSNR) for the first algorithm (from the

    left) to allow meaningful comparison of average

    algorithmperformances.SeealsoTable2below.

    TABLE1Perceptuallybasedresults(PSPNR)for(fromlefttoright):TheDCTbasedWMSEoptimizedalgorithmintheYCbCrdomain;Thesame

    algorithmwithoptimalCCT;TheMSEoptimizedalgorithm;JPEG.Thecompressionrateforeachimageisshownintherightcolumn.

    Image WMSEAlg.inthe

    YCbCrdomain

    WMSEAlg.inthe

    optimaldomain

    MSEAlg. JPEG Rate[bpp]

    Lena 39.4 40.6 38.9 37.6 0.76

    Peppers 39.6 39.6 38.1 36.6 0.81

    Baboon 42.0 42.5 39.2 36.1 1.76

    Cat 41.3 43.1 41.3 39.9 1.30

    Landscape 42.5 42.5 40.5 38.0 1.85

    House 39.8 40.3 39.2 38.1 0.54

    JellyBeans 38.5 38.6 38.3 37.5 0.47

    Fruits 41.0 42.3 40.4 38.9 0.71

    Sails 41.0 42.9 39.7 37.6 1.84

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    7/14

    InternationalJournalofCommunications(IJC)Volume2Issue2,June2013 www.seipub.org/ijc

    55

    Monarch 39.8 40.2 38.7 37.5 1.03

    Goldhill 42.9 43.4 41.9 40.6 2.17

    Mean 40.7 41.5 39.7 38.0

    TABLE2SameasTable1,butforPSNRinsteadofPSNR.NotethatoptimizationofPSPNR,asinducedbythehumanobserver,doesnot

    necessarilymeanoptimizationofthearbitrarilyusedPSNR(seetext).

    PSNR Image WMSEAlg.inthe

    YCbCrdomain

    WMSEAlg.inthe

    optimaldomain

    MSEAlg. JPEG Rate[bpp]

    Lena 30.0 30.5 30.7 29.7 0.76

    Peppers 30.0 30.1 30.5 29.3 0.81

    Baboon 30.0 29.0 30.5 26.5 1.76

    Cat 30.0 29.6 31.3 29.5 1.30

    Landscape 30.0 30.1 30.3 25.9 1.85

    House 30.0 30.2 30.3 29.5 0.54

    JellyBeans 30.0 30.3 30.6 29.7 0.47

    Fruits 30.0 29.8 30.6 30.6 0.71

    Sails 30.0 29.7 30.6 28.9 1.84

    Monarch 30.0 29.6 30.6 29.4 1.03

    Goldhill 30.0 30.2 31.7 29.2 2.17

    Mean 30.0 29.9 30.7 28.9

    It can be concluded from the table that the WMSE

    optimized algorithm with the optimal CCT achieves

    thehighestPSPNR,which is1.8dBhigheronaverage

    thanthePSPNRoftheMSEoptimizedalgorithmand

    3.5dB above JPEG. The use of the optimal CCT in

    WMSEsenseincreasestheperformancebyalmost1dB

    (0.8dB) on average when perceptually optimal rates

    areemployed.Anothercomparisonofinterestisofthe

    standardorobjectivedistortionsofthealgorithm,i.e.,

    the PSNR as presented in Table 2. As expected, the

    MSEoptimizedalgorithmissuperiorhere,butwhatis

    perhaps less intuitive is the fact that the use of the

    optimal CCT slightly decreases the PSNR, indicating

    thatPSNRandPSPNRaredifferentmeasures.Ascan

    beseenintheexamplesofFigs.1 2below,thehuman

    observerjudgement is closely related to the PSPNR,

    not the PSNR. Despite this both WMSE algorithms

    have MSE performance superior toJPEG with a gain

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    8/14

    www.seipub.org/ijc International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

    56

    of1.1dB in thePSNRwithoutusing theoptimalCCT

    and slightly less (1dB) with the optimal CCT. We

    conclude this section by presenting a visual

    comparisonof thealgorithmsascanbeseen inFig.1

    fortheLenaimageandinFig.2fortheBaboonimage.

    Itcanbeseen that theWMSEalgorithmprovides the

    bestresultsforbothimagesvisually,whiletheresults

    of theMSEalgorithmareslightly lesspleasing to the

    eye.YetbothalgorithmsaresuperiortoJPEG.

    FIG.1COMPRESSIONRESULTSFORLENAAT0.72BPP.ORIGINALIMAGE(TOPLEFT);IMAGECOMPRESSEDBYTHEWMSE

    OPTIMIZEDALGORITHM(TOPRIGHT,PSPNR=40.4DB);IMAGECOMPRESSEDBYJPEG(BOTTOMLEFT,PSPNR=37.7DB);IMAGE

    COMPRESSEDBYTHEMSEOPTIMIZEDALGORITHM(BOTTOMRIGHT,PSPNR=39.3DB).ASEXPECTED,THEWMSEALGORITHM

    OUTPERFORMSTHEOTHERMETHODS,ESPECIALLYINTHEMARKEDAREAS.

    FIG.2COMPRESSIONRESULTSFORTHEBABOON(ZOOMEDIN)AT0.88BPP.ORIGINALIMAGE(TOPLEFT);IMAGECOMPRESSEDBY

    THEWMSE

    OPTIMIZED

    ALGORITHM

    (TOP

    RIGHT,

    PSPNR=36.9DB);

    IMAGE

    COMPRESSED

    BY

    JPEG

    (BOTTOM

    LEFT,

    PSPNR=33.6DB);

    IMAGECOMPRESSEDBYTHEMSEOPTIMIZEDALGORITHM(BOTTOMRIGHT,PSPNR=35.6DB).HEREAGAIN,THEWMSE

    ALGORITHMOUTPERFORMSTHEOTHERMETHODS.

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    9/14

    InternationalJournalofCommunications(IJC)Volume2Issue2,June2013 www.seipub.org/ijc

    57

    DWTBasedCompressionAlgorithm

    When the DWT is considered, there are quite a few

    options for the wavelet filterbank tobe used for the

    decomposition. We have chosen the Daubechies 9/7

    filter

    bank,

    but

    obviously

    other

    choices

    can

    be

    consideredaswell.Notiling[13]isused.Thechoiceof

    the visual weights is according to [20]. The stages of

    theproposedalgorithmare:

    1. FindtheoptimalCCT byminimizing(32).2. Apply the CCT to the RGB colorcomponentsoftheimagetoreceivethenewcolor

    components , , .

    3. ApplytheDWTtreedecompositionuptotherequired depth of the tree (3, 4, 5 or higher

    according to imagesize) toeachcolorcomponent, .

    4. Calculate the optimal rates according to (34)substituting there the used CCT matrix and the

    variances, sample rates and energy gains of the

    DWT subbands. The determination of the active

    subbands is the same as for the DCTbased

    algorithmoftheprevioussubsection.

    5. Quantize the DWT coefficientsby a uniformquantizer with a central deadzone in each

    subbandsimilartotheoneusedinJPEG2000PartI

    [8].Useoptimalquantizationsteps.

    6. Use thepostquantizationcodingof the EZWalgorithm[19]onthequantizedsubbandcoefficients.Thisstage is losslessand includesbit

    plane coding using zero trees. The bit plane

    coding is split into two passes (dominant and

    subordinate) and a separate arithmetic coder is

    employedforeachpass.

    Itis interestingtocomparetheproposedalgorithmto

    JPEG2000. We have considered the JPEG2000

    implementationusingtheJasPersoftwarepackage[24]

    andanotherversionofthe implementationwithfixed

    visual

    weighting

    at

    subband

    level

    using

    the

    CSF

    weightsof [20].Thevisualresults for theLena image

    canbeseeninFig.3.ThePSNRresultshereare29.5dB

    for theproposedWMSEoptimizedalgorithm,28.6dB

    for JPEG2000 (original JasPer implementation) and

    28.5dB forJPEG2000 with CSF weights. We conclude

    that the use of CSF weights, that affects the tier2

    codingstageoftheJPEG2000algorithm,decreasesthe

    PSNR,butslightlyimprovesthevisualperformance.

    FIG.3COMPRESSIONRESULTSFORLENAAT0.52BPP.ORIGINALIMAGE(TOPLEFT);IMAGECOMPRESSEDBYTHEDWTBASED

    WMSEOPTIMIZED

    ALGORITHM

    (TOP

    RIGHT,

    PSPNR=19.7DB);

    IMAGE

    COMPRESSED

    BY

    JPEG2000

    (BOTTOM

    LEFT,

    PSPNR=19.1DB);

    IMAGECOMPRESSEDBYJPEG2000WITHCSFWEIGHTS(BOTTOMRIGHT,PSPNR=19.2DB).

    ALSOHERE,THEWMSEALGORITHMISSUPERIORTOTHEREST.

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    10/14

    www.seipub.org/ijc International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

    58

    Also the proposed algorithm produces an image that

    ismuchmorepleasingtotheeyethanJPEG2000.

    Similar results canbe seen in the comparison of the

    proposed algorithm andJPEG2000 for other example

    images inFigs.4,5and6.InFig.4the lossofspatial

    details at high frequencies in the Landscape andHouse images as well as the introduction of false

    contoursintheJellyBeans imageshouldbenotedfor

    JPEG2000. These effects can be seen in the marked

    regions. In Fig. 5 once again the superiority of the

    WMSEoptimizedalgorithmonJPEG2000canbeseen

    in regions of high frequency details as demonstrated

    bytheFruitsandCatimages.Forexample,thedetails

    oftheappletextureintheFruitsimageandofthefur

    andmustachetexturesintheCatimagearelost.Inthe

    case of the Peppers image, the compression result of

    JPEG2000 is less pleasing to the eye due to the color

    artifacts introduced. Fig. 6 further demonstrates the

    loss of spatial details in the case of JPEG2000

    compression

    of

    the

    Sails

    image,

    theblurring

    of

    the

    contoursintheMonarchimageandbotheffectsinthe

    Goldhill image (see the top marked area for the

    blurred contour effect and, for example, thebottom

    left marked area for the loss of spatial details).

    Furthermore, color artifacts are introduced by

    JPEG2000 in the Goldhill image as indicated, for

    instance,inthemarkedareainthecenteroftheimage.

    FIG.4LANDSCAPE,HOUSEANDJELLYBEANSIMAGES FROMLEFTTORIGHT:ORIGINAL,COMPRESSEDBYTHEWMSE

    ALGORITHM(WMSEALG.)ANDCOMPRESSEDBYJPEG2000.

    PSPNRFORTHELANDSCAPEIMAGE:17.1DB(WMSEALG.)AND15.7DB(JPEG2000).

    PSNR:28.7DB(WMSEALG.)AND25.3DB(JPEG2000)AT0.97BPP.

    PSPNRFORTHEHOUSEIMAGE:19.4DB(WMSEALG.)AND19.0DB(JPEG2000).

    PSNR:31.2DB(WMSEALG.)AND33.1DB(JPEG2000)AT0.68BPP.

    PSPNRFORTHEJELLYBEANSIMAGE:18.8DB(WMSEALG.)AND18.2DB(JPEG2000).

    PSNR:32.3DB

    (WMSE

    ALG.)

    AND

    32.1DB

    (JPEG2000)

    AT

    0.48BPP.

    INTHEONLYCASEWHERETHEPSNROFJPEG2000ISHIGHERTHANTHENEWALGORITHM(HOUSE),THEPSPNRRESULT

    SUPPORTSTHEFACTTHATVISUALLYTHENEWALGORITHMPROVIDESSUPERIORRESULTS.

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    11/14

    InternationalJournalofCommunications(IJC)Volume2Issue2,June2013 www.seipub.org/ijc

    59

    FIG.5FRUITS,CATANDPEPPERSIMAGES FROMLEFTTORIGHT:ORIGINAL,COMPRESSEDBYTHEWMSEALGORITHM(WMSE

    ALG.)ANDCOMPRESSEDBYJPEG2000.

    PSPNRFORTHEFRUITSIMAGE:22.2DB(WMSEALG.)AND21.1DB(JPEG2000).

    PSNR:30.0DB(WMSEALG.)AND29.0DB(JPEG2000)AT1.34BPP.

    PSPNRFORTHECATIMAGE:17.0DB(WMSEALG.)AND16.2DB(JPEG2000).

    PSNR:28.9DB(WMSEALG.)AND26.9DB(JPEG2000)AT0.63BPP.

    PSPNRFORTHEPEPPERSIMAGE:20.3DB(WMSEALG.)AND19.3DB(JPEG2000).

    PSNR:30.8DB(WMSEALG.)AND30.7DB(JPEG2000)AT0.86BPP.

    ASCANBESEEN,PSNRANDPSPNRRESULTSARESUPERIORFORTHENEWALGORITHMCOMPAREDTOJPEG2000.ITISALSO

    OBSERVEDVISUALLY EXAMPLESAREINDICATEDINTHEMARKEDAREAS.

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    12/14

    www.seipub.org/ijc International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

    60

    FIG.6SAILS(ZOOMEDIN),MONARCH(ZOOMEDIN)ANDGOLDHILLIMAGES FROMLEFTTORIGHT:ORIGINAL,COMPRESSEDBYTHEWMSEALGORITHM(WMSEALG.)ANDCOMPRESSEDBYJPEG2000.

    PSPNRFORTHESAILSIMAGE:19.2DB(WMSEALG.)AND18.0DB(JPEG2000).

    PSNR:28.9DB(WMSEALG.)AND26.6DB(JPEG2000)AT0.70BPP.

    PSPNRFORTHEMONARCHIMAGE:19.9DB(WMSEALG.)AND19.6DB(JPEG2000).

    PSNR:29.0DB(WMSEALG.)AND28.8DB(JPEG2000)AT0.56BPP.

    PSPNRFORTHEGOLDHILLIMAGE:17.6DB(WMSEALG.)AND16.6DB(JPEG2000).

    PSNR:27.0DB(WMSEALG.)AND24.5DB(JPEG2000)AT0.59BPP.

    ONCEAGAIN,THEPSNRANDPSPNRRESULTSARESUPERIORFORTHENEWALGORITHMCOMPAREDTOJPEG2000(SEE

    EXAMPLESINDICATEDINTHEMARKEDAREAS).

    Summary

    A perceptuallybased model for the RateDistortion

    functionofcolorsubbandcodershasbeenintroduced.

    ThenewmodelapproximatestheWMSEdistortionof

    an image inagivencolorspace,suchasYCbCr.This

    distortion is then minimized to achieve perceptual

    optimizationofthecompression.Whentheweightsin

    the WMSE calculation are taken based on the CSF

    curves of the human visual system, better

    correspondence to image quality assessment by the

    humaneyeisachieved.

    Based on the RateDistortion model, new algorithms

    havebeen introduced consisting of a preprocessing

    stageusingaCCT,followedbyasubbandtransform,

    quantizationstage,andlosslessentropyencoding.Thealgorithms are optimized with regard to the color

    component transform in the preprocessing stage of

    the compression as well as the quantization tables

    usedinthecodingstage,bothwithrespecttoWMSE.

    TheproposedDCTbasedalgorithmoutperformsboth

    JPEG and the corresponding MSE optimized

    algorithm. The DWTbased algorithm, as expected,

    achieveshighercompressionratiosforthesameimage

    quality than DCTbased techniques. We demonstrate

    in this work that even when a relatively basic

    algorithm is used in the postprocessing stage(introducedforEZW),superiorresultsareobtainedby

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    13/14

    InternationalJournalofCommunications(IJC)Volume2Issue2,June2013 www.seipub.org/ijc

    61

    the proposed algorithm when compared to other

    DWTbasedalgorithms,suchasJPEG2000.Thisholds

    even if the same WMSE distortion is used in both

    JPEG2000andtheproposedalgorithm.Ourconclusion

    is thatbased on the new perceptual RateDistortion

    model,

    optimized

    compression

    algorithms

    can

    be

    designed with compression results superior to

    presentlyavailabletechniques.

    ACKNOWLEDGMENT

    This research was supported in partby the HASSIP

    Research Program HPRNCT200200285 of the

    EuropeanCommission,andbytheOllendorffMinerva

    Center.MinervaisfundedthroughtheBMBF.

    REFERENCES

    [1] Faugeras,O.D.DigitalColorImageProcessingWithintheFrameworkofaHumanVisualModel.IEEETrans.

    on Acoustics Speech and Signal Processing ASSP27

    (1979):380393.

    [2] Fiorucci,F.,Baruffa,G.andFrescura,F. Objectiveandsubjective quality assessment between JPEG XR with

    overlap and JPEG 2000.", Journal of Visual

    Communication and Image Representation 23 (Aug.

    2012):835844.

    [3] Gershikov, E. and Porat, M. A RateDistortionApproachtoOptimalColorImageCompression.Proc.

    ofEUSIPCO,Florence,Italy,2006.

    [4] Gershikov, E. and Porat, M. Data Compression ofColor Images using a Probabilistic Linear Transform

    Approach, inLNCS#4310, T. Boyanov,S.Dimova, K.

    Georgiev, G. Nikolov (Eds.), 582589, SpringerVerlag

    BerlinHeidelberg,2007.

    [5] Gershikov, E. and Porat, M. On color transforms andbitallocationforoptimalsubbandimagecompression.

    SignalProcessing:ImageCommunication22(2007):118.

    [6] Gersho, A. and Gray, R. M. Vector Quantization andSignalCompression,Boston,MA:Kluwer,1992,ch.2.

    [7] GoffmanVinopal, L. and Porat, M. Color imagecompression using intercolor correlation. Proc. of

    IEEEICIP(2002): II353 II356.

    [8] JPEG 2000 Part I: Final Draft International Standard(ISO/IECFDIS154441),NCITS ISO/IECJTC1/SC29/WG1

    N1855(Aug.2000).

    [9] Lam, E. Y. and Goodman, J. W. A mathematicalanalysis of the DCT coefficient distributions for

    images. IEEE Trans. on Image Processing 9 (2000):

    16611666.

    [10]LimbJ.O.andRubinsteinC.B.StatisticalDependenceBetween Components of A Differentially Quantized

    ColorSignal.IEEETrans.onCommunicationsCom20

    (Oct.1971):890899.

    [11]Mannos, J. and Sakrison, D. The effects of a visualfidelitycriterionoftheencodingofimages. IEEETrans.

    onInformationTheory20(Jul.1974):525536.

    [12]Ngan, K. N., Leong, K. S. and Singh, H. Adaptivecosine transform coding of images in perceptual

    domain. IEEETrans.onAcoustics,Speech,andSignal

    Processing37(Nov.1989):17431750.

    [13]Rabbani, M. andJoshi, R. An overview of theJPEG2000 still image compression standard. Signal

    Processing:ImageCommunication17(2002):348.

    [14]Rao, K. R. and Yip, P. Discrete cosine transform:algorithms, advantages, applications. AcademicPress,

    1990.

    [15]Richter, T. and Simon, S. On theJPEG 2000 ultrafastmode.Proc.ofICIP(Oct.2012):25012504.

    [16]Richter,T. SpatialConstantQuantizationinJPEGXRisNearly Optimal. Proc. Of the Data Compression

    Conference(March2010):7988.

    [17]Roterman,Y.andPorat,M.Color imagecodingusingregional

    correlation

    of

    primary

    colors.

    Image

    and

    VisionComputing25(2007):637651.

    [18]Satt, A. and Malah, D. Design of Uniform DFT FilterBanksOptimizedforSubbandCodingofSpeech,IEEE

    Trans. on Acoustics, Speech and signal Processing 37

    (Nov.1989):16721679.

    [19]Shapiro, J. M. Embedded Image Coding UsingZerotrees of Wavelet Coefficients, IEEE Trans. on

    SignalProcessing41(1993):33453462.

  • 7/27/2019 Perceptually Optimized Coding of Color Images for BandLimited Information Networks

    14/14

    www.seipub.org/ijc International Journal of Communications (IJC) Volume 2 Issue 2, June 2013

    62

    [20]Taubman,D.S.andMarcellin,M.W.JPEG2000:imagecompression, fundamentals, standards and practice,

    KluwerAcademicPublishers,2002.

    [21]Wallace, G. K. The JPEG still picture compressionstandard.IEEETrans.ConsumerElectronics 38 (1992):

    xviiixxxiv.

    [22]Wang, C. Y., Lee, S. M. and Chang, L. W. DesigningJPEG quantization tables based on human visual

    system. Signal Processing: Image Communication 16

    (2001):501506.

    [23]Yamaguchi,H. EfficientEncoding ofColored PicturesinR,G,BComponents.Trans.onCommunications32

    (Nov.1984):12011209.

    [24]http://www.ece.uvic.ca/mdadams/jasperEvgeny

    Gershikov received his

    Ph.D. in Electrical Engineering from

    Technion Israel Institute of

    Technology in Haifa, Israel in 2011.

    His areas of interest are Signal and

    Image Processing, Color Processing

    and Vision, Computer Vision,

    Pattern Recognition and Speech

    Recognition.