Multiscale Analysis of Images

Gilad Lerman, Math 5467

(stealing slides from Gonzalez & Woods, and Efros)

The Multiscale Nature of Images

Recall: Gaussian pre-filtering

Gaussian 1/2

Solution: filter the image, then subsample
• Filter size should double for each ½ size reduction.

Subsampling with Gaussian pre-filtering

Gaussian 1/2, G 1/4, G 1/8

Solution: filter the image, then subsample
• Filter size should double for each ½ size reduction.

Image Pyramids

Known as a Gaussian Pyramid [Burt and Adelson, 1983]
• In computer graphics, a mip map [Williams, 1983]
• A precursor to wavelet transform

A bar in the big images is a hair on the zebra’s nose; in smaller images, a stripe; in the smallest, the animal’s nose

Figure from David Forsyth

Gaussian pyramid construction

filter mask

Repeat
• Filter
• Subsample

Until minimum resolution reached
• can specify desired number of levels (e.g., 3-level pyramid)

The whole pyramid is only 4/3 the size of the original image!
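The construction above can be sketched in NumPy as follows. This is a minimal sketch, not the implementation behind the slides: the 5-tap binomial kernel (a common approximation of a Gaussian), the reflected borders, and the names `blur` and `gaussian_pyramid` are assumptions made for illustration. The final lines compare the total pixel count of the pyramid with the original, which stays below 4/3 because 1 + 1/4 + 1/16 + ... = 4/3.

```python
import numpy as np

# Assumed filter mask: 5-tap binomial kernel, a common Gaussian approximation.
KERNEL = np.array([1.0, 4.0, 6.0, 4.0, 1.0]) / 16.0


def blur(image):
    """Separable 5x5 low-pass filter with reflected borders."""
    h, w = image.shape
    padded = np.pad(image, 2, mode="reflect")
    # Filter along each row, then along each column.
    rows = sum(wt * padded[:, i:i + w] for i, wt in enumerate(KERNEL))
    return sum(wt * rows[i:i + h, :] for i, wt in enumerate(KERNEL))


def gaussian_pyramid(image, levels):
    """Repeat (filter, subsample by 2) until `levels` images exist."""
    pyramid = [np.asarray(image, dtype=float)]
    for _ in range(levels - 1):
        pyramid.append(blur(pyramid[-1])[::2, ::2])
    return pyramid


image = np.random.default_rng(0).random((64, 64))
pyramid = gaussian_pyramid(image, levels=4)
shapes = [level.shape for level in pyramid]
size_ratio = sum(level.size for level in pyramid) / image.size
print(shapes, size_ratio)
```

Applying the same small kernel at every level has the effect of doubling the filter size relative to the original image at each step, since each level has half the resolution of the previous one.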

What does blurring take away? (recall)

original

What does blurring take away? (recall)

smoothed (5x5 Gaussian)

High-Pass filter

smoothed – original

Band-pass filtering

Laplacian Pyramid (subband images)
Created from Gaussian pyramid by subtraction

Gaussian Pyramid (low-pass images)

Laplacian Pyramid

How can we reconstruct (collapse) this pyramid into the original image?

Need this!

Original image
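One possible answer to the reconstruction question, sketched under assumptions: each Laplacian level stores the difference between a Gaussian level and an upsampled copy of the next coarser level, and the coarsest Gaussian level is stored as well (the "Need this!" image). Collapsing then upsamples and adds, from coarse to fine. The kernel, the zero-insertion upsampling in `expand`, and all function names are illustrative choices, not taken from the slides.

```python
import numpy as np

KERNEL = np.array([1.0, 4.0, 6.0, 4.0, 1.0]) / 16.0  # assumed 5-tap binomial mask


def blur(image):
    """Separable 5x5 low-pass filter with reflected borders."""
    h, w = image.shape
    padded = np.pad(image, 2, mode="reflect")
    rows = sum(wt * padded[:, i:i + w] for i, wt in enumerate(KERNEL))
    return sum(wt * rows[i:i + h, :] for i, wt in enumerate(KERNEL))


def expand(image, shape):
    """Upsample to `shape`: insert zeros, then low-pass filter with gain 4."""
    up = np.zeros(shape)
    up[::2, ::2] = image
    return 4.0 * blur(up)


def laplacian_pyramid(image, levels):
    """Band-pass levels plus the coarsest low-pass level as the last entry."""
    gauss = [np.asarray(image, dtype=float)]
    for _ in range(levels - 1):
        gauss.append(blur(gauss[-1])[::2, ::2])
    bands = [g - expand(g_next, g.shape) for g, g_next in zip(gauss[:-1], gauss[1:])]
    bands.append(gauss[-1])  # without this level the original cannot be recovered
    return bands


def collapse(bands):
    """Rebuild the image: start at the coarsest level, upsample, add each band."""
    image = bands[-1]
    for band in reversed(bands[:-1]):
        image = band + expand(image, band.shape)
    return image


image = np.random.default_rng(1).random((64, 48))
bands = laplacian_pyramid(image, levels=4)
restored = collapse(bands)
print(np.max(np.abs(restored - image)))
```

The reconstruction is exact up to floating-point rounding for any choice of `expand`, as long as the same `expand` is used when building and when collapsing the pyramid.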

Image resampling (interpolation)

So far, we considered only power-of-two subsampling
• What about arbitrary scale reduction?
• How can we increase the size of the image?

Recall how a digital image is formed
• It is a discrete point-sampling of a continuous function
• If we could somehow reconstruct the original function, any new image could be generated, at any resolution and scale

[Figure: samples at x = 1, 2, 3, 4, 5; d = 1 in this example]

Image resampling

So what to do if we don’t know the continuous function f?

[Figure: samples at x = 1, 2, 3, 4, 5, with the value at x = 2.5 unknown; d = 1 in this example]

• Answer: guess an approximation
• Can be done in a principled way: filtering

Image reconstruction

• Convert to a continuous function
• Reconstruct by cross-correlation with a reconstruction filter

Resampling filters

What does the 2D version of this hat function look like?

Better filters give better resampled images
• Bicubic is a common choice

Why not use a Gaussian?

What if we don’t want whole f, but just one sample?

Hat function: performs linear interpolation

2D hat (tent function): performs bilinear interpolation
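The one-sample case can be illustrated in 1D. The sketch below assumes samples located at x = d, 2d, ..., as in the earlier figure, and reconstructs a single value by weighting the samples with a hat function centered at the query point. The sample values and the names `hat` and `reconstruct` are made up for illustration.

```python
import numpy as np


def hat(t):
    """Hat (triangle) function: 1 at t = 0, falling linearly to 0 at |t| = 1."""
    return np.maximum(0.0, 1.0 - np.abs(t))


def reconstruct(samples, x, d=1.0):
    """Estimate f(x) from samples taken at d, 2d, 3d, ... using the hat filter."""
    samples = np.asarray(samples, dtype=float)
    positions = d * np.arange(1, len(samples) + 1)
    return float(np.sum(samples * hat((x - positions) / d)))


F = [1.0, 3.0, 2.0, 5.0, 4.0]      # hypothetical sample values at x = 1..5
value = reconstruct(F, 2.5)         # halfway between the samples at x = 2 and x = 3
reference = float(np.interp(2.5, np.arange(1, 6), F))
print(value, reference)
```

Only the two samples adjacent to the query point receive nonzero weight, which is why the hat filter reproduces linear interpolation; `np.interp` is used here only as a cross-check.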

Bilinear interpolation

Sampling at f(x,y):
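A minimal sketch of sampling at a non-integer location with bilinear interpolation, assuming x indexes columns and y indexes rows in pixel units and that the image has at least two rows and two columns. The function name `bilinear_sample` and the clamping at the image border are illustrative assumptions.

```python
import numpy as np


def bilinear_sample(image, x, y):
    """Interpolate image at (x, y); x is the column, y the row coordinate."""
    image = np.asarray(image, dtype=float)
    h, w = image.shape
    # Clamp the query to the image domain (an assumed border policy).
    x = min(max(float(x), 0.0), w - 1.0)
    y = min(max(float(y), 0.0), h - 1.0)
    # Top-left corner of the 2x2 neighborhood containing (x, y).
    j = min(int(np.floor(x)), w - 2)
    i = min(int(np.floor(y)), h - 2)
    a = x - j  # horizontal fraction in [0, 1]
    b = y - i  # vertical fraction in [0, 1]
    return ((1 - a) * (1 - b) * image[i, j]
            + a * (1 - b) * image[i, j + 1]
            + (1 - a) * b * image[i + 1, j]
            + a * b * image[i + 1, j + 1])


grid = np.array([[0.0, 10.0],
                 [20.0, 30.0]])
center = bilinear_sample(grid, 0.5, 0.5)
print(center)
```

Each of the four neighbors is weighted by the area of the rectangle opposite to it, so the weights sum to one and the result equals the pixel value whenever (x, y) lands exactly on a pixel.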

What happens in the frequency domain?

• Laplacian Pyramid gives rise to different frequency bands

• Alternatively, one can obtain similar pyramids according to frequency bands…

Example of Frequency-bands “pyramid”

Taken from Rajashekar and Simoncelli (http://www.cns.nyu.edu/ftp/lcv/rajashekar08a.pdf)

Applications (more efficient with wavelets)

• Coding
  High-frequency coefficients need fewer bits, so one can collapse a compressed Laplacian pyramid
• Denoising/Restoration
  Setting “most” coefficients in the Laplacian pyramid to zero
• Image Blending

Denoising demonstration (motivation)

Observation: there is spatial correlation in the noise-free image (and not in the noisy image)

Conclusion: noise is more evident in the high-frequency components than in the low-frequency (smooth) components

Denoising demonstration (idea)

Partition into sub-bands of frequencies (top: high freq., bottom: low)

y: given image (vectorized); $\hat{x}(y)$: estimated denoised image (vectorized)

The middle graph: intensity thresholding.

No thresholding for the smoothest component; only the higher intensities are kept for the higher-frequency components (the filter is learned empirically)

In practice, this can introduce blurring and is not competitive with NLM, BM3D, or dictionary learning

Application: Pyramid Blending

[Figure: left pyramid, blend, right pyramid]

Image Blending

Feathering

Encoding transparency

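$I(x,y) = (R, G, B, \alpha)$

$I_{blend} = \alpha_{left} I_{left} + \alpha_{right} I_{right}$

[Figure: $I_{left}$ and $I_{right}$ with their weights $\alpha_{left}$ and $\alpha_{right}$]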

Effect of Window Size

[Figure: weights of the left and right images, between 0 and 1, for different window sizes]

Good Window Size

“Optimal” Window: smooth but not ghosted
Burt and Adelson (83): Choose by pyramids…
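Feathering with an adjustable window can be sketched as below, assuming two grayscale images of equal size that are joined at a vertical seam in the middle. The left weight falls linearly from 1 to 0 across `window` columns and the right weight is its complement. The linear ramp and the name `feather_blend` are illustrative assumptions, not taken from the slides.

```python
import numpy as np


def feather_blend(left, right, window):
    """Blend two equal-size images across a vertical seam at the center column."""
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)
    cols = np.arange(left.shape[1])
    center = (left.shape[1] - 1) / 2.0
    # Weight of the left image: 1 far left, 0 far right, linear inside the window.
    alpha_left = np.clip(0.5 - (cols - center) / window, 0.0, 1.0)
    alpha_right = 1.0 - alpha_left
    return alpha_left * left + alpha_right * right


left = np.ones((4, 8))
right = np.zeros((4, 8))
narrow = feather_blend(left, right, window=1.0)  # nearly a hard seam
wide = feather_blend(left, right, window=4.0)    # gradual transition
print(narrow[0])
print(wide[0])
```

A very small window leaves a visible seam, while a very large one mixes both images over a wide area and produces ghosting, which is the trade-off the window-size slides illustrate.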

Pyramid Blending

Pyramid Blending (Color)

Laplacian Pyramid: Blending

General Approach:

1. Build Laplacian pyramids LA and LB from images A and B

2. Build a Gaussian pyramid GR from the selected region R (a black-and-white mask image matching A and B)

3. Form a combined pyramid LS from LA and LB using nodes of GR as weights:
• LS(i,j) = GR(i,j)*LA(i,j) + (1-GR(i,j))*LB(i,j)

4. Collapse the LS pyramid to get the final blended image
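The four steps can be sketched in NumPy as follows. This is an illustration under assumptions rather than the Burt and Adelson implementation: the binomial kernel, the reflected borders, the zero-insertion upsampling, and every function name are hypothetical choices, and the images are assumed to be grayscale arrays of equal size.

```python
import numpy as np

KERNEL = np.array([1.0, 4.0, 6.0, 4.0, 1.0]) / 16.0  # assumed 5-tap binomial mask


def blur(image):
    """Separable 5x5 low-pass filter with reflected borders."""
    h, w = image.shape
    padded = np.pad(image, 2, mode="reflect")
    rows = sum(wt * padded[:, i:i + w] for i, wt in enumerate(KERNEL))
    return sum(wt * rows[i:i + h, :] for i, wt in enumerate(KERNEL))


def expand(image, shape):
    """Upsample to `shape` by zero insertion followed by filtering with gain 4."""
    up = np.zeros(shape)
    up[::2, ::2] = image
    return 4.0 * blur(up)


def gaussian_pyramid(image, levels):
    pyramid = [np.asarray(image, dtype=float)]
    for _ in range(levels - 1):
        pyramid.append(blur(pyramid[-1])[::2, ::2])
    return pyramid


def laplacian_pyramid(image, levels):
    gauss = gaussian_pyramid(image, levels)
    bands = [g - expand(g_next, g.shape) for g, g_next in zip(gauss[:-1], gauss[1:])]
    return bands + [gauss[-1]]  # keep the coarsest low-pass level


def collapse(bands):
    image = bands[-1]
    for band in reversed(bands[:-1]):
        image = band + expand(image, band.shape)
    return image


def pyramid_blend(a, b, region, levels):
    la = laplacian_pyramid(a, levels)                           # step 1
    lb = laplacian_pyramid(b, levels)
    gr = gaussian_pyramid(region, levels)                       # step 2
    ls = [g * x + (1.0 - g) * y for g, x, y in zip(gr, la, lb)]  # step 3
    return collapse(ls)                                         # step 4


a = np.zeros((64, 64))
b = np.ones((64, 64))
region = np.zeros((64, 64))
region[:, :32] = 1.0  # take the left half from A, the right half from B
blended = pyramid_blend(a, b, region, levels=4)
print(blended[0, ::8])
```

Because the mask is itself low-pass filtered at every level, coarse bands are mixed over a wide transition zone and fine bands over a narrow one, which is how the pyramid selects a window size per frequency band.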

[Figure: Laplacian levels of the left pyramid, right pyramid, and blended pyramid]

Blending Regions

Simplification: Two-band Blending

Brown & Lowe, 2003
• Only use two bands: high freq. and low freq.
• Blend low freq. smoothly
• Blend high freq. with no smoothing: use binary mask

Can be explored in a project…
