cs448f: image processing for photography and vision the gradient domain
TRANSCRIPT
![Page 1: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/1.jpg)
CS448f: Image Processing For Photography and Vision
The Gradient Domain
![Page 2: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/2.jpg)
Image Gradients
• We can approximate the derivatives of an image using differences
• Equivalent to convolution by [-1 1]• Note the zero boundary condition
10 10 8 9 0 2 3 4 2 4
10 0 -2 1 -9 2 1 1 -2 2
m =
d/dx
dm/dx =
![Page 3: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/3.jpg)
Image Derivatives
• We can get back to the image by integrating – like an integral image
• Differentiating throws away constant terms– The boundary condition allowed us to recover it
10 10 8 9 0 2 3 4 2 4
10 0 -2 1 -9 2 1 1 -2 2m =
∫
∫ m dx =
![Page 4: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/4.jpg)
Image Derivatives
• Can think of it as an extreme coarse/fine decomposition– coarse = boundary term– fine = gradients
![Page 5: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/5.jpg)
In 2D
• Gradient in X = convolution by [-1 1]
• Gradient in Y = convolution by
• If we take both, we have 2n values to represent n pixels– Must be redundant!
-11
![Page 6: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/6.jpg)
Redundancy
• d (dm/dx) / dy = d (dm/dy) / dx• Y derivative of X gradient = X derivative of Y
gradient
![Page 7: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/7.jpg)
Gradient Domain Editing
• Gradient domain techniques– Take image gradients– Mess with them– Try to put the image back together
• After you’ve messed with the gradients, the constraint on the previous slide doesn’t necessarily hold anymore.
![Page 8: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/8.jpg)
The Poisson Solve
• Convolving by [-1 1] is a linear operator: Dx
• Taking the Y gradient is some operator: Dy
• We have desired gradient images gx and gy
• We want to find the image that best produces them
• Solve for an image m such that:
y
x
y
x
g
gm
D
D
![Page 9: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/9.jpg)
The Poisson Solve
• How? Using Least Squares:
• This is a Poisson Equation
y
xTy
Tx
y
xTy
Tx g
gDDm
D
DDD
yTyx
Txy
Tyx
Tx gDgDmDDDD )(
![Page 10: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/10.jpg)
The Poisson Solve
• Dx = Convolution by [-1 1]
• DxT = Convolution by [1 -1]
• The product = Convolution by [1 -2 1]– Approximate second derivative
• DxT Dx + Dy
T Dy = convolution by
11 -4 1
1
![Page 11: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/11.jpg)
The Poisson Solve
• We need to invert:
• How big is the matrix?
• Anyone know any methods for inverting large sparse matrices?
yTyx
Txy
Tyx
Tx gDgDmDDDD )(
![Page 12: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/12.jpg)
Solving Large Linear Systems
• A = DxT Dx + Dy
T Dy
• b = DxTgx + Dy
Tgy
• We need to solve Ax = b
![Page 13: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/13.jpg)
1) Gradient Descent
• x = some initial estimate• For (lots of iterations):
r = b - Axe = rTrα = e / rTArx += αr
![Page 14: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/14.jpg)
2) Conjugate Gradient Descent• x = some initial estimate• d = r = Ax - b• enew = rTr• For (fewer iterations):
α = enew / dTAd
x += αdr = b - Axeold = enew
enew = rTr
d = r + d enew/eold
• (See An Introduction to the Conjugate Gradient Method Without the Agonizing Pain)
![Page 15: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/15.jpg)
3) Coarse to Fine Conj. Grad. Desc.
• Downsample the target gradients• Solve for a small solution• Upsample the solution• Use that as the initial estimate for a new conj.
grad. descent• Not too many iterations required at each level• This is what ImageStack does in -poisson
![Page 16: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/16.jpg)
4) FFT Method
• We’re trying to undo a convolution• Convolutions are multiplications in Fourier
space• Therefore, go to Fourier space and divide
![Page 17: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/17.jpg)
Applications
• How might we like to mess with the gradients?• Let’s try some stuff
![Page 18: CS448f: Image Processing For Photography and Vision The Gradient Domain](https://reader035.vdocuments.us/reader035/viewer/2022072011/56649e005503460f94aea199/html5/thumbnails/18.jpg)
Applications
• Poisson Image Editing– Perez 2003
• GradientShop– Bhat 2009
• Gradient Domain HDR Compression– Fattal et al 2002
• Efficient Gradien-Domain Compositing Using Quadtrees– Agarwala 2007
• Coordinates for Instant Image Cloning– Farbman et al. 2009