What Energy Functions Can be Minimized Using Graph Cuts?

Shai Bagon

Advanced Topics in Computer Vision

June 2010

What is an Energy Function?

For a given problem: suggested solution → a number

Image Segmentation:

[Figure: candidate segmentations of an image, each with its energy value]

Useful Energy function:

1. Good solution ↔ Low energy

2. Tractable ↔ Can be minimized

Families of Functions or Outline

• F2 submodular

• Non submodular

• F3

• Beyond F3

Foreground Selection

Let

yi – color of ith pixel

xi ϵ {0,1} BG/FG labels (variables)

Given BG/FG scribbles:

Pr(xi|yi) = how likely each pixel is to be FG/BG

Pr(xm|xn) = adjacent pixels should have the same label

F2 energy:

E(x)=∑iEi(xi)+∑ijEij(xi,xj)

Submodular

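E(x) = ∑i Ei(xi) + ∑ij Eij (xi, xj), xi ϵ {0,1}

Known concept from set-functions:

$f(x) + f(y) \ge f(x \cup y) + f(x \cap y), \quad \forall x, y \subseteq S$

$f(1,0) + f(0,1) \ge f(1,1) + f(0,0), \quad |S| = 2$

Eij(xi,xj):

         xj=0   xj=1
xi=0      A      B
xi=1      C      D

What does it mean?

B+C-A-D ≥ 0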
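The condition can be checked mechanically for any pairwise term. The sketch below is only an illustration: it assumes the table layout A = Eij(0,0), B = Eij(0,1), C = Eij(1,0), D = Eij(1,1), and the helper name `is_submodular` and the two example tables are hypothetical, not from the slides.

```python
def is_submodular(E):
    """E[a][b] holds Eij(xi=a, xj=b) for a, b in {0, 1}."""
    (A, B), (C, D) = E
    return B + C - A - D >= 0

# Hypothetical examples: a Potts-style term that penalizes different labels,
# and its opposite, which penalizes equal labels.
potts = [[0, 1], [1, 0]]
anti_potts = [[1, 0], [0, 1]]

print(is_submodular(potts))       # True
print(is_submodular(anti_potts))  # False
```

A term that rewards neighbors for disagreeing is the typical source of non-submodularity.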

How to Minimize?

E(x) = ∑i Ei(xi) + ∑ij Eij (xi, xj), xi ϵ {0,1}

Local “beliefs”: data term ∑i Ei(xi)

Prior knowledge: smoothness term ∑ij Eij (xi, xj)

F2 submodular

Graph Partitioning

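A weighted graph G = (V, E, w)

Special nodes: s, t

s-t cut: $S \subseteq V, \; T = V \setminus S, \; s \in S, \; t \in T$

Cost of a cut: $\mathrm{Cut}(S,T) = \sum_{i \in S,\, j \in T} w_{ij}$

Nice property: 1:1 mapping

s-t cut ↔ $\{0,1\}^{|V|-2}$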
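A minimal sketch of the 1:1 mapping, assuming a cut costs the total weight of edges leaving S, and that a node gets label 1 exactly when it lies on the source side. The toy graph, the node names and the helper `cut_cost` are hypothetical.

```python
from itertools import product

def cut_cost(w, S):
    """Total weight of the edges that go from S to its complement."""
    return sum(weight for (i, j), weight in w.items() if i in S and j not in S)

# Hypothetical toy graph: terminals "s", "t" and two ordinary nodes "a", "b".
w = {("s", "a"): 3, ("s", "b"): 1, ("a", "b"): 2, ("a", "t"): 1, ("b", "t"): 4}
inner = ["a", "b"]

# Every binary vector over the non-terminal nodes defines exactly one s-t cut.
cuts = {}
for bits in product((0, 1), repeat=len(inner)):
    S = {"s"} | {v for v, b in zip(inner, bits) if b == 1}
    cuts[bits] = cut_cost(w, S)

for bits, cost in cuts.items():
    print(bits, cost)
```

With |V| = 4 there are 2^(|V|-2) = 4 cuts, one per binary assignment to the two non-terminal nodes.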

Graph Partitioning - Energy

E(x) = ∑i Ei(xi) + ∑ij Eij (xi, xj)

Graph Partitioning

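[Graph: pairwise edge with weight B+C-A-D]

Eij(xi,xj), rows indexed by xi and columns by xj:

$\begin{pmatrix} A & B \\ C & D \end{pmatrix} = A + \begin{pmatrix} 0 & 0 \\ C-A & C-A \end{pmatrix} + \begin{pmatrix} 0 & D-C \\ 0 & D-C \end{pmatrix} + \begin{pmatrix} 0 & B+C-A-D \\ 0 & 0 \end{pmatrix}$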

Graph Partitioning - Energy

Graph Partitioning

[Graph: pairwise edge with weight B+C-A-D]

$x_v = 1 \iff v \in S$

$\mathrm{cut}(S,T) = \sum_{i \in S,\, j \in T} w_{ij}$

s-t cut ↔ binary assignment

cut cost ↔ energy of assignment

min cut ↔ energy minimum

B=Eij(0,1)

F2 submodular:

Eij(1,0)+Eij(0,1)≥Eij(0,0)+Eij(1,1)

Mapping from energy to graph partition

Min Energy = computing min-cut

Global optimum in polynomial time for submodular functions!
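The mapping can be sketched end to end on a tiny example. This is an illustration under stated assumptions, not the construction from the slides: it assumes the label convention x_v = 1 for nodes on the source side, splits each pairwise table ((A, B), (C, D)) into a constant, two unary parts and one edge of weight B+C-A-D, and uses a plain Edmonds-Karp max-flow. The function names and the 3-pixel energy are hypothetical.

```python
from collections import deque
from itertools import product

def max_flow(cap, s, t):
    """Edmonds-Karp on a dense capacity matrix (modified in place).
    Returns (flow value, nodes reachable from s in the final residual graph)."""
    flow = 0
    while True:
        parent = {s: None}
        queue = deque([s])
        while queue and t not in parent:
            u = queue.popleft()
            for v in range(len(cap)):
                if v not in parent and cap[u][v] > 0:
                    parent[v] = u
                    queue.append(v)
        if t not in parent:
            return flow, set(parent)
        bottleneck, v = float("inf"), t
        while parent[v] is not None:
            bottleneck = min(bottleneck, cap[parent[v]][v])
            v = parent[v]
        v = t
        while parent[v] is not None:
            u = parent[v]
            cap[u][v] -= bottleneck
            cap[v][u] += bottleneck
            v = u
        flow += bottleneck

def minimize_f2(unary, pairwise):
    """unary[i] = (Ei(0), Ei(1)); pairwise[(i, j)] = ((A, B), (C, D))."""
    n = len(unary)
    s, t = n, n + 1
    cost = [list(u) for u in unary]
    const = 0
    cap = [[0] * (n + 2) for _ in range(n + 2)]
    for (i, j), ((A, B), (C, D)) in pairwise.items():
        assert B + C - A - D >= 0, "pairwise term is not submodular"
        const += A
        cost[i][1] += C - A
        cost[j][1] += D - C
        cap[j][i] += B + C - A - D  # cut when x_j = 1 (j in S) and x_i = 0 (i in T)
    for i in range(n):
        d = cost[i][1] - cost[i][0]
        if d >= 0:
            const += cost[i][0]
            cap[i][t] += d          # cut when i is in S, i.e. x_i = 1
        else:
            const += cost[i][1]
            cap[s][i] += -d         # cut when i is in T, i.e. x_i = 0
    flow, source_side = max_flow(cap, s, t)
    return [1 if i in source_side else 0 for i in range(n)], flow + const

def energy(x, unary, pairwise):
    return (sum(unary[i][x[i]] for i in range(len(x)))
            + sum(E[x[i]][x[j]] for (i, j), E in pairwise.items()))

# Hypothetical 3-pixel chain with a Potts smoothness term of weight 2.
unary = [(0, 5), (3, 2), (6, 1)]
pairwise = {(0, 1): ((0, 2), (2, 0)), (1, 2): ((0, 2), (2, 0))}
labels, value = minimize_f2(unary, pairwise)
brute = min(energy(x, unary, pairwise) for x in product((0, 1), repeat=3))
print(labels, value, brute)
```

The brute-force minimum is included only as a check on small inputs; integer costs are used so the flow computation needs no tolerance handling.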

Next…

Multi-label F2

E(x)=∑i Ei(xi) + ∑ij Eij(xi,xj) s.t. xi ϵ {1,…,L}

– Fusion moves: solving binary sub-problems

– Applications to stereo, stitching, segmentation…

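Fusion: current labeling + suggested labeling

“Alpha expansion”: the suggested labeling assigns the single label α to every pixel

Solve binary problem: xi=0 keeps the current label, xi=1 takes the suggested label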
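A fusion move can be sketched as follows. This is a toy illustration with several assumptions: the smoothness term is a Potts penalty `lam`, labels are numbered from 0, and the binary sub-problem is solved by brute-force enumeration as a stand-in for the graph-cut solver, so it only works for a handful of pixels. All names and numbers are hypothetical.

```python
from itertools import product

def energy(labels, unary, edges, lam):
    """Data term plus a Potts smoothness term with weight lam."""
    data = sum(unary[i][l] for i, l in enumerate(labels))
    smooth = sum(lam for i, j in edges if labels[i] != labels[j])
    return data + smooth

def fuse(current, proposal, unary, edges, lam):
    """Binary sub-problem: z_i = 0 keeps current[i], z_i = 1 takes proposal[i].
    Brute force here; a graph cut would replace this loop in practice."""
    best, best_e = list(current), energy(current, unary, edges, lam)
    for z in product((0, 1), repeat=len(current)):
        cand = [p if zi else c for c, p, zi in zip(current, proposal, z)]
        e = energy(cand, unary, edges, lam)
        if e < best_e:
            best, best_e = cand, e
    return best

def alpha_expansion(labels, num_labels, unary, edges, lam):
    """Repeatedly fuse with constant proposals until no label improves the energy."""
    labels = list(labels)
    improved = True
    while improved:
        improved = False
        for alpha in range(num_labels):
            fused = fuse(labels, [alpha] * len(labels), unary, edges, lam)
            if energy(fused, unary, edges, lam) < energy(labels, unary, edges, lam):
                labels, improved = fused, True
    return labels

# Hypothetical 4-pixel chain with 3 labels.
unary = [[0, 4, 4], [3, 1, 4], [4, 1, 3], [4, 4, 0]]
edges = [(0, 1), (1, 2), (2, 3)]
start = [0, 0, 0, 0]
result = alpha_expansion(start, 3, unary, edges, lam=1)
print(result, energy(result, unary, edges, lam=1))
```

Because z = 0 everywhere reproduces the current labeling, a fusion move never increases the energy, which is why the outer loop terminates.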

Stereo matching see http://vision.middlebury.edu/stereo/

[Figure panels: ground truth; pairwise MRF [Boykov et al. ‘01]]

slide by Carsten Rother, ICCV’09

Input:

Panoramic stitching

slide by Carsten Rother, ICCV’09

Panoramic stitching

slide by Pushmeet Kohli, ICCV’09

AutoCollage

http://research.microsoft.com/en-us/um/cambridge/projects/autocollage/ [Rother et. al. Siggraph ‘05 ]

Next…

Multi-label F2

E(x)=∑i Ei(xi) + ∑ij Eij(xi,xj) s.t. xi ϵ {1,…,L}

– Fusion moves: solving binary sub-problems

– Applications to stereo, stitching, segmentation…

Non-submodular

Beyond pair-wise interactions: F3

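Merging Regions

[Figure panels: input image; regions (Ncuts); “edge” prob., with a “weak” edge and a “strong” edge marked]

pi – prob. of boundary i being an edge

GOAL: Find labeling xi ϵ {0,1} that maximizes:

$\Pr(x) = \prod_i p_i^{x_i} (1 - p_i)^{1 - x_i}$

Taking -log:

$-\log \Pr(x) = -\sum_i \left[ x_i \log p_i + (1 - x_i) \log (1 - p_i) \right]$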

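Merging Regions

$-\sum_i \left[ x_i \log p_i + (1 - x_i) \log (1 - p_i) \right]$

Adding and subtracting the same number:

$= -\sum_i \log (1 - p_i) + \sum_i x_i \log \frac{1 - p_i}{p_i} = C + \sum_i w_i x_i$

$w_i := \log \frac{1 - p_i}{p_i}, \quad C := -\sum_i \log (1 - p_i)$

$w_i > 0 \iff p_i < \tfrac{1}{2}$: likely to be merged

$w_i < 0 \iff p_i > \tfrac{1}{2}$: likely to be an edge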
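The rewrite of the negative log-likelihood as a constant plus a linear term can be checked numerically. The sketch below assumes w_i = log((1 - p_i) / p_i) and C = -∑ log(1 - p_i); the probabilities are made-up values.

```python
import math
from itertools import product

def neg_log_likelihood(x, p):
    """-sum_i [ x_i log p_i + (1 - x_i) log(1 - p_i) ]"""
    return -sum(xi * math.log(pi) + (1 - xi) * math.log(1 - pi)
                for xi, pi in zip(x, p))

def linear_form(p):
    """Constant C and weights w_i such that -log Pr(x) = C + sum_i w_i x_i."""
    C = -sum(math.log(1 - pi) for pi in p)
    w = [math.log((1 - pi) / pi) for pi in p]
    return C, w

# Hypothetical edge probabilities: a strong edge, a weak edge, an undecided one.
p = [0.9, 0.2, 0.5]
C, w = linear_form(p)

for x in product((0, 1), repeat=len(p)):
    linear = C + sum(wi * xi for wi, xi in zip(w, x))
    assert math.isclose(neg_log_likelihood(x, p), linear, abs_tol=1e-12)

print([round(wi, 3) for wi in w])
```

The sign of w_i matches the reading on the slide: a boundary with p_i above 1/2 gets a negative weight, so keeping it as an edge lowers the energy.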

Merging Regions

Solving for edges:

$\arg\min_x \; C + \sum_i w_i x_i$

Consistency constraints: no “dangling” edge

x1  x2  x3  EJ
0   0   0   0
1   1   1   0
0   1   1   0
0   0   1   λ

No longer pair-wise:

Minimization trick

$-x_1 x_2 x_3 = \min_{z \in \{0,1\}} -z \, (x_1 + x_2 + x_3 - 2)$

Freedman D., Turek MW, Graph cuts with many pixel interactions: theory and applications to shape modeling. Image Vision Computing 2010
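One standard form of this reduction replaces a negative cubic term with a minimum over one auxiliary binary variable z: -x1·x2·x3 = min over z of -z·(x1 + x2 + x3 - 2). The exact form on the slide is an assumption here; the sketch below only verifies this identity by enumeration.

```python
from itertools import product

def cubic(x1, x2, x3):
    """The third-order term to be reduced."""
    return -x1 * x2 * x3

def reduced(x1, x2, x3):
    """Pairwise form: minimize over the auxiliary variable z in {0, 1}."""
    return min(-z * (x1 + x2 + x3 - 2) for z in (0, 1))

mismatches = [x for x in product((0, 1), repeat=3) if cubic(*x) != reduced(*x)]
print(mismatches)  # []
```

Each product z·x_i enters with a negative coefficient, so the terms introduced by this particular reduction are pairwise and submodular.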

Merging Regions

The resulting energy:

+ Pair-wise

- Non submodular!

[Equation: sum over junctions J = (l, m, n) of pairwise terms in xl, xm, xn and the auxiliary variable z]

Quadratic Pseudo-Boolean Optimization

Kolmogorov V., Rother C., Minimizing non-submodular functions with graph cuts – a review. PAMI’07

+ All edges with positive capacities

- No constraint

Labeling rule:

partial labeling

otherwise

Properties of partial labeling y:

1. Let z = FUSE(y, x); then E(z) ≤ E(x)

2. y is a subset of an optimal labeling y*

y is complete when:

1. E is submodular

2. There exists a flipping that makes E submodular (inference in trees)
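The FUSE operation in property 1 can be sketched as a simple overwrite, assuming it copies the labeled entries of y into x and leaves the rest of x alone. The encoding of unlabeled nodes as `None` and the helper name are hypothetical; the guarantee E(z) ≤ E(x) holds only when y comes from QPBO, which this snippet does not compute.

```python
def fuse_partial(y, x):
    """Take y_i where the partial labeling y is defined, x_i elsewhere."""
    return [xi if yi is None else yi for yi, xi in zip(y, x)]

# Hypothetical partial labeling (None marks an unlabeled node) and a full labeling.
y = [0, None, 1, None]
x = [1, 1, 0, 0]
z = fuse_partial(y, x)
print(z)  # [0, 1, 1, 0]
```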

QPBO:

[Figure: graph over nodes r, p, q, s, t. QPBO labeling: 0?????. Probing p = 0: 000??; probing p = 1: 0010?]

Probe Node p: 0, 1

What can we say about variables?

• r -> is always 0

• s -> is always equal to q

• t -> is 0 when q = 1

slide by Pushmeet Kohli, ICCV’09

QPBO - Probing

• Probe nodes in some order until the energy is unchanged

• Simplified energy preserves global optimality and (sometimes) gives the global minimum

slide by Pushmeet Kohli, ICCV’09

QPBO - Probing

Merging Regions

Result using QPBO-P:

[Figure panels: input image; regions (Ncuts); result]

• F3 and more

– Minimization trick

• Non submodular

– QPBO approx.

– partial labeling

Beyond F3…

[Kohli et. al. CVPR ‘07, ‘08, PAMI ’08, IJCV ‘09]

Image Segmentation

$E(X) = \sum_i c_i x_i + \sum_{i,j} d_{ij} \, |x_i - x_j|$

$E: \{0,1\}^n \to \mathbb{R}$

0 → fg, 1 → bg

n = number of pixels

[Boykov and Jolly ‘ 01] [Blake et al. ‘04] [Rother et al.`04]

[Figure panels: image; unary cost; segmentation]

Pn Potts Potentials

[Figure: patch dictionary (tree)]

$h(X_p) = \begin{cases} 0 & \text{if } x_i = 0 \;\; \forall i \in p \\ C_{\max} & \text{otherwise} \end{cases}$

[slide credits: Kohli]

Pn Potts Potentials

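$E(X) = \sum_i c_i x_i + \sum_{i,j} d_{ij} \, |x_i - x_j| + \sum_p h_p(X_p)$

$h(X_p) = \begin{cases} 0 & \text{if } x_i = 0 \;\; \forall i \in p \\ C_{\max} & \text{otherwise} \end{cases}$

$E: \{0,1\}^n \to \mathbb{R}$

0 → fg, 1 → bg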

Image Segmentation

$E(X) = \sum_i c_i x_i + \sum_{i,j} d_{ij} \, |x_i - x_j| + \sum_p h_p(X_p)$

[Figure panels: image; pairwise segmentation; final segmentation]

$E: \{0,1\}^n \to \mathbb{R}$

0 → fg, 1 → bg

Application: Recognition and Segmentation

from [Kohli et al. ‘08]

Unaries only: TextonBoost [Shotton et al. ‘06]

Pairwise CRF only [Shotton et al. ‘06]

Pn Potts

One super-pixelization

another super-pixelization

Robust (soft) Pn Potts model

$h(x_p) = \begin{cases} 0 & \text{if } x_i = 0 \;\; \forall i \in p \\ f\left(\sum_{i \in p} x_i\right) & \text{otherwise} \end{cases}$

from [Kohli et al. ‘08]

[Figure panels: robust Pn Potts; Pn Potts]

Application: Recognition and Segmentation

From [Kohli et al. ‘08]

Unaries only: TextonBoost [Shotton et al. ‘06]

Pairwise CRF only [Shotton et al. ‘06]

[Figure panels: Pn Potts; robust Pn Potts; robust Pn Potts (different f)]

One super-pixelization

another super-pixelization

Same idea for surface-based stereo [Bleyer ‘10]

One input image

Ground truth depth

Stereo with hard-segmentation

Stereo with robust Pn Potts

This approach gets best result on Middlebury Teddy image-pair:

How is it done…

Most general binary function:

H(X) = F(∑ xi), with F concave

[Figure: concave F plotted against ∑ xi]

The transformation is to a submodular pair-wise MRF, hence the optimization is globally optimal

Higher order to Quadratic

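• Start with Pn Potts model:

$f(x) = \begin{cases} 0 & \text{if all } x_i = 0 \\ C_1 & \text{otherwise} \end{cases}, \quad x \in \{0,1\}^n$

$\min_x f(x) = \min_{x,\; a \in \{0,1\}} \; C_1 a + C_1 (1 - a) \sum_i x_i$

Left: Higher Order Function. Right: Quadratic Submodular Function.

$\sum_i x_i = 0$: a = 0, f(x) = 0

$\sum_i x_i > 0$: a = 1, f(x) = C1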
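The equivalence between the Pn Potts term and its quadratic form can be verified by enumeration. A minimal sketch, assuming the quadratic form C1·a + C1·(1 - a)·∑xi with one auxiliary binary variable a; the clique size and the value of C1 are arbitrary choices.

```python
from itertools import product

C1 = 3  # hypothetical penalty
N = 4   # hypothetical clique size

def pn_potts(x):
    """0 if every variable in the clique is 0, C1 otherwise."""
    return 0 if sum(x) == 0 else C1

def quadratic(x, a):
    """Pairwise form with the auxiliary binary variable a."""
    return C1 * a + C1 * (1 - a) * sum(x)

agree = all(pn_potts(x) == min(quadratic(x, a) for a in (0, 1))
            for x in product((0, 1), repeat=N))
print(agree)  # True
```

Expanding the quadratic gives C1·a + C1·∑xi - C1·∑a·xi, so every pairwise product a·xi has a non-positive coefficient when C1 ≥ 0, which is what makes the form submodular.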

[Figure: the higher order submodular function plotted against ∑xi as the lower envelope of the line for a = 1 (constant C1) and the line for a = 0 (C1∑xi)]

Lower envelope of concave functions is concave

Summary

• Submodular F2

• F3 and beyond: minimization trick

• Non submodular – QPBO(P)

• Beyond F3 – Robust HOP
