jigsaws: joint appearance and shape clustering john winn with anitha kannan and carsten rother...

21
Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Upload: bruce-bryant

Post on 19-Jan-2016

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Jigsaws: joint appearance and shape clustering

John Winnwith Anitha Kannan and Carsten Rother

Microsoft Research, Cambridge

Page 2: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Patch models

Used for: Object recognition/detection Object segmentation

But also: Stereo matching, photo stitching Texture synthesis Super-resolution Motion segmentation Image/video compression

Page 3: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Patch models

Patch clustering/codebook (e.g. Leibe & Schiele)

Epitome (Jojic et al.)parameter sharing + translation invariant

Page 4: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Issues with fixed patch size/shape

Patch includes backgroundpatches containing the same object are not clustered together

Patch excludes part of objectpatch is less discriminative

Patch includes occlusionoccluded and unoccluded objects are not clustered together

Page 5: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Patch size?

Small(single pixel)

Large(entire image)

More discriminative

Less sharing

More sharing

Less discriminative

Optimal size/shape?

Depends on: •object size/shape•object variability•size of training set

Size

Page 6: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Aims of jigsaw model

Learn patches (jigsaw pieces) which are

1.Shared: each piece is similar in shape and appearance to many regions of the training images;

2.Discriminative: each piece is as large as possible;

3.Exhaustive: all parts of the training images can be reconstructed from the set of jigsaw pieces.

Page 7: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

The Jigsaw model

Image I1 Offset map L1

...Image I2 Offset map L2 Image IN Offset map LN

Jigsaw J

Page 8: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

The Jigsaw model

Jigsaw J

Image I1 Offset map L1

...Image I2 Offset map L2 Image IN Offset map LN

x

xxxxxP 1)(,)(|Normal),|( LL)I(LJI

)( λμ,

Page 9: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

The Jigsaw model

Jigsaw J

Image I1 Offset map L1

...Image I2 Offset map L2 Image IN Offset map LN

)( λμ,

Potts model:

Page 10: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Toy example

Training image Jigsaw

Learned using EM+ graph cuts

Page 11: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Dog example

Training image

3232 Jigsaw mean

Page 12: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Dog example

Reconstructed image

Learned segmentation

3232 Jigsaw mean

Epitome reconstruction

Page 13: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Faces example

128128 Jigsaw mean

100 6464 imagesSource: Olivetti face database

Page 14: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Learning the ‘pieces’

Image I1 Offset map L1

...Image I2 Offset map L2 Image IN Offset map LN

Jigsaw J

Page 15: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Learning the ‘pieces’

Image I1 Offset map L1

...Image I2 Offset map L2 Image IN Offset map LN

Jigsaw J

Page 16: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Faces example

Results of shape clustering on the face images

Page 17: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

64x64 jigsaw

Object recognition (preliminary) Trained set: 20 street images

Allow patches to deform (as in LayoutCRF, CVPR 2006).

Page 18: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Object recognition (preliminary) Trained set: 20 street images (10 labelled)

64x64 jigsaw

Accuracy improves (~1%) if you include an additional 10 unlabelled images when learning the jigsaw.

Allow patches to deform (as in LayoutCRF, CVPR 2006).

Page 19: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Work in progress…

Training larger jigsaws on 100s of images

Incorporating shape clustering into the probabilistic model

Learning additional invariances e.g. to illumination

Object recognition results on MSRC and other datasets

Page 20: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Conclusions Jigsaw model allows learning the shape

and appearance of objects or object parts in images. Can also handle occlusion.

Clustering shape and appearance much more powerful for recognition than appearance alone.

Can be used as a ‘plug-and-play’ replacement for fixed size patches in any existing patch-based system.

Page 21: Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge

Thank you

[email protected]

http://johnwinn.org