cs4670: computer vision - cornell university...crs baseline lig mrim-fusion (71 alcala avw alcala...

30
Lecture 29: Recent work in recognition CS4670: Computer Vision Noah Snavely

Upload: others

Post on 01-Mar-2021

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT

Lecture 29: Recent work in recognition

CS4670: Computer VisionNoah Snavely

Page 2: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT

Object recognition

• Category recognition has been the focus of extensive research in the past decade

• Extensive use and development of machine learning techniques, better features

• Moderate-scale datasets derived from the Web– PASCAL VOC: 20 object categories, > 10K images,

> 25K instances, hand-labeled ground truth, annual competitions

Page 3: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT

• Twenty object categories (aeroplane to TV/monitor)

• Three challenges:

– Classification challenge (is there an X in this image?)

– Detection challenge (draw a box around every X)

– Segmentation challenge

Page 4: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 5: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 6: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 7: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 8: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT

is

is there a cat?

Page 9: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 10: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 11: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 12: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 13: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 14: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 15: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 16: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 17: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 18: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT

Chance essentially 0

Page 19: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 20: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 21: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 22: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 23: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 24: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 25: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT
Page 26: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT

Best localization methods

• Sliding window-style classifiers

– SVM, Adaboost

– Flexible spatial template: “star model”

• Separate classifiers by viewpoint

• Use of context in classifiers

• Local features

– HoG, SIFT, local histograms of gradient orientations

Page 27: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT

HoG features

• Image partitioned into 8x8 blocks

• In each block, compute histogram of gradient orientations

Page 28: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT

Flexible Spatial Template (UofC-TTI)

• Hierarchical model [Felzenszwalb et al 2008]

– Coarse template for finding the root part

– Fine-scale templates connected by springs

– Learning automatically from labeled bounding boxes

• Separate models per viewpoint

Page 29: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT

Six-component car model

root filters (coarse) part filters (fine) deformation models

side view

frontal view

Page 30: CS4670: Computer Vision - Cornell University...CRS BASELINE LIG MRIM-FUSION (71 ALCALA AVW ALCALA LAVW CRS SOFT-EER (703) _e_ SOFT-BASELINE (700) _e— CIC3M GEN-DIS (69 g) LIG MRIM-COLORSIFT

Six-component person model