context-based vision system for place and object recognition

10
Context-based vision system for place and object recognition Torralba, Murphy, Freeman & Rubin (2003)

Upload: felice

Post on 04-Jan-2016

40 views

Category:

Documents


0 download

DESCRIPTION

Context-based vision system for place and object recognition. Torralba, Murphy, Freeman & Rubin (2003). Context is useful for object recognition. Local and global image representations. Local representation (L) Wavelet decomposition N = 24 (6 orientations, 4 scales) Global representation - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Context-based vision system for place and object recognition

Context-based vision system for place and object

recognition

Torralba, Murphy, Freeman & Rubin (2003)

Page 2: Context-based vision system for place and object recognition

Context is useful for object recognition

Page 3: Context-based vision system for place and object recognition

Local and global image representations

• Local representation (L)– Wavelet decomposition– – N = 24 (6 orientations, 4 scales)

• Global representation– Average across space and downsample to

M x M (M = 4)– PCA to reduce to 80 dimensions–

v tL (x) = v t,k (x){ }

k=1,N

v tG

Page 4: Context-based vision system for place and object recognition

Place recognition

• Transition matrix– Count transitions + Dirichlet smoothing

• Observation likelihood– Appearances: mixture of Gaussians– Uniform weights

Page 5: Context-based vision system for place and object recognition
Page 6: Context-based vision system for place and object recognition

Influence of HMM

Page 7: Context-based vision system for place and object recognition

Dealing with novel places

• Separately trained model for categories

Page 8: Context-based vision system for place and object recognition
Page 9: Context-based vision system for place and object recognition

Using context for object detection

• Estimate probability of object presence using global image features

(Objects independent given location and image features)

P(O |Qt ,v) = P(Oi |Qt ,v t )i

Page 10: Context-based vision system for place and object recognition

Context for object localization

• Coarse “expectation mask”