ankit agrawal jyoti meena. objectives categorization identification

13
Multiclass object recognition Ankit Agrawal Jyoti Meena

Upload: jeffry-higgins

Post on 11-Jan-2016

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Ankit Agrawal Jyoti Meena. Objectives Categorization Identification

Multiclass object recognition

Ankit AgrawalJyoti Meena

Page 2: Ankit Agrawal Jyoti Meena. Objectives Categorization Identification

ObjectivesCategorization

Identification

Page 3: Ankit Agrawal Jyoti Meena. Objectives Categorization Identification

IntroductionWe applied a biologically inspired model of visual

object recognition to the multiclass object categorization problem.

Our model based on Serre, Wolf, and Poggio.

We first applied Gabor filters at all positions and scales;

Different feature extraction at different levels{S1 C1 S2 C2 } are done by Algorithm of alternating template matching and max pooling operations.

Page 4: Ankit Agrawal Jyoti Meena. Objectives Categorization Identification

Analogy Between various regions And Computation functions

eye

Final object classifier(SVM)

Gabor Filters(S1,C1)

Complex feature filters(C2,S2)

Page 5: Ankit Agrawal Jyoti Meena. Objectives Categorization Identification

Different FiltersGabor filter is a linear filter used for edge

detection. Frequency and orientation representations of Gabor filter are similar to those of human visual system, and it has been found to be particularly appropriate for texture representation and discrimination.

Page 6: Ankit Agrawal Jyoti Meena. Objectives Categorization Identification

Layers based on ComplexitySimple (“S”) layers are the convolution of

local filters for computing higher-order features from different types of units in the previous layer.

Complex (“C”) layers increase invariance by pooling units. At the same time, the number of units is reduced by sub sampling. At this point, all position and scale information has been removed.

Page 7: Ankit Agrawal Jyoti Meena. Objectives Categorization Identification

Pyramidal Model1)Image layer. We convert the image to grayscale.

2)Scale the shorter edge to 140 pixels while maintaining the aspect ratio.

3)Next we create an image pyramid of 10 scales, each a factor of 2^0.25 smaller than the last (using bicubic interpolation).

4) Gabor filter (S1) layer. The S1 layer is computed from the image layer by centering 2D Gabor filters with a full range of orientations at each possible position and scale. Our base model follows and uses 4 orientations.

5)Local variance(C1) A C1 unit’s value is simply the value of the maximum S1 unit that falls within the max filter.

6) Intermediate feature (S2) layer. At every position andscale in the C1 layer, we perform template matches betweenthe patch of C1 units centered at that position and eachof d prototype patches. These prototype patches representthe intermediate-level features of the model.

7) Global invariance (C2) layer. Finally we create a d-dimensional vector, each element of which is the maximumresponse (anywhere in the image) to one of the model’sd prototype patches. At this point, all position and scaleinformation has been remove

.

Page 8: Ankit Agrawal Jyoti Meena. Objectives Categorization Identification

SVM CLASSIFIERSupport Vector Machine

Test images are assigned to categories using the majority-voting method.

Select features that are highly weighted by the SVM.

Many will be from the background , and others will have varying degrees of usefulness for the classification task.

We wanted to find out how many features were actually needed, and whether cutting out less useful features would improve performance, as we might expect from machine learning results on the value of sparsity

Page 9: Ankit Agrawal Jyoti Meena. Objectives Categorization Identification

ResultsWe have shown that a biologically based model can

compete with other state of the art approaches to object categorization

Strengthening the case for investigating biologically-motivated approaches to object recognition.

Biologically motivated algorithms also have the advantage of being susceptible to massive parallelization.

Localization in larger images takes longer; in both cases the bulk of the time is spent building feature vectors.

Page 10: Ankit Agrawal Jyoti Meena. Objectives Categorization Identification

LimitationsExtracted the features for different levels for

training set of two boat pictures but haven’t used the SVM classifier to classify the testing image due to some error in related C++ code.

Brain functions are highly non linear whereas we try to use linearity in computational functions for reducing the complexity .

There is a trade off between complexity and accuracy.

Page 11: Ankit Agrawal Jyoti Meena. Objectives Categorization Identification

ConclusionOur methods for increasing sparsity is motivated

by biological vision

Both biological and computer vision systems face the same computational constraints arising from the data.

We would expect computer vision research to benefit from the use of similar basis functions for describing images.

Page 12: Ankit Agrawal Jyoti Meena. Objectives Categorization Identification

Referenceshttp://www.mit.edu/~jmutch/papers/cvpr2006_mutch_lowe.pdf

http://cbcl.mit.edu/publications/object-detection-recognition.html

http://www.vision.caltech.edu/Image_Datasets/Caltech101/Caltech101.html

http://www.accessexcellence.org/AE/AEC/CC/vision_background.php

M. Riesenhuber and T. Poggio. Hierarchical models of object recognition in cortex. Nature Neuroscience, 2(11):1019– 1025, 1999.

Page 13: Ankit Agrawal Jyoti Meena. Objectives Categorization Identification

Thank you