Post on 20-Sep-2020

Page 1: Ubiquitous Fine-Grained Computer Visionskong2/img/fg_CV.pdf · Instantiation -- photo aesthetic ranking ... 2015. Holistic representation based method One method for this is called

Shu Kong

Department of Computer Science, UC Irvine

Ubiquitous Fine-Grained Computer Vision

1. Problem definition

2. Instantiation

3. Challenge and philosophy

4. Fine-grained classification with holistic representation

5. Fine-grained identification by matching local patches

6. Future work and conclusion

Outline

Fine-grained

• marginally different or subtle

• involving great attention to detail (Oxford dictionary)

• The devil is in the details!

• ...and everywhere!

Problem Definition

Fine-grained computer vision

• distinguish subordinate categories within an entry-level category

• tasks include classification, segmentation, specific case studies, etc.

Problem definition

Instantiation -- classification

Shu Kong, Charless Fowlkes, "Low-rank Bilinear Pooling for Fine-Grained Classification", arXiv:1611.05109, 2016

Instantiation -- identification

image from Surangi W. Punyasena

S. Kong, S. Punyasena, C. Fowlkes, "Spatially Aware Dictionary Learning and Coding for Fossil Pollen Identification", CVPR CVMI, 2016

modern pollen grain from glauca

fossil pollen grain from glauca

Instantiation -- segmentation

original image semantic segmentation

S. Kong, "Automated Biological Image Analysis using Computer Vision and Machine Learning", Janelia workshop, 2016

original image instance segmentation

Instantiation -- photo aesthetic ranking

S. Kong, X. Shen, Z. Lin, R. Mech, C. Fowlkes, "Photo Aesthetics Ranking Network with Attributes and Content Adaptation", ECCV, 2016

• lack of training data

– costly data collection and annotation

• large numbers of categories

– >14,000 birds

– >278,000 butterflies and moths

– >941,000 insects

• high intra-class vs. low inter-class variance

– example image labels: Caspian Tern, Caspian Tern, Elegant Tern

• philosophy

– finding discriminative parts, and matching them effectively

Challenge and philosophy

recognizing bird species from a photo

In the literature, a common approach is to detect keypoints/parts and stack them into a holistic representation

Holistic representation based method

Acadian_Flycatcher

Yellow_Billed_Cuckoo

Red_Winged_Blackbird

Yellow_Headed_Blackbird Pelagic_Cormorant

Brandt_Cormorant

picture from Wah et al, 2011

But this requires strongly supervised annotation, which is expensive to obtain.

Preferably, learn in a weakly supervised manner --

• based solely on category labels

• without any part annotations/masks.

One method for this is called bilinear pooling

compute second-order statistics of local features, and average them into a single holistic representation

The local features can be activations at hidden layers of a convolutional neural network (CNN)

Lin et al., Bilinear CNN models for fine-grained visual recognition, ICCV, 2015
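
The pooling step described above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation: the `(h, w, c)` layout, the function name `bilinear_pool`, and the random feature map are assumptions made for the example, and the post-processing used in practice (such as signed square root and L2 normalization) is left out.

```python
import numpy as np

def bilinear_pool(feature_map):
    """Average the outer products of local features over all spatial locations.

    feature_map: array of shape (h, w, c), e.g. activations of a CNN layer
    (the layout is an assumption for this sketch).
    Returns the flattened (c * c)-dimensional holistic representation.
    """
    h, w, c = feature_map.shape
    X = feature_map.reshape(h * w, c)   # one c-dimensional local feature per location
    B = X.T @ X / (h * w)               # (c, c) averaged second-order statistics
    return B.reshape(-1)

# Toy stand-in for CNN activations: a 7 x 7 feature map with 16 channels.
rng = np.random.default_rng(0)
fmap = rng.standard_normal((7, 7, 16))
z = bilinear_pool(fmap)
print(z.shape)
```

The representation grows quadratically with the number of channels, so a 512-channel layer would already give a 262,144-dimensional vector.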

Bilinear Pooling

[Figure: feature map of height h, width w, and c channels]

Holistic representation based method

Bilinear Pooling CNN -- training in an end-to-end manner

Lin et al., Bilinear CNN models for fine-grained visual recognition, ICCV, 2015

Low-rank Bilinear Pooling

linear SVM

linear SVM in matrix form

rank-r SVM

Low-rank SVM

Holistic representation based method

When bilinear SVM meets bilinear feature

Holistic representation based method

1. linear SVM

2. linear SVM in matrix

maximum Frobenius norm

no need to compute bilinear features when testing

with 200 classes, the parameter size is reduced from 200*512*512 to 200*512*8

Holistic representation based method
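
A small numerical check may help show why the bilinear feature does not have to be formed at test time. The sketch assumes the classifier matrix is parameterized as W = U_pos U_pos^T - U_neg U_neg^T, which is one reading of the low-rank parameterization in the cited arXiv:1611.05109; the names `U_pos` and `U_neg` and the toy sizes are illustrative assumptions. Under that assumption, the score tr(W^T X X^T) equals a difference of two squared Frobenius norms computed directly from the local features X.

```python
import numpy as np

rng = np.random.default_rng(0)
c, hw, r = 64, 7 * 7, 8                   # toy sizes: channels, locations, rank
X = rng.standard_normal((c, hw))          # local features, one column per location
U_pos = rng.standard_normal((c, r // 2))  # hypothetical factor for the positive part
U_neg = rng.standard_normal((c, r // 2))  # hypothetical factor for the negative part

# Explicit route: build the c x c classifier matrix and the c x c bilinear feature.
W = U_pos @ U_pos.T - U_neg @ U_neg.T
score_explicit = np.trace(W.T @ (X @ X.T))

# Low-rank route: only (r/2) x hw projections are needed, no c x c matrix.
# np.linalg.norm of a 2-D array defaults to the Frobenius norm.
score_lowrank = (np.linalg.norm(U_pos.T @ X) ** 2
                 - np.linalg.norm(U_neg.T @ X) ** 2)

print(np.isclose(score_explicit, score_lowrank))
```

The second route stores c*r numbers per class instead of c*c, which matches the 200*512*8 versus 200*512*512 comparison on the slide.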

explicitly computing bilinear features

more efficient, and useful when hw > m

Holistic representation based method

classifier co-decomposition -- learning a common factor and class-specific parameters of smaller size

Holistic representation based method

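[Figure: the classifier parameters are approximated by the product of a common factor and class-specific parameters]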

Studying the two hyperparameters

– low dimension m

– low rank r

Holistic representation based method

Studying the two hyperparameters -- m and r

with 200 classes, the parameter size is reduced from 200*512*512 (~5.2 x 10^7 single-precision values) to 200*8*100 + 100*512 (~2.1 x 10^5 single-precision values)

Holistic representation based method
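
The parameter counts quoted above can be reproduced with a few lines of arithmetic. The variable names are illustrative, and reading the 100 in the slide's expression as the reduced dimension m and the 8 as the rank r is an assumption based on the two hyperparameters named in this section.

```python
num_classes = 200   # number of classes, from the slide
c = 512             # feature channels, from the slide
r = 8               # low rank r (assumed to be the 8 in the slide's expression)
m = 100             # low dimension m (assumed to be the 100 in the slide's expression)

# Full bilinear classifier: one c x c matrix per class.
full_bilinear = num_classes * c * c

# Low-rank classifier: c x r parameters per class.
low_rank = num_classes * c * r

# Co-decomposition: one shared c x m factor plus m x r parameters per class.
co_decomposed = num_classes * r * m + m * c

print(full_bilinear, low_rank, co_decomposed)
print(round(full_bilinear / co_decomposed, 1))
```

Sharing the common factor across classes is what takes the count from 819,200 to 211,200, roughly a 248-fold reduction from the full bilinear classifier.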

Details on the complexity

Holistic representation based method

Quantitative evaluation on benchmark datasets

Holistic representation based method

Qualitative evaluation for understanding the model

– gradient map -- backpropagating error to the input image

– average activation map

– simplifying the input image by removing superpixels

Holistic representation based method

Conclusion

1. a more compact and powerful model by coupling bilinear classifier and bilinear feature for fine-grained classification

2. a new direction for weakly supervised visual learning

3. useful for learning interpretable attentions

Holistic representation based method

Patch-match based method

image from Surangi W. Punyasena

patch-match based approach for pollen grain identification

Problem: skilled experts trained for years have to identify pollen grains by eye

• Pollen grains are ubiquitous and well preserved in the fossil record

• Identification of pollen samples allows for analysis of plant biodiversity and evolution, understanding the history of long-term climate change, etc.

Why do we care about identifying pollen?

A specific dataset for this exploration

Patch-match based method

1. arbitrary viewpoint of the pollen grains

2. large intra-class and small inter-class variation

Why not holistic representation?

1. It is expensive to collect and annotate data.

2. There are not enough training data for a holistic representation.

Therefore, it's better to match local patches with geometric constraints.

Quantitative Result on Fossil Pollen

S. Kong, S. Punyasena, C. Fowlkes, "Spatially Aware Dictionary Learning and Coding for Fossil Pollen Identification", CVPR CVMI, 2016

our patch-match based method

The patch-match method requires the images to be aligned.

in-plane rotation viewpoint calibration

Perform k-medoids clustering on an affinity graph of the training set, where pairwise similarity is based on the Euclidean distance between pollen grain silhouettes.
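The clustering step can be sketched as follows. This is a minimal illustration, not the implementation behind the slides: it assumes each silhouette has already been converted to a fixed-length vector (for example a flattened binary mask), and the helper names `pairwise_euclidean` and `k_medoids` are hypothetical.

```python
import numpy as np

def pairwise_euclidean(x):
    """Return the (n, n) matrix of Euclidean distances between rows of x."""
    sq = np.sum(x ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (x @ x.T)
    return np.sqrt(np.maximum(d2, 0.0))  # clip tiny negatives from round-off

def k_medoids(dist, k, n_iter=100, seed=0):
    """Alternating k-medoids on a precomputed distance matrix."""
    rng = np.random.default_rng(seed)
    n = dist.shape[0]
    medoids = rng.choice(n, size=k, replace=False)
    for _ in range(n_iter):
        # Assign every sample to its nearest medoid.
        labels = np.argmin(dist[:, medoids], axis=1)
        new_medoids = medoids.copy()
        for c in range(k):
            members = np.flatnonzero(labels == c)
            if members.size == 0:
                continue  # keep the old medoid if a cluster is empty
            # The new medoid minimizes the total distance to its cluster.
            within = dist[np.ix_(members, members)].sum(axis=1)
            new_medoids[c] = members[np.argmin(within)]
        if np.array_equal(new_medoids, medoids):
            break
        medoids = new_medoids
    labels = np.argmin(dist[:, medoids], axis=1)
    return medoids, labels

# Toy "silhouette" vectors forming two viewpoint groups.
x = np.array([[0.0], [0.1], [0.2], [10.0], [10.1], [10.2]])
medoids, labels = k_medoids(pairwise_euclidean(x), k=2)
```

Each medoid is an actual training image, so it can serve as a canonical viewpoint against which other grains are aligned.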

our patch-match based method

patch exemplar selection

patch match by sparse coding

SVM

training stage

testing stage

Exemplar Selection

From a finite set of patches V, we would like to select M patches with the following properties:

1. representative in feature space

2. spatially distributed in input space

3. discriminative

4. class balance

5. cluster compactness

We index the selected patches by A.

discriminative patch selection

A brief digression

Maximizing the following set function is NP-hard.

A more general, well-known problem is the facility location problem, for example optimally placing sensors to monitor temperature.

example: representational power

Photo credit: Andreas Krause
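The set function itself did not survive in this transcript, so the sketch below assumes the standard facility-location objective F(A) = sum over i of max over j in A of sim[i, j], with nonnegative similarities. That objective is monotone submodular, and for such functions under a cardinality constraint the classical greedy algorithm is known to reach at least a (1 - 1/e) fraction of the optimum. The function names are hypothetical and this is not necessarily the exact procedure used in the talk.

```python
import numpy as np

def facility_location_value(sim, selected):
    """F(A): how well the selected columns 'cover' every item."""
    if len(selected) == 0:
        return 0.0
    return float(sim[:, selected].max(axis=1).sum())

def greedy_select(sim, m):
    """Greedily pick m exemplars; sim must be nonnegative, shape (n, n)."""
    n = sim.shape[0]
    selected = []
    best_cover = np.zeros(n)  # max similarity of each item to the chosen set
    for _ in range(m):
        # Marginal gain of adding each candidate column j.
        gains = np.maximum(sim, best_cover[:, None]).sum(axis=0) - best_cover.sum()
        if selected:
            gains[selected] = -np.inf  # never pick the same exemplar twice
        j = int(np.argmax(gains))
        selected.append(j)
        best_cover = np.maximum(best_cover, sim[:, j])
    return selected

# Toy example: two groups of 1-D "patch features".
x = np.array([0.0, 0.1, 0.2, 5.0, 5.1, 5.2])
sim = np.exp(-np.abs(x[:, None] - x[None, :]))
chosen = greedy_select(sim, m=2)
```

On this toy input the greedy picks land on the central point of each group, which is the "representative in feature space" behaviour the selection criteria ask for.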

Identification by patch-match sparse coding

1. Automatic patch exemplar selection (dictionary learning) based on discriminative and generative criteria

selected discriminative patches

Automatically selected patches

Identification by patch-match sparse coding

1. Automatic patch exemplar selection (dictionary learning)

2. Spatially-aware sparse coding (SACO): penalize dictionary elements from distant spatial locations

patch-match for identification

spatially aware coding (SACO)

Exemplar patches (dictionary)

Spatial weights

Test patch
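One way to read the three ingredients above (dictionary, spatial weights, test patch) is as a weighted L1 sparse-coding problem: minimize 0.5 * ||x - D z||^2 + lam * sum_j w_j * |z_j|, where w_j grows with the spatial distance between the test patch and dictionary element j. The sketch below solves that problem with ISTA. The objective, the weight formula `1 + dist / sigma`, and all names are assumptions made for illustration; the slides do not give the exact SACO formulation.

```python
import numpy as np

def soft_threshold(v, t):
    """Elementwise shrinkage; t may be a vector of per-coordinate thresholds."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def spatially_weighted_sparse_code(x, D, atom_xy, patch_xy,
                                   lam=0.1, sigma=1.0, n_iter=200):
    """x: (d,) test patch feature; D: (d, K) dictionary of exemplar patches;
    atom_xy: (K, 2) exemplar locations; patch_xy: (2,) test patch location."""
    # Hypothetical spatial weights: distant exemplars are penalized more.
    dist = np.linalg.norm(atom_xy - patch_xy, axis=1)
    w = 1.0 + dist / sigma
    # Step size 1/L, where L is the squared spectral norm of D.
    step = 1.0 / (np.linalg.norm(D, 2) ** 2)
    z = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ z - x)
        z = soft_threshold(z - step * grad, step * lam * w)
    return z

# Two identical exemplars at different locations; the test patch sits on the first.
D = np.array([[1.0, 1.0],
              [0.0, 0.0]])
atom_xy = np.array([[0.0, 0.0], [10.0, 0.0]])
z = spatially_weighted_sparse_code(np.array([1.0, 0.0]), D, atom_xy,
                                   patch_xy=np.array([0.0, 0.0]))
```

Although both exemplars match the test patch equally well in feature space, only the nearby one receives a nonzero code, which is the intended effect of the spatial penalty.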

SACO -- Faster Matching

feedforward shrinkage function by transforming dictionary patches into convolutional filters
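A rough sketch of this idea, under stated assumptions: each dictionary patch is used as a correlation filter over the whole image, and a single soft-threshold (shrinkage) replaces the iterative solver. The single-step approximation, the fixed threshold `theta`, and the function names are illustrative choices, and the spatial weighting is omitted for brevity.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def shrink(v, theta):
    """Soft-threshold (shrinkage) nonlinearity."""
    return np.sign(v) * np.maximum(np.abs(v) - theta, 0.0)

def feedforward_codes(image, dictionary, theta=0.1):
    """image: (H, W); dictionary: (K, p, p) patches used as filters.
    Returns approximate sparse codes of shape (K, H - p + 1, W - p + 1)."""
    k, p, _ = dictionary.shape
    # All p-by-p windows of the image: shape (H - p + 1, W - p + 1, p, p).
    windows = sliding_window_view(image, (p, p))
    # Correlate every window with every dictionary patch in one pass.
    responses = np.einsum("ijab,kab->kij", windows, dictionary)
    return shrink(responses, theta)

# Toy example: one impulse in the image, one impulse filter.
image = np.zeros((4, 4))
image[1, 1] = 1.0
dictionary = np.zeros((1, 2, 2))
dictionary[0, 0, 0] = 1.0
codes = feedforward_codes(image, dictionary)
```

Because the filter responses for all locations are computed in one pass, matching costs a single convolution-like operation per dictionary patch instead of an optimization per test patch.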

Quantitative Result on Fossil Pollen

Represent each patch using a CNN feature extractor (VGG19)

Global average pooling of the sparse codes produced by SACO

Linear SVM

Substantially outperforms standard CNN and Fisher-vector based approaches!

S. Kong, S. Punyasena, C. Fowlkes, "Spatially Aware Dictionary Learning and Coding for Fossil Pollen Identification", CVPR CVMI, 2016

quantitative result on modern pollen

We apply our approach to modern pollen grain identification.

Surangi W Punyasena, David K Tcheng, Cassandra Wesseln, Pietra G Mueller, Classifying black and white spruce pollen using layered machine learning, New Phytologist, 2012

Our method (rows: predicted, columns: actual)

             P. glauca   P. mariana
P. glauca    0.969       0.030
P. mariana   0.021       0.980

Identifying Fossil Pollen with Modern Reference

Fossil pollen grains are degraded over time.

We use patches from modern pollen references to identify fossilized grains.

modern pollen grain from glauca

fossil pollen grain from glauca

• Use our method to select patches from modern pollen grains

• Use the selected modern patches to identify fossil ones

• We achieve 69% accuracy with respect to expert labels.

modern fossil

1. Problem definition

2. Instantiation

3. Challenge and philosophy

4. Fine-grained classification with holistic representation

5. Fine-grained identification by matching local patches

6. Future work and conclusion

Outline
