Fully Convolutional Networks for Semantic Segmentation Jonathan Long, Evan Shelhamer, Trevor Darrell Presenter: Hannah Li


Page 1: Fully Convolutional Networks for Semantic Segmentation · vicente/recognition/2016/presentations/fcns.pdf

Fully Convolutional Networks for Semantic Segmentation

Jonathan Long, Evan Shelhamer, Trevor Darrell

Presenter: Hannah Li

Page 2:

Convolutional Networks

Fully connected layers have fixed dimensions and throw away spatial coordinates

Fully Convolutional Networks

Page 3:

Fully Convolutional Networks

x_{i,j} - input at location (i, j) in a particular layer
y_{i,j} - output at location (i, j) in the following layer
k - kernel (filter) size
s - stride
f_{ks} - determines the layer type:

- matrix multiplication for convolution or average pooling
- spatial max for max pooling
- elementwise nonlinearity for an activation function
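The equation these symbols belong to did not survive in the transcript. A hedged reconstruction from the definitions above is given here; the window offsets \delta i and \delta j are notation assumed for illustration, and the bound is written as a strict k-wide window:

```latex
y_{i,j} = f_{ks}\left( \left\{ x_{si + \delta i,\; sj + \delta j} \right\}_{0 \le \delta i,\, \delta j < k} \right)
```

Read this as: each output location (i, j) depends only on a k x k window of the input, and consecutive windows are s locations apart.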

Stochastic gradient descent on the loss of a whole image is equivalent to stochastic gradient descent that takes all of the final layer's receptive fields as a minibatch

Page 4:

Output dimensions are reduced by subsampling to keep filters small and computational requirements reasonable

However, we need dense, per-pixel predictions…

Page 5:

Solution: Deconvolution/Upsampling

[Figure: two panels, stride = 1 and stride = 2. Source: https://github.com/vdumoulin/conv_arithmetic]
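To make the stride = 1 versus stride = 2 comparison concrete, here is a minimal 1-D sketch of deconvolution (transposed convolution). The function name `transposed_conv1d` and the all-ones kernel are illustrative assumptions, not taken from the slides; no padding or cropping is applied.

```python
import numpy as np

def transposed_conv1d(x, kernel, stride):
    """Each input value scatters a scaled copy of the kernel into the output.

    Output length is (len(x) - 1) * stride + len(kernel), so a stride
    greater than 1 upsamples the signal.
    """
    x = np.asarray(x, dtype=float)
    kernel = np.asarray(kernel, dtype=float)
    out = np.zeros((len(x) - 1) * stride + len(kernel))
    for i, value in enumerate(x):
        start = i * stride
        # Overlapping contributions are summed.
        out[start:start + len(kernel)] += value * kernel
    return out

signal = [1.0, 2.0, 3.0]
kernel = [1.0, 1.0, 1.0]   # hypothetical kernel chosen for readability

same_rate = transposed_conv1d(signal, kernel, stride=1)
upsampled = transposed_conv1d(signal, kernel, stride=2)
print(same_rate)   # 5 values
print(upsampled)   # 7 values
```

With stride 2 the copies of the kernel are placed two positions apart, which is what spreads a coarse map over a denser output grid.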

Page 6:

http://tutorial.caffe.berkeleyvision.org/caffe-cvpr15-pixels.pdf

Deconvolution

Bilinear interpolation
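The two labels are related: a deconvolution layer whose kernel is fixed to a bilinear filter performs bilinear interpolation, and such a kernel is a common starting point before the layer is learned. The sketch below builds that kernel; the function name `bilinear_kernel` is an assumption for illustration.

```python
import numpy as np

def bilinear_kernel(size):
    """Return a (size, size) bilinear interpolation kernel.

    The weights fall off linearly with distance from the kernel centre,
    separately along rows and columns.
    """
    factor = (size + 1) // 2
    # Odd kernels are centred on a sample, even kernels between two samples.
    center = factor - 1 if size % 2 == 1 else factor - 0.5
    rows, cols = np.ogrid[:size, :size]
    return (1 - np.abs(rows - center) / factor) * (1 - np.abs(cols - center) / factor)

kernel = bilinear_kernel(4)   # a size suited to 2x upsampling
print(kernel)
```

Used as the kernel of a stride-2 deconvolution, the 4 x 4 filter above sums to 4, so each output pixel receives a total weight of 1 from the coarse map.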

Page 7:

From classifier to dense FCN

1. Discard the final classifier layer
2. Convert all fully connected layers to convolutions
3. Append a convolution with channel dimension 21 to predict scores for the PASCAL classes
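Step 2 can be sketched as follows: a fully connected layer is reshaped into convolution kernels that cover its whole input region, which gives the same scores on the original input size and a spatial score map on larger inputs. The layer sizes and the helper `conv_valid` are assumptions for illustration, not the presenter's code.

```python
import numpy as np

rng = np.random.default_rng(0)
channels, height, width, outputs = 3, 4, 4, 5   # hypothetical layer sizes

# A fully connected layer acting on a flattened (channels, height, width) input.
fc_weights = rng.standard_normal((outputs, channels * height * width))

# The same weights viewed as `outputs` convolution kernels.
conv_kernels = fc_weights.reshape(outputs, channels, height, width)

def conv_valid(image, kernels):
    """Plain 'valid' cross-correlation of a (C, H, W) image with (O, C, kh, kw) kernels."""
    out_c, _, kh, kw = kernels.shape
    _, h, w = image.shape
    result = np.zeros((out_c, h - kh + 1, w - kw + 1))
    for i in range(h - kh + 1):
        for j in range(w - kw + 1):
            patch = image[:, i:i + kh, j:j + kw]
            result[:, i, j] = np.tensordot(kernels, patch, axes=3)
    return result

# On an input of the original size both forms give the same scores.
patch = rng.standard_normal((channels, height, width))
fc_scores = fc_weights @ patch.reshape(-1)
conv_scores = conv_valid(patch, conv_kernels)    # shape (5, 1, 1)

# On a larger input the convolutional form yields a spatial map of scores.
larger = rng.standard_normal((channels, 6, 7))
score_map = conv_valid(larger, conv_kernels)     # shape (5, 3, 4)
print(conv_scores.shape, score_map.shape)
```

Each location of `score_map` equals the fully connected layer applied to the corresponding 4 x 4 window, computed in one pass.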

Page 8:

Page 9:

Page 10:

FCN for segmentation

Problem: the final prediction layer alone is not detailed enough

Technique: combine the final prediction layer with lower layers that have finer strides

Combining fine layers and coarse layers lets the model make local predictions that respect global structure.
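The combination of coarse and fine layers can be sketched as an upsample-and-add step on class score maps. This is a simplified illustration: nearest-neighbour upsampling is swapped in for the deconvolution layer, and the map sizes and values are made up.

```python
import numpy as np

def upsample2x_nearest(scores):
    """Double the spatial size of a (classes, H, W) score map.

    Nearest-neighbour repetition stands in for a deconvolution layer here.
    """
    return scores.repeat(2, axis=1).repeat(2, axis=2)

def fuse(coarse_scores, fine_scores):
    """Add upsampled coarse scores to scores predicted from a finer layer."""
    return upsample2x_nearest(coarse_scores) + fine_scores

num_classes = 21   # channel dimension used for the PASCAL classes

# Hypothetical scores: the coarse layer votes for class 3 everywhere,
# while the fine layer has strong evidence for class 7 at one pixel.
coarse = np.zeros((num_classes, 2, 2))
coarse[3] = 1.0
fine = np.zeros((num_classes, 4, 4))
fine[7, 0, 0] = 2.0

fused = fuse(coarse, fine)
labels = fused.argmax(axis=0)   # per-pixel class prediction
print(labels)
```

The coarse map supplies the overall label while the finer map can override it locally, which is the sense in which local predictions respect global structure.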

Page 11:

Results

20% relative improvement in mean IoU

286 times faster at inference than SDS*

* SDS: Simultaneous Detection and Segmentation, ECCV 2014
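For reference, mean IoU averages the per-class intersection over union of predicted and ground-truth pixels. The sketch below is one way to compute it from a confusion matrix; skipping classes that appear in neither map is an assumption made here, and the tiny label maps are made up.

```python
import numpy as np

def mean_iou(prediction, target, num_classes):
    """Mean intersection over union across classes.

    prediction and target are integer label maps of the same shape.
    """
    prediction = np.asarray(prediction).ravel()
    target = np.asarray(target).ravel()
    # confusion[t, p] counts pixels of true class t predicted as class p.
    confusion = np.bincount(
        target * num_classes + prediction, minlength=num_classes ** 2
    ).reshape(num_classes, num_classes)
    intersection = np.diag(confusion)
    union = confusion.sum(axis=0) + confusion.sum(axis=1) - intersection
    present = union > 0   # assumption: ignore classes absent from both maps
    return (intersection[present] / union[present]).mean()

target = np.array([[0, 0],
                   [1, 1]])
prediction = np.array([[0, 1],
                       [1, 1]])
score = mean_iou(prediction, target, num_classes=2)
print(score)
```

In this toy case class 0 has IoU 1/2 and class 1 has IoU 2/3, so the mean is 7/12.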

Page 12:

Conclusions

• “Fully convolutional” networks take inputs of any size

• Adapt AlexNet, VGG net, and GoogLeNet into fully convolutional networks

• Fine-tune them to the segmentation task

• New architecture: combines semantic information from a deep, coarse layer with appearance information from a shallow, fine layer

• 20% relative improvement to 62.2% mean IU (PASCAL VOC 2012)

Page 13:

Extras

Page 14:

Fully Convolutional Networks

x_{i,j} - input at location (i, j) in a particular layer
y_{i,j} - output at location (i, j) in the following layer
k - kernel (filter) size
s - stride
f_{ks} - determines the layer type:

- matrix multiplication for convolution or average pooling
- spatial max for max pooling
- elementwise nonlinearity for an activation function

- This functional form is maintained under composition, with kernel size and stride obeying a transformation rule
- A network built only from such layers computes a nonlinear filter (as opposed to a general nonlinear function)

FCN naturally operates on an input of any size!
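The transformation rule itself is missing from the transcript. In the FCN paper it reads: a layer with kernel k' and stride s' followed by a layer with kernel k and stride s behaves like a single layer with kernel k' + (k - 1)s' and stride s·s'. The sketch below applies that rule to a stack of layers; the function names and the example stack are assumptions for illustration.

```python
from functools import reduce

def compose(first, second):
    """Combine two layers given as (kernel, stride) pairs.

    `first` is applied to the input, `second` to the output of `first`.
    """
    k_first, s_first = first
    k_second, s_second = second
    return (k_first + (k_second - 1) * s_first, s_first * s_second)

def effective_kernel_and_stride(layers):
    """Fold a list of (kernel, stride) layers into one equivalent pair."""
    return reduce(compose, layers, (1, 1))   # (1, 1) is the identity layer

# Hypothetical stack: two 3x3 convolutions, then 2x2 pooling with stride 2.
stack = [(3, 1), (3, 1), (2, 2)]
result = effective_kernel_and_stride(stack)
print(result)
```

Because the composed form is again a kernel and a stride, the whole network acts as one large filter slid across the input, which is why it accepts inputs of any size.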

Page 15:

Loss Function

• Loss function is a sum over the spatial dimension of the final layer

• Gradient of loss function is a sum over the gradients of each of its spatial components

• Takes all of the final layer receptive fields as a minibatch

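The receptive fields overlap significantly -> computing layer by layer over the whole image is more efficient than patch by patch!

More than 5 times faster than running AlexNet naively on each patch

Spatial output maps make these networks a natural choice for dense problems like semantic segmentation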
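The loss bullets on this slide can be illustrated with a per-pixel softmax cross-entropy summed over the output map. The choice of cross-entropy and the function names are assumptions for this sketch; the slide only states that the loss is a sum over spatial positions.

```python
import numpy as np

def softmax(scores):
    """Softmax over the class axis of a (classes, H, W) score map."""
    shifted = scores - scores.max(axis=0, keepdims=True)   # numerical stability
    exp = np.exp(shifted)
    return exp / exp.sum(axis=0, keepdims=True)

def pixel_losses(scores, labels):
    """Cross-entropy at every pixel; labels is an (H, W) integer map."""
    rows, cols = np.indices(labels.shape)
    return -np.log(softmax(scores)[labels, rows, cols])

def image_loss(scores, labels):
    """Whole-image loss: the sum of the per-pixel losses."""
    return pixel_losses(scores, labels).sum()

def image_loss_gradient(scores, labels):
    """Gradient of image_loss with respect to scores.

    Each pixel's gradient depends only on that pixel's scores, so the
    total gradient is the sum of the per-pixel gradients.
    """
    grad = softmax(scores)
    rows, cols = np.indices(labels.shape)
    grad[labels, rows, cols] -= 1.0
    return grad

scores = np.zeros((3, 2, 2))            # hypothetical 3-class, 2x2 output
labels = np.array([[0, 1],
                   [2, 0]])
total = image_loss(scores, labels)
grad = image_loss_gradient(scores, labels)
print(total, grad.shape)
```

Treating every output pixel as one training example is what makes a single image behave like a minibatch of overlapping receptive fields.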
