
Morphological Multi-scale Decomposition and efficient representations with Auto-Encoders

April 23rd - September 28th

Supervisors
MVA supervisor:

Bastien PONCHON - Internship Defense, September 21st, 2018

Agenda

01. Introduction

02. Part-based Representation using Non-Negative Matrix Factorization

03. Part-based Representation using Auto-Encoders

04. Using a Deeper Architecture

05. Conclusion


01 - Introduction


Representation Learning and Part-Based Representation

[Figure: an image is mapped to a representation (latent features), expressed as a combination of atom images.]

A quick recap of flat mathematical morphology

Dilation of an image $f$ by a flat structuring element $B$:

$$\delta_B(f)(x) = \sup_{b \in B} f(x - b)$$

The dilation commutes with the supremum: $\delta_B(f \vee g) = \delta_B(f) \vee \delta_B(g)$.

Erosion of an image $f$ by a flat structuring element $B$:

$$\varepsilon_B(f)(x) = \inf_{b \in B} f(x + b)$$

The erosion commutes with the infimum.
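A minimal sketch (not from the slides, using SciPy's grey_dilation) that checks the commutation of flat dilation with the supremum numerically:

```python
# Flat dilation commutes with the pixel-wise supremum.
import numpy as np
from scipy.ndimage import grey_dilation

rng = np.random.default_rng(0)
f = rng.random((32, 32))
g = rng.random((32, 32))

# 3x3 square structuring element, given as a flat footprint.
footprint = np.ones((3, 3), dtype=bool)

lhs = grey_dilation(np.maximum(f, g), footprint=footprint)   # dilation of the supremum
rhs = np.maximum(grey_dilation(f, footprint=footprint),
                 grey_dilation(g, footprint=footprint))      # supremum of the dilations

assert np.allclose(lhs, rhs)  # the two sides coincide exactly
```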

Max-Approximation to Morphological Operators
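The formulas on this slide did not survive extraction; the idea, reconstructed from the commutation property above and the "Dilated" Decoder of Part 03 (an interpretation, not a verbatim restoration):

$$\delta_B\Big(\bigvee_k f_k\Big) = \bigvee_k \delta_B(f_k)$$

If an image is well approximated by a combination of non-negative parts, an approximation of its dilation can therefore be obtained by recombining the dilated atoms: the encoding is left unchanged and fed to a decoder whose atom images have been dilated.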


Motivation for Non-Negative and Sparse Representations


Objectives and Motivations of the Internship

○ Universal approximation theorem

Evaluation and Data of the Proposed Models

○ Approximation error of the representation

○ Max-approximation error to the dilation

○ Sparsity of the encoding

○ Classification accuracy

02 - Non-Negative Matrix Factorization


02 - General Presentation


○ Matrix factorization algorithm: $V \approx W H$, where $V$ is the data matrix, $W$ the dictionary matrix, and $H$ the encoding matrix, all with non-negative entries.
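A minimal sketch of such a factorization (assuming scikit-learn's NMF and the digits dataset; neither the internship's data nor its hyper-parameters):

```python
# Non-negative matrix factorization of a stack of small images.
from sklearn.datasets import load_digits
from sklearn.decomposition import NMF

X, _ = load_digits(return_X_y=True)           # one flattened 8x8 image per row
model = NMF(n_components=16, init="nndsvd", max_iter=500, random_state=0)
codes = model.fit_transform(X)                # non-negative encodings, one per image
atoms = model.components_                     # non-negative atom images, one per row

X_hat = codes @ atoms                         # reconstruction of the data matrix
print("Frobenius reconstruction error:", model.reconstruction_err_)
```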

○ Sufficient condition for a part-based decomposition: the data form a separable factorial articulation family (Donoho and Stodden, 2003)

02 - Addition of Sparsity Constraints (Hoyer, 2004)

Sparseness measure of a vector $x \in \mathbb{R}^n$ (Hoyer, 2004):

$$\mathrm{sparseness}(x) = \frac{\sqrt{n} - \left(\sum_i |x_i|\right) \big/ \sqrt{\sum_i x_i^2}}{\sqrt{n} - 1}$$

After each update of $W$ and $H$ in the NMF algorithm, the encodings and atoms are projected onto the set of vectors whose sparseness equals the target level $S_h$.
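A minimal sketch of this sparseness measure in NumPy; it ranges from 0 for a maximally dense vector to 1 for a vector with a single non-zero entry:

```python
import numpy as np

def hoyer_sparseness(x) -> float:
    """Hoyer (2004) sparseness of a non-zero vector, in [0, 1]."""
    x = np.ravel(np.asarray(x, dtype=float))
    n = x.size
    l1 = np.abs(x).sum()
    l2 = np.sqrt((x ** 2).sum())
    return (np.sqrt(n) - l1 / l2) / (np.sqrt(n) - 1)

print(hoyer_sparseness([0, 0, 1, 0]))  # 1.0: maximally sparse
print(hoyer_sparseness([1, 1, 1, 1]))  # 0.0: maximally dense
```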

02 - Results - $S_h = 0.6$

[Figure: original images and their reconstructions - reconstruction error: 0.0109]

[Figure: histogram of the encodings - sparsity metric: 0.650]

[Figure: atom images of the representation]

02 - Results - Max-Approximation to Dilation

[Figure: dilation of the original images by a disk of radius 1]

[Figure: max-approximation to the dilation by a disk of radius 1]

03 - Part-Based Representation using Auto-Encoders


The auto-encoder loss function, minimized during training, is the reconstruction error:

$$\mathcal{L}(\theta) = \sum_{x \in \mathcal{X}} \left\| x - d_\theta(e_\theta(x)) \right\|_2^2$$

where $e_\theta$ is the encoder and $d_\theta$ the decoder.

03 - Shallow Auto-Encoders

[Figure: shallow auto-encoder. Input image → encoder → latent representation → decoder → reconstruction; a parallel "dilated" decoder produces the max-approximation.]

The rows of the decoder weight matrix are the atom images of the learned representation!
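A minimal PyTorch sketch of this shallow architecture (sizes and activations are assumptions for illustration, not the internship's code):

```python
import torch
import torch.nn as nn

n_pixels, n_atoms = 28 * 28, 100

encoder = nn.Sequential(nn.Linear(n_pixels, n_atoms), nn.Sigmoid())
decoder = nn.Linear(n_atoms, n_pixels, bias=False)   # single linear layer

x = torch.rand(64, n_pixels)                 # a batch of flattened images
h = encoder(x)                               # latent representation (encoding)
x_hat = decoder(h)                           # reconstruction

loss = ((x - x_hat) ** 2).sum(dim=1).mean()  # reconstruction loss
atoms = decoder.weight.T                     # one atom image per row (n_atoms x n_pixels)
```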

03 - Enforcing the Sparsity of the Encoding

Regularization of the auto-encoder: a sparsity penalty $\Omega$ on the encoding $h$ is added to the reconstruction loss, weighted by $\beta$:

$$\mathcal{L}(\theta) = \sum_{x \in \mathcal{X}} \left\| x - d_\theta(e_\theta(x)) \right\|_2^2 + \beta \, \Omega(h)$$

Various choices exist for the sparsity-regularization function $\Omega$; one is the KL divergence between the expected activation $\hat\rho_j$ of each hidden unit and a fixed level $\rho$:

$$\Omega = \sum_j \mathrm{KL}\left(\rho \,\middle\|\, \hat\rho_j\right)$$
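A minimal sketch of the KL-divergence option (assuming sigmoid activations in [0, 1]; p is the target activation level and beta the penalty weight, matching the p/beta values on the results slides):

```python
import torch

def kl_sparsity(h: torch.Tensor, p: float) -> torch.Tensor:
    """Sum over hidden units of KL(p || rho_hat_j) for a batch of encodings."""
    rho_hat = h.mean(dim=0).clamp(1e-6, 1 - 1e-6)  # expected activation per unit
    return (p * torch.log(p / rho_hat)
            + (1 - p) * torch.log((1 - p) / (1 - rho_hat))).sum()

# total loss = reconstruction_loss + beta * kl_sparsity(h, p)
```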

03 - Enforcing Non-Negativity of the Atoms of the Dictionary

Two common approaches, among them:

○ A stronger decay of the negative weights
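A minimal sketch of what a stronger decay of the negative weights can look like (an illustration; the two decay coefficients are arbitrary):

```python
import torch

def asymmetric_decay(w: torch.Tensor,
                     lam_pos: float = 1e-5,
                     lam_neg: float = 1e-2) -> torch.Tensor:
    """Quadratic weight decay that penalizes negative entries much more."""
    pos = torch.clamp(w, min=0.0)
    neg = torch.clamp(w, max=0.0)
    return lam_pos * (pos ** 2).sum() + lam_neg * (neg ** 2).sum()

# added to the training loss, this drives the decoder atoms towards non-negativity
```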

03 - Results - Reconstructions

[Figure: original images and their reconstructions for: no constraint; p=0.05, beta=0.001; p=0.01, beta=0.005; p=0.2, beta=0.001; p=0.1, beta=0.01]

03 - Results - Encodings

[Figure: original images and their encodings for: no constraint; p=0.05, beta=0.001; p=0.01, beta=0.005; p=0.2, beta=0.001; p=0.1, beta=0.01]

03 - Results - Atoms

[Figure: atom images for: no constraint; p=0.01, beta=0.005; p=0.1, beta=0.01]

03 - Results - Max-Approximations to Dilation

[Figure: original images and max-approximations to their dilation for: no constraint; p=0.05, beta=0.001; p=0.01, beta=0.005; p=0.2, beta=0.001; p=0.01, beta=0.01]


04 - Using a Deeper Architecture


04 - An Asymmetric Auto-Encoder

Motivations:

[Figure: asymmetric auto-encoder. Input image → encoder (InfoGAN architecture) → latent representation → decoder → reconstruction; a parallel "dilated" decoder produces the max-approximation.]

"InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets", Chen et al., 2016
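A minimal sketch of the asymmetry (a small convolutional encoder stands in for the InfoGAN-style network; the decoder remains a single linear layer so that its weights stay readable atom images):

```python
import torch
import torch.nn as nn

n_atoms = 100

encoder = nn.Sequential(                            # deep, non-linear encoder
    nn.Conv2d(1, 32, 4, stride=2, padding=1), nn.ReLU(),
    nn.Conv2d(32, 64, 4, stride=2, padding=1), nn.ReLU(),
    nn.Flatten(),
    nn.Linear(64 * 7 * 7, n_atoms), nn.Sigmoid(),
)
decoder = nn.Linear(n_atoms, 28 * 28, bias=False)   # shallow, linear decoder

x = torch.rand(64, 1, 28, 28)
h = encoder(x)                                      # encoding
x_hat = decoder(h).view(-1, 1, 28, 28)              # reconstruction
```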

04 - Results - Reconstructions

[Figure: reconstructions for: no constraint; p=0.05, beta=0.005; p=0.01, beta=0.01]

04 - Results - Encodings

[Figure: encodings for: no constraint; p=0.05, beta=0.005; p=0.01, beta=0.01]

04 - Results - Atoms

[Figure: atom images for: no constraint; p=0.05, beta=0.005; p=0.01, beta=0.01]

04 - Results - Max-Approximations to Dilation

[Figure: max-approximations to the dilation for: no constraint; p=0.05, beta=0.005; p=0.01, beta=0.01]

06 - Conclusion and Future Work


06 - Conclusion - Reconstructions

[Figure: original images and their reconstructions by: NMF with sparsity constraint $S_h = 0.6$; the sparse, non-negative shallow AE with p=0.05, beta=0.001; the sparse, non-negative asymmetric AE with p=0.05, beta=0.005]

06 - Conclusion - Encodings

[Figure: encodings produced by: NMF with sparsity constraint $S_h = 0.6$; the sparse, non-negative shallow AE with p=0.05, beta=0.001; the sparse, non-negative asymmetric AE with p=0.05, beta=0.005]

06 - Conclusion - Atoms

[Figure: atom images learned by: NMF with sparsity constraint $S_h = 0.6$; the sparse, non-negative shallow AE with p=0.05, beta=0.001; the sparse, non-negative asymmetric AE with p=0.05, beta=0.005]

06 - Conclusion - Max-Approximations to Dilation

[Figure: dilation of the original images, and the max-approximations by: NMF with sparsity constraint $S_h = 0.6$; the sparse, non-negative shallow AE with p=0.05, beta=0.001; the sparse, non-negative asymmetric AE with p=0.05, beta=0.005]

06 - Conclusion and Possible Improvements


05 - Multi-Scale Morphological Decompositions


05 - Additive Morphological Decomposition


One of the considered morphological decompositions (one standard form is sketched below):
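A standard form of such an additive decomposition, consistent with the next slide's title, uses the residues between openings $\gamma_{\lambda_i}$ at increasing scales $\lambda_1 < \dots < \lambda_n$ (a reconstruction, since the slide's bullets were lost):

$$x = \gamma_{\lambda_n}(x) + \sum_{i=1}^{n} r_i(x), \qquad r_i(x) = \gamma_{\lambda_{i-1}}(x) - \gamma_{\lambda_i}(x), \quad \gamma_{\lambda_0} = \mathrm{id}$$

Since openings are anti-extensive and decreasing with scale, every term of the decomposition is non-negative.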

Positive Additive Decomposition Using Openings by Reconstruction05 -
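A minimal sketch of an opening by reconstruction and its positive residue (assuming scikit-image; not the internship's exact pipeline):

```python
# Opening by reconstruction: erode, then reconstruct by dilation under the
# original image; the residue is non-negative by anti-extensivity.
import numpy as np
from skimage.morphology import erosion, disk, reconstruction

def opening_by_reconstruction(image: np.ndarray, radius: int) -> np.ndarray:
    seed = erosion(image, disk(radius))          # marker, always <= image
    return reconstruction(seed, image, method="dilation")

image = np.random.rand(64, 64)
opened = opening_by_reconstruction(image, radius=2)
residue = image - opened                         # one positive term of the decomposition
assert (residue >= -1e-9).all()
```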


05 - Results

