
Structure Learning with Deep Neuronal Networks
6th Network Modeling Workshop, 6/6/2013
Patrick Michl

Agenda

Autoencoders

Biological Model

Validation & Implementation


Autoencoders

Real-world data is usually high dimensional …

[Figure: dataset in (x1, x2) and the corresponding model]

… which makes structural analysis and modeling complicated!

Dimensionality reduction techniques like PCA …

… cannot preserve complex structures!

[Figure: PCA reduces the dataset to a linear model, x2 = α·x1 + β]

Therefore the analysis of unknown structures …

… needs more sophisticated nonlinear techniques!

[Figure: a nonlinear model, x2 = f(x1)]
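As an illustration of this point (not part of the original slides), the following Python sketch builds a toy two dimensional dataset with a nonlinear relationship x2 = f(x1) and projects it onto its first principal component; numpy and scikit-learn are assumed, and the data are purely hypothetical. The linear PCA reconstruction can only express x2 ≈ α·x1 + β and therefore loses the curved structure.

```python
import numpy as np
from sklearn.decomposition import PCA

# Toy dataset: x2 depends nonlinearly on x1 (a curved 1-D structure in 2-D).
rng = np.random.default_rng(0)
x1 = rng.uniform(-2.0, 2.0, size=500)
x2 = np.sin(x1) + 0.05 * rng.normal(size=500)   # x2 = f(x1) + noise
X = np.column_stack([x1, x2])

# PCA can only fit a linear subspace: x2 ~ alpha * x1 + beta.
pca = PCA(n_components=1)
code = pca.fit_transform(X)            # one dimensional linear code
X_linear = pca.inverse_transform(code)

# The reconstruction error shows how much of the curved structure is lost.
print("PCA reconstruction error:", np.mean((X - X_linear) ** 2))
```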

Autoencoders are artificial neuronal networks …
… with multiple hidden layers.

Autoencoder
• Artificial neuronal network
• Multiple hidden layers

[Figure: network mapping input data X (visible layer) through hidden layers to output data X' (visible layer); perceptrons take values in {0, 1}, Gaussian units take values in R]

Such networks are called deep networks.

Definition (deep network)
Deep networks are artificial neuronal networks with multiple hidden layers.

Autoencoders have a symmetric topology …
… with an odd number of hidden layers.

The small layer in the center works like an information bottleneck …
… that creates a low dimensional code for each sample in the input data.

The upper stack does the encoding …
… and the lower stack does the decoding.

Autoencoder
• Deep network
• Symmetric topology
• Information bottleneck
• Encoder
• Decoder

Definition (autoencoder)
Autoencoders are deep networks with a symmetric topology and an odd number of hidden layers, containing an encoder, a low dimensional representation and a decoder.

Autoencoders can be used to reduce the dimension of data … if we can train them!

Problem: dimensionality of data

Idea:
1. Train the autoencoder to minimize the distance between input X and output X'
2. Encode X to a low dimensional code Y
3. Decode the low dimensional code Y to output X'
4. The output X' is low dimensional
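A minimal code sketch of this idea (using Keras/TensorFlow, which is an assumption on my part and not the software used in the talk): a small symmetric autoencoder is trained to minimize the distance between input X and output X', and the bottleneck layer yields the low dimensional code Y. Layer sizes and the random data are hypothetical.

```python
import numpy as np
import tensorflow as tf

# Hypothetical high dimensional input data X (1000 samples, 20 features).
X = np.random.default_rng(0).normal(size=(1000, 20)).astype("float32")

# Symmetric topology with an odd number of hidden layers (bottleneck of size 2).
inputs = tf.keras.Input(shape=(20,))
h = tf.keras.layers.Dense(10, activation="sigmoid")(inputs)    # encoder
code = tf.keras.layers.Dense(2, name="bottleneck")(h)          # low dimensional code Y
h = tf.keras.layers.Dense(10, activation="sigmoid")(code)      # decoder
outputs = tf.keras.layers.Dense(20)(h)                         # reconstruction X'

autoencoder = tf.keras.Model(inputs, outputs)
autoencoder.compile(optimizer="adam", loss="mse")   # minimize distance between X and X'
autoencoder.fit(X, X, epochs=10, batch_size=32, verbose=0)

# The encoder alone maps X to the low dimensional code Y.
encoder = tf.keras.Model(inputs, code)
Y = encoder.predict(X, verbose=0)
print(Y.shape)   # (1000, 2)
```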

Training

In feedforward ANNs, backpropagation is a good approach.

Backpropagation
(1) The distance (error) between the current output X' and the wanted output Y is computed. This gives an error function.
(2) By calculating the negative gradient of the error function we get a vector that points in a direction which decreases the error.
(3) We update the parameters to decrease the error.
(4) We repeat this.

Example (linear neuronal unit with two inputs)
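The formulas of this example are not preserved in the transcript, so the following numpy sketch reconstructs steps (1) to (4) for a linear unit y = w1·x1 + w2·x2 + b with a mean squared error function; the data and learning rate are hypothetical.

```python
import numpy as np

# Linear neuronal unit with two inputs: y_hat = w1*x1 + w2*x2 + b.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = 3.0 * X[:, 0] - 2.0 * X[:, 1] + 0.5     # "wanted output" Y

w = np.zeros(2)
b = 0.0
lr = 0.1

for step in range(100):                      # (4) repeat
    y_hat = X @ w + b                        # current output X'
    err = y_hat - y
    E = np.mean(err ** 2)                    # (1) error function
    grad_w = 2 * X.T @ err / len(y)          # (2) gradient of E w.r.t. the weights
    grad_b = 2 * np.mean(err)
    w -= lr * grad_w                         # (3) step against the gradient
    b -= lr * grad_b                         #     (the negative gradient decreases E)

print(E, w, b)   # the error shrinks, w -> [3, -2], b -> 0.5
```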

… the problem is the multiple hidden layers!

Problem: deep network
• Very slow training: backpropagation is known to be slow far away from the output layer …
• Maybe a bad solution: … and it can converge to poor local minima.

The task is to initialize the parameters close to a good solution!

Idea: initialize close to a good solution
• Pretraining
• Restricted Boltzmann Machines

Therefore the training of autoencoders has a pretraining phase … which uses Restricted Boltzmann Machines (RBMs).

Restricted Boltzmann Machine
• RBMs are Markov random fields
• Bipartite topology: visible units (v) and hidden units (h)
• The local energy is used to calculate the probabilities of the values

Markov random field
Every unit influences every neighbor; the coupling is undirected.
Motivation (Ising model): a set of magnetic dipoles (spins) is arranged in a graph (lattice) where neighbors are coupled with a given strength.

Training: contrastive divergence (Gibbs sampling)

[Figure: bipartite RBM with visible units v1-v4 and hidden units h1-h3]
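A minimal sketch of one contrastive divergence step (CD-1) with a single Gibbs sampling step, for a binary RBM with four visible and three hidden units as in the figure; the learning rate, data and initialization are hypothetical and not taken from the talk.

```python
import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda x: 1.0 / (1.0 + np.exp(-x))

n_visible, n_hidden = 4, 3                       # v1..v4, h1..h3 as in the figure
W = 0.01 * rng.normal(size=(n_visible, n_hidden))
b_v = np.zeros(n_visible)                        # visible biases
b_h = np.zeros(n_hidden)                         # hidden biases

def cd1_update(v0, W, b_v, b_h, lr=0.1):
    """One contrastive divergence step (CD-1); W, b_v, b_h are updated in place."""
    p_h0 = sigmoid(v0 @ W + b_h)                       # positive phase: P(h=1 | data)
    h0 = (rng.random(p_h0.shape) < p_h0).astype(float)
    p_v1 = sigmoid(h0 @ W.T + b_v)                     # one Gibbs step: reconstruct visibles
    v1 = (rng.random(p_v1.shape) < p_v1).astype(float)
    p_h1 = sigmoid(v1 @ W + b_h)                       # negative phase
    n = v0.shape[0]
    W += lr * (v0.T @ p_h0 - v1.T @ p_h1) / n          # approximate log-likelihood gradient
    b_v += lr * (v0 - v1).mean(axis=0)
    b_h += lr * (p_h0 - p_h1).mean(axis=0)

data = (rng.random((100, n_visible)) < 0.5).astype(float)   # hypothetical binary data
for epoch in range(50):
    cd1_update(data, W, b_v, b_h)
```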

Top

The top layer RBM transforms real valued data into binary codes.

Therefore the visible units are modeled with Gaussians to encode the data … and many hidden units with sigmoids to encode the dependencies.

The objective function is the sum of the local energies.

Local energy:
E_v := -∑_h w_hv (x_v / σ_v) x_h + (x_v - b_v)² / (2σ_v²)

[Figure: top RBM with Gaussian visible units v1-v4 and sigmoid hidden units h1-h5]
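Read directly from the formula above, here is a small numpy sketch of the local energy of one Gaussian visible unit; the numeric values are placeholders, not data from the talk.

```python
import numpy as np

def local_energy_visible(x_v, b_v, sigma_v, w_v, x_h):
    """Local energy of one Gaussian visible unit v, following the slide's formula:
    E_v = -sum_h w_hv * (x_v / sigma_v) * x_h + (x_v - b_v)^2 / (2 * sigma_v^2)."""
    return -np.sum(w_v * (x_v / sigma_v) * x_h) + (x_v - b_v) ** 2 / (2 * sigma_v ** 2)

# Hypothetical values: one visible unit coupled to five hidden units.
E_v = local_energy_visible(x_v=0.3, b_v=0.0, sigma_v=1.0,
                           w_v=np.array([0.2, -0.1, 0.05, 0.0, 0.4]),
                           x_h=np.array([1.0, 0.0, 1.0, 1.0, 0.0]))
print(E_v)
```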

Reduction

The next RBM layer maps the dependency encoding … from the upper layer … to a smaller number of sigmoids … which can be trained faster than the top layer.

Local energy:
E_v := -∑_h (w_hv x_v x_h + x_h b_h)
E_h := -∑_v (w_hv x_v x_h + x_v b_v)

[Figure: reduction RBM mapping the upper layer's hidden units (v1-v4) to a smaller hidden layer (h1-h3)]

Unrolling

The symmetric topology allows us to skip further training.
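A small sketch of what "unrolling" means here, under the assumption that the decoder weights are initialized as the transposes of the pretrained encoder weights; the shapes are illustrative only and do not come from the slides.

```python
import numpy as np

# Pretrained RBM weight matrices for the encoder stack (hypothetical shapes,
# e.g. 8 visible units -> 32 hidden units -> 5 bottleneck units).
W1 = np.random.default_rng(0).normal(size=(8, 32))
W2 = np.random.default_rng(1).normal(size=(32, 5))

# "Unrolling": the decoder reuses the transposed encoder weights,
# so the lower stack needs no separate pretraining of its own.
encoder_weights = [W1, W2]
decoder_weights = [W2.T, W1.T]
```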

After pretraining, backpropagation usually finds good solutions.

Training
• Pretraining: top RBM (GRBM), reduction RBMs, unrolling
• Finetuning: backpropagation

The algorithmic complexity of RBM training depends on the network size.

• Time complexity: O(i·n·w), where i is the number of iterations, n the number of nodes and w the number of weights
• Memory complexity: O(w)

Agenda

Autoencoders

Biological Model

Validation & Implementation

Network Modeling

Restricted Boltzmann Machines (RBM)

How to model the topological structure?

[Figure: S and E nodes coupled through transcription factor (TF) nodes]

We define S and E as the visible data layer … we identify S and E with the visible layer … and the TFs with the hidden layer in an RBM. The training of the RBM gives us a model.
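A rough sketch of this modeling idea using scikit-learn's BernoulliRBM (an assumption; the talk's own implementation is not shown, and BernoulliRBM uses binary visible units rather than the Gaussian units described above). The gene and TF names, counts and data are hypothetical; the learned weight matrix couples each TF (hidden unit) to each S/E gene (visible unit).

```python
import numpy as np
from sklearn.neural_network import BernoulliRBM

# Hypothetical binarized expression data: the columns are the visible layer (S and E genes).
rng = np.random.default_rng(0)
genes = ["S1", "S2", "S3", "S4", "E1", "E2", "E3", "E4"]
X = (rng.random((200, len(genes))) < 0.5).astype(float)

# The hidden layer represents the transcription factors (here: 3 TFs).
rbm = BernoulliRBM(n_components=3, learning_rate=0.05, n_iter=100, random_state=0)
rbm.fit(X)

# rbm.components_ has shape (n_TFs, n_genes): the learned coupling
# between each TF (hidden unit) and each S/E gene (visible unit).
for tf_idx, weights in enumerate(rbm.components_):
    print(f"TF{tf_idx + 1}:", dict(zip(genes, np.round(weights, 2))))
```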

Agenda

Autoencoder

Biological Model

Implementation & Results


Results

Validation of the results
• Needs information about the true regulation
• Needs information about the descriptive power of the data

Without this information, validation can only be done using artificial datasets!

Artificial datasets

We simulate data in three steps:

Step 1: Choose the number of genes (E+S) and create random, bimodally distributed data
Step 2: Manipulate the data in a fixed order
Step 3: Add noise to the manipulated data and normalize the data

Simulation

Step 1: Number of visible nodes: 8 (4 E, 4 S). Create random data: Random {-1, +1} + N(0, σ)
Step 2: Manipulate the data
Step 3: Add noise: N(0, σ)
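A sketch of the three simulation steps in numpy; the noise levels σ and the exact manipulation rule are not legible in the transcript, so the values and the rule below are placeholders.

```python
import numpy as np

rng = np.random.default_rng(0)
n_samples, n_genes = 500, 8              # columns 0-3: S genes, columns 4-7: E genes

# Step 1: random bimodal data, Random{-1, +1} + N(0, sigma).
base = rng.choice([-1.0, 1.0], size=(n_samples, n_genes))
data = base + rng.normal(0.0, 0.5, size=base.shape)        # sigma = 0.5 is a placeholder

# Step 2: manipulate the data in a fixed order (placeholder rule:
# let the S genes drive the E genes so that a known structure exists).
data[:, 4:] = 0.5 * data[:, :4] + 0.5 * data[:, 4:]

# Step 3: add noise and normalize.
data += rng.normal(0.0, 0.1, size=data.shape)               # placeholder noise level
data = (data - data.mean(axis=0)) / data.std(axis=0)
```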

We analyse the data X with an RBM.

We train an autoencoder with 9 hidden layers and 165 hidden nodes:

Layer 1 & 9: 32 hidden units
Layer 2 & 8: 24 hidden units
Layer 3 & 7: 16 hidden units
Layer 4 & 6: 8 hidden units
Layer 5: 5 hidden units
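A sketch of this topology in Keras (an assumption on my part, not the original implementation); the input dimension of 8 matches the simulated data above, and the middle layer of 5 units is the bottleneck.

```python
import tensorflow as tf

n_inputs = 8                                  # matches the 8 visible nodes of the simulated data
hidden = [32, 24, 16, 8, 5, 8, 16, 24, 32]    # 9 hidden layers, 165 hidden nodes in total

inputs = tf.keras.Input(shape=(n_inputs,))
x = inputs
for i, units in enumerate(hidden):
    # Layer 5 (the 5-unit layer) is the bottleneck carrying the low dimensional code.
    x = tf.keras.layers.Dense(units, activation="sigmoid",
                              name=f"hidden_{i + 1}")(x)
outputs = tf.keras.layers.Dense(n_inputs, name="reconstruction")(x)

autoencoder = tf.keras.Model(inputs, outputs)
autoencoder.compile(optimizer="adam", loss="mse")
autoencoder.summary()
```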

We transform the data from X to X' and reduce the dimensionality.

We analyse the transformed data X' with an RBM.

Let's compare the models.

Another example with more nodes and a larger autoencoder.

Conclusion

• Autoencoders can improve modeling significantly by reducing the dimensionality of data.
• Autoencoders preserve complex structures in their multilayer perceptron network. Analysing those networks (for example with knockout tests) could give more structural information.
• The drawback is the high computational cost. Since the field of deep learning is becoming more popular (face recognition, voice recognition, image transformation), many new improvements addressing the computational costs have been made.

Acknowledgement

eilsLABS

Prof. Dr. Rainer König

Prof. Dr. Roland Eils

Network Modeling Group
