data visualisation using topographic mappings

Post on 02-Jan-2016

36 Views

Category:

Documents

4 Downloads

Preview:

Click to see full reader

DESCRIPTION

Data Visualisation using Topographic Mappings. Colin Fyfe The University of Paisley, Scotland. Outline. Topographic clustering. Topographic Product of Experts, ToPoE Simulations Products and mixtures of experts. Harmonic Topographic Mapping, HaToM 2 Varieties of HaToM. Trunk. Head. - PowerPoint PPT Presentation

TRANSCRIPT

Data Visualisation using Topographic Mappings

Colin Fyfe

The University of Paisley,

Scotland

Outline

• Topographic clustering.

• Topographic Product of Experts, ToPoE

• Simulations

• Products and mixtures of experts.

• Harmonic Topographic Mapping, HaToM

• 2 Varieties of HaToM

The somatosensory homunculus

• Larger area of cortex for more sensitive body parts.

Lower lip

Upper lipFace

Eye

Thumb

GenitalsToesFoot

LegTrunkHeadArmHand

LittleRing

MiddleIndex

Thumb

Pharynx

Kohonen’s Self-organizing Map

• camtasia\som.html

Orientation selectivity

Topology preservation

• Data space Feature Space

• Nearby Nearby

• Distant Distant (SOM)

• Nearby Nearby (SOM)

• Distant Distant

The Model

• K latent points in a latent space with some structure.

• Each mapped through M basis functions to feature space.

• Then mapped to data space to K points in data space using W matrix (M by D)

• Aim is to fit model to data to make data as likely as possible by adjusting W

Mental Model

• 1 2 3 4 5 6 ……

Details

• t1, t2, t3, … ,tK (points in latent space)

• f1(), f2(), …, fM() (basis functions creating feature space)

• Matrix Φ (K by M), where φkm = fm(tk), projections of latent points to feature space.

• Matrix W (M by D) so that ΦW maps latent points to data space. tk mk

Products of Gaussian Experts

)||||2

exp()(

))||||(2

exp()(

2

1

2

1

nk

K

kn

nk

K

kn

xmxp

xmxp

Maximise the likelihood of the data under the model

M

mkmmd

kd

kd

K

k

ndkmmdn

wm

mxw

1

)(

)(

1

)( )(

K

kknn mxxp

1

2))(log(

Using Responsibilities

M

mkmmd

kd

knkd

K

k

ndkmmdn

wm

rmxw

1

)(

)(

1

)( )(

||||||||

)exp(

)exp(

1

2

2

jnjm

M

mmnjn

jjn

knkn

mxwxd

d

dr

Comparison with GTM

)||||2

exp()2(

1)|()()( 22

kn

D

kknn mx

Kkxpkpxp

))||||(2

exp()2()( 2

1

2knnk

K

k

D

n rxmxp

t nt

nkknxxk d

dr

n )exp(

)exp(2

2

|

Growing ToPoEs

Demonstration

Advantages

• Growing : need only change Φ which goes from K by M to (K+1) by M.

• W is approximately correct and just refines its learning.

• Pruning uses the responsibility: if a latent point is never the most responsible point for any data point, remove it.

• Keep all other points at their positions in latent space and keep training.

)||(2

exp(1

)( 11

knnk

K

kn rxm

Zxp

Twinned ToPoEs

• Single underlying latent space

• Single Φ

• Two sets of weights

• Single responsibility – distance between current projections and both data points.

Twinned ToPoEs - 2

Using φkm=tanh(mtk)

Products of Experts

Responsibilities with Tanh()

1

19

37

55

S1

S1

1

0

0.2

0.4

0.6

0.8

1

A close up

Harmonic Averages

• Walk d km at 5 km/h, then d km at 10 km/h• Total time = d/5 +d/10• Average Speed = 2d/(d/5+d/10)=

• Harmonic Average =

10

1

5

12

K

k ka

K

1

1

K-Harmonic Means beats K-Means and MoG using EM

• Perf =

N

iK

kki mx

K

1

12

1

N

iK

l ilik

ki

k

dd

mxK

m

Perf

1

1

22

4 )1

(

)(4

N

iK

l ilik

N

iiK

l ilik

k

dd

x

dd

m

1

1

22

4

1

1

22

4

)1

(

1

)1

(

1

Growing Harmony Topology Preservation

• Initialise K to 2. Init W randomly.

1. Init K latent points and M basis functions.

2. Calculate mk=φkW, k=1,…,K.

1. Calculate dik, i=1,…,N, k=1,…,K

2. Re-calculate mk, k=1,…,K. (Harmonic alg.)

3. If more, go back to 1.

3. Re-calculate W=(ΦTΦ+γI)-1ΦTШ

4. K= K + 1. If more, go back to 1.

Disadvantages-Advantages ?

• Don’t have special rules for points for which no latent point takes responsibility.

• But must grow otherwise twists.

• Independent of initialisation ?

• Computational cost ?

Generalised K-Harmonic Meansfor Automatic Boosting

• Perf =

N

iK

kp

ki mx

K

1

1

1

N

iK

lpil

pik

ki

k

dd

mxpK

m

Perf

1

1

21 )1

(

)(2

N

iK

lpil

pik

N

iiK

lpil

pik

k

dd

x

dd

m

1

1

22

1

1

22

)1

(

1

)1

(

1

Two versions of HaToM

• D-HaToM (Data driven HaToM) :– W and m change only when adding a new

latent point– Allows the data to influence more the

clustering

• M-HaToM (Model driven HaToM) :– W and m change in every iteration– The data is continually constrained by the

model

Simulations(1): 1D dataset

K=2 K=4 K=8

K=20

M-HaToMD-HaToM

Simulations(3): algae dataset

D-HaToM

M-HaToM

Wider rij Narrower rij

Conclusion

• New forms of topographic mapping.

• Based on latent space concept but– free from probabilistic constraints.

• Product Mixture of experts.– automatic setting of local variances.

• Two types based on K-harmonic Means

• Very sensitive to data, or not, as required

top related