treesom: cluster analysis in the self-organizing map

13
1 Intelligent Database Systems Lab 國國國國國國國國 National Yunlin University of Science and T echnology TreeSOM: Cluster analysi s in the self-organizing map Advisor : Dr. Hsu Presenter : Zih-Hui Lin Author :Elena V. Samsonovaa, Joost N. Ko kb, Ad P. IJzermana Neural Networks 19 (2006) 935–949

Upload: nhi

Post on 09-Jan-2016

58 views

Category:

Documents


0 download

DESCRIPTION

TreeSOM: Cluster analysis in the self-organizing map. Advisor : Dr. Hsu Presenter : Zih-Hui Lin Author :Elena V. Samsonovaa, Joost N. Kokb, Ad P. IJzermana. Neural Networks 19 (2006) 935–949. Outline. Motivation Objective Introduction Method Conclusions Case studies. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: TreeSOM: Cluster analysis in the self-organizing map

1Intelligent Database Systems Lab

國立雲林科技大學National Yunlin University of Science and Technology

TreeSOM: Cluster analysis in the self-organizing map

Advisor : Dr. Hsu

Presenter : Zih-Hui Lin

Author :Elena V. Samsonovaa, Joost N. Kokb, Ad P. IJzermana

Neural Networks 19 (2006) 935–949

Page 2: TreeSOM: Cluster analysis in the self-organizing map

2

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.

Motivation Objective Introduction Method Conclusions Case studies

Outline

Page 3: TreeSOM: Cluster analysis in the self-organizing map

3

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.

Motivation

different map initializations and input order of data elements may result in different clusterings─ This is a lengthy and laborious task, so far not automat

ed. ─ For large data sets it becomes intractable for manual an

alysis, forcing the user to select a single “good” SOM and accept it as the final result omitting tests of confidence.

Page 4: TreeSOM: Cluster analysis in the self-organizing map

4

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.

Objective

We present a new method for cluster analysis and finding reliable clustering.

Page 5: TreeSOM: Cluster analysis in the self-organizing map

5

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.

Method 100 SOMs with different random seeds were produced (

Protein data)─ Map size 5 x 4─ Phase 1: starting learning rate 0.2, starting radius 6, 1000 iterations; ─ phase 2: starting learning rate 0.02, starting radius 3, 100,000 iterations.

Clustering ─ Cluster discovery─ Calibration

Consensus trees ─ SOM as a Tree─ Clustering confidence─ The most representative SOM

Page 6: TreeSOM: Cluster analysis in the self-organizing map

6

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.

Clustering 2.1. Cluster discovery 2.2. Calibration

Page 7: TreeSOM: Cluster analysis in the self-organizing map

7

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Consensus tree

3.1. SOM as a tree

The Besting clustering

Page 8: TreeSOM: Cluster analysis in the self-organizing map

8

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.

1.all the nodes in all the trees in the set are evaluated in terms of their leaf sets, and equivalent nodes are grouped.2.The number of nodes in each equivalence group divided by the total number of nodes in all trees gives the confidence value of each node in the group.3.

4. The distance between two sets P and Q equals the average distance between each element from P and from Q.

Consensus tree

3.2 Clustering confidence

Page 9: TreeSOM: Cluster analysis in the self-organizing map

9

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.

Consensus tree 3.3. The most representative SOM

Page 10: TreeSOM: Cluster analysis in the self-organizing map

10

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Conclusions

In this paper we present a new look at self-organizing maps, improving their applicability to clustering problems and facilitating comparisons of clustering results with those of hierarchical classifiers.

Page 11: TreeSOM: Cluster analysis in the self-organizing map

11

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.

Introduction

Protein data─ GPCRs

Consensus tree vs. phylogenetic trees

Page 12: TreeSOM: Cluster analysis in the self-organizing map

12

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Case study

Page 13: TreeSOM: Cluster analysis in the self-organizing map

13

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.My opinion

Advantage We can find the most confident clusters easily.

Drawback ….

Application …..