2015/6/201 minimum spanning tree partitioning algorithm for microaggregation 報告者:林惠珍

Post on 20-Dec-2015

223 Views

Category:

Documents

2 Downloads

Preview:

Click to see full reader

TRANSCRIPT

112/04/18 1

Minimum Spanning Tree Partitioning Algorithm fo

r Microaggregation

報告者:林惠珍

112/04/18 2

Outline

The Microaggregation problem MST partitioning algorithm Experimental results Conclusions and future work

112/04/18 3

The Microaggregation problem

Microdata set( n records, p numerical attributes)

To partition the n points into groups so as to minimize the objective function: SSE

subject to

It is equivalent to minimizing a standardized information loss measure L= SSE / SST.

112/04/18 4

Microaggregation vs. clustering problem

Constraint

Problem

The number of clusters

cluster size

Microaggregation No Yes

Clustering Yes No

So, modify the strategy for selecting edges for deletion from the MST for microaggregation problem. In decreasing length order Each tree has at least k nodes

112/04/18 5

MST Partitioning Algorithm

MST construction( Prim’s algorithm) Edge cutting Cluster formation

112/04/18 6

k=5

112/04/18 7

112/04/18 8

Experimental methods

D: Diameter-based fixed size method C: Centroid-based fixed size method M: MST-partitioning algorithm M-d: MST-partitioning followed by clusters of

size >= 2k partitioned by D M-c: MST-partitioning followed by clusters of

size >= 2k partitioned by C

112/04/18 9

112/04/18 10

Experimental results

112/04/18 11

Conclusions

The more pronounced the inherent clustering effects in the data, the greater is the advantage of using the our methods.

MST partitioning-based method should be considered as a potential candidate for any practical application.

112/04/18 12

Future work

To adapt some of the ideas used to solve other clustering problems to this constrained version.

To explore methods where minimum group size is treated as a soft constraint associated with a preference level.

top related