An Interval MST Procedure
Rebecca Nugent w/ Werner Stuetzle
November 16, 2004
The Minimal Spanning Tree
The MST of a graph is the spanning tree with a minimal sum of edge weights
Essentially the “lowest cost” network to connect a group of vertices/data points.
Most commonly used with an edge weight of distance between two points
The MST cont.
Several common algorithms
Kruskal’s adds edges in increasing order
Can form disconnected point segments
All fragments eventually join
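As an illustration of the edge-by-edge behavior described above, here is a minimal Python sketch of Kruskal's algorithm. The function name `kruskal_mst` and the `(weight, u, v)` edge format are assumptions made for this example, not part of the original slides.

```python
def kruskal_mst(n, edges):
    """Sketch of Kruskal's algorithm.

    n     : number of vertices, labeled 0..n-1 (hypothetical labeling)
    edges : list of (weight, u, v) tuples
    Returns the list of edges kept in the spanning tree.
    """
    parent = list(range(n))  # union-find forest: one fragment per vertex

    def find(x):
        # Follow parent links to the fragment's root, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    tree = []
    for w, u, v in sorted(edges):      # edges in increasing order of weight
        ru, rv = find(u), find(v)
        if ru != rv:                   # edge joins two different fragments
            parent[ru] = rv            # merge the fragments
            tree.append((w, u, v))
    return tree


edges = [(1, 0, 1), (2, 1, 2), (3, 0, 2), (4, 2, 3)]
tree = kruskal_mst(4, edges)
```

The edge of weight 3 is skipped because vertices 0 and 2 are already in the same fragment by the time it is considered.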
The MST cont.
In/Out Algorithm (Prim’s)
Start with an “in” point
Find the closest “out” point. Connect the two.
Now find the closest “out” point to either of the two “in” points. Connect.
Etc.
Need only remember the 2nd closest distance from previous step.
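The In/Out steps above can be sketched in Python as follows. This is one reading of the bookkeeping remark, under the assumption that each "out" point stores its current closest distance to the "in" set and only compares it against the newly added point. The names `prim_mst`, `best`, and `link` are hypothetical.

```python
import math

def prim_mst(points):
    """Sketch of the In/Out (Prim's) algorithm on points given as tuples.

    Returns a list of (in_point, out_point, distance) edges in the order added.
    """
    n = len(points)

    def dist(a, b):
        return math.dist(points[a], points[b])  # Euclidean distance

    in_tree = [False] * n
    in_tree[0] = True                        # start with point 0 as "in"
    best = [dist(0, j) for j in range(n)]    # closest distance to the "in" set
    link = [0] * n                           # which "in" point achieves it
    edges = []
    for _ in range(n - 1):
        # Closest "out" point to any "in" point.
        j = min((k for k in range(n) if not in_tree[k]), key=lambda k: best[k])
        edges.append((link[j], j, best[j]))
        in_tree[j] = True
        # Only distances to the newly added point need to be checked.
        for k in range(n):
            if not in_tree[k]:
                d = dist(j, k)
                if d < best[k]:
                    best[k] = d
                    link[k] = j
    return edges


edges = prim_mst([(0, 0), (1, 0), (3, 0), (3, 1)])
```

Each round costs one pass over the "out" points, because the stored distances from earlier rounds are reused rather than recomputed.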
New Edge Weight
We are interested in using the MST to represent the underlying “shape” of the density of the data
Use the minimum of the density between two points as the pair’s edge weight
The MST structure should indicate the modality of the data
Points in high density areas/peaks should be “close”
Points separated by a “valley” should be “far”
If we assign the min density to a pair, a low density point in a tail will cause ties in a large number of edges; these ties are broken by Euclidean distance
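A minimal sketch of this edge weight, assuming one-dimensional data, a Gaussian kernel density estimate with a hand-picked bandwidth `h`, and a simple grid search along the segment between the two points. All names (`kde`, `min_density_weight`, `steps`) are hypothetical, and the grid minimum is only an approximation of the true minimum.

```python
import math

def kde(x, data, h):
    """Gaussian kernel density estimate at x (1-D data, bandwidth h)."""
    c = 1.0 / (len(data) * h * math.sqrt(2 * math.pi))
    return c * sum(math.exp(-0.5 * ((x - d) / h) ** 2) for d in data)

def min_density_weight(a, b, data, h, steps=200):
    """Approximate edge weight for the pair (a, b).

    Evaluates the density at a + t * (b - a) for t on a grid in [0, 1]
    and returns (grid minimum, Euclidean distance). The distance is the
    second component so it can serve as a tie-breaker.
    """
    lowest = min(kde(a + (b - a) * k / steps, data, h) for k in range(steps + 1))
    return (lowest, abs(b - a))


data = [0.0, 0.1, 0.2, 5.0, 5.1, 5.2]               # two well-separated groups
same_peak = min_density_weight(0.0, 0.2, data, 0.5)
across_valley = min_density_weight(0.1, 5.1, data, 0.5)
```

In this toy data the pair inside one group keeps a high minimum density, while the pair that straddles the gap picks up the near-zero density of the valley.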
Finding the Minimum
Only need the ordering of the edge weights to find the MST
(Note that any monotonic transformation of the edge weights preserves the MST structure)
Can instead find an interval bounding the minimum
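The slides do not say how the bounding interval is obtained. One possible way, shown here purely as an assumption, uses the fact that a Gaussian kernel density estimate with bandwidth `h` changes no faster than `exp(-1/2) / (h^2 * sqrt(2*pi))` per unit length. Any evaluated density value is an upper bound on the minimum, and that slope bound gives a lower bound between neighboring grid points. The names `kde` and `min_interval` are hypothetical.

```python
import math

def kde(x, data, h):
    """Gaussian kernel density estimate at x (1-D data, bandwidth h)."""
    c = 1.0 / (len(data) * h * math.sqrt(2 * math.pi))
    return c * sum(math.exp(-0.5 * ((x - d) / h) ** 2) for d in data)

def min_interval(a, b, data, h, steps):
    """Return (lower, upper) bounds on the minimum of the KDE between a and b.

    Assumed bounding scheme (not taken from the slides):
      upper = smallest density value seen on the grid
      lower = worst case allowed by the slope bound within any grid cell
    """
    # Largest possible slope of a Gaussian KDE with bandwidth h.
    lip = math.exp(-0.5) / (h * h * math.sqrt(2 * math.pi))
    cell = abs(b - a) / steps
    vals = [kde(a + (b - a) * k / steps, data, h) for k in range(steps + 1)]
    upper = min(vals)
    lower = min((vals[k] + vals[k + 1]) / 2 - lip * cell / 2 for k in range(steps))
    return max(lower, 0.0), upper          # a density is never negative


data = [0.0, 0.1, 0.2, 5.0, 5.1, 5.2]
coarse = min_interval(0.0, 0.2, data, 0.5, steps=1)
fine = min_interval(0.0, 0.2, data, 0.5, steps=8)
```

Increasing `steps` shrinks the grid cells, which is one way to refine an interval when it is too wide to decide between two edges.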
Finding the Minimum
Once the intervals have been found, some may overlap.
Refine the intervals until it is apparent which edge to add.
May not need to refine until all intervals are non-overlapping – can be selective in choosing edges
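The selective refinement idea can be sketched as below, assuming each candidate edge carries a `(lo, hi)` interval for its weight and that a caller-supplied `refine` function returns a tighter interval. Both the function name `pick_smallest` and the `refine` interface are hypothetical. The sketch selects the smallest weight; if larger density should win, the comparison would be mirrored.

```python
def pick_smallest(intervals, refine, max_rounds=50):
    """Find the edge whose weight is provably smallest.

    intervals : dict mapping edge -> (lo, hi) bounds on its weight
    refine    : function (edge, (lo, hi)) -> tighter (lo, hi)
    Returns the winning edge, or None if the intervals never separate
    within max_rounds (for example, when two weights are exactly tied).
    """
    intervals = dict(intervals)            # work on a copy
    for _ in range(max_rounds):
        # Candidate: the edge with the smallest upper bound.
        best = min(intervals, key=lambda e: intervals[e][1])
        best_hi = intervals[best][1]
        # Only edges whose interval dips below best_hi can still beat it.
        rivals = [e for e in intervals
                  if e != best and intervals[e][0] < best_hi]
        if not rivals:
            return best                    # no overlap left: decision is safe
        for e in rivals + [best]:          # refine only the contested edges
            intervals[e] = refine(e, intervals[e])
    return None
```

Edges whose intervals sit clearly above the candidate are never refined, which is where the savings come from.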
[Figures omitted: the example data with points labeled 1 to 10 and its kernel density estimate (k.de), followed by plots of the kernel density estimate against t with panel titles 16, 910, and 610, over t ranges narrowing from 0–1.0 down to 0–0.15.]