proceedings dtic · ad-a252 552 the onr workshop on discrete structures in classification 05 may-...
TRANSCRIPT
AD-A252 552
THE ONR WORKSHOP ONDISCRETE STRUCTURES IN CLASSIFICATION
05 MAY- 06 MAY 1992
RAMADA RENAISSANCE HOTELWASHINGTON-DULLES
PROCEEDINGS
DTICELECTE
_______JUL 0 119921This document has been -approved 8 ADfor public rclease and sale; itsdistribution is unlimited.
Organized by: F.R. McMorris, University of LouisvilleMarc J. Lipman, Office of Naval Research
GRANT: N00014-92-J-1574R&T: 4111066---01
92-1714092 6 " IIINhImIIIII
DISCLAIMER NOTICE
THIS DOCUMENT IS BEST
QUALITY AVAILABLE. THE COPY
FURNISHED TO DTIC CONTAINED
A SIGNIFICANT NUMBER OF
PAGES WHICH DO NOT
REPRODUCE LEGIBLY.
PREAMBLE
The objective of this workshop was to determine the state-of-the-art in Discrete Structured Classification, and chart new directions ofresearch relevant to Navy needs. Thus it had two goals. The first wasfor the Principal Investigators currently funded by ONR in this core areaof Discrete Mathematics to inform each other and their Scientific Officerof their current research activities. The second was for the PIs and Navyscientists to meet each other and exchange problems and ideas with thehope of developing significant research partnerships.
The first goal was grandly achieved by direct PI presentations.Progress on the second was made through the problem session androundtable discussions.
Accesion For
NTIS CRA&IDTI, TABUi,.:inounced
Justfiica t iun .......... .......
By ...... .............. ................
Dist ibutior I
Aval c:,..'Di,;t S ,I-ta
Statement A per telecon Marc LipmanONR/Code 1111Arlington, VA 22217-5000
NWW 6/30/92
%. . p
e P, .p :. . F.0
8".. p: . ...
.0
* * S
*..SSSSS * *S 5 .. * ** 5 5* :.*
s..::..:'.: *S
***%**.S*.2. 0.. t* h
4 MS
ATTENDEES
Dr. W.O. AlltopCode 3807Applied Mathematics GroupNaval Air Weapons CenterChina Lake, CA 93555-6001
Dr. Gary ChartrandDepartment of Mathematics and StatisticsWestern Michigan UniversityKalamazoo, MI 49008
* Dr. Donald W. DearholtDepartment of Computer ScienceMississippi State UniversityMississippi State, MS 39762
Dr. Pierre HansenGERAD and lcole des Hautes 6tudes CommercialesDepartement des M~thodes Quantitatives et Systames d'Information5255 avenue DecellesMontr6al, Quebec H3T IV6Canada
Dr. M.F. JanowitzDepartment of Mathematics and StatisticsUniversity of MassachusettsAmherst, MA 01003
Dr. Brigitte JaumardGERAD and lcole Polytechnique de MontrdalDdpartement de Math~matiques Appliqu6esSuccursale A, Case Postale 6079Montr6al, Quebec H3C 3A7Canada
Dr. Marc J. LipmanCode 1111Office of Naval Research800 N. Quincy St.Arlington, VA 22217
Dr. F.R. McMorrisDepartment of MathematicsUniversity of LouisvilleLouisville, KY 40292
Dr. K.B. ReidDepartment of MathematicsCalifornia State UniversitySan Marcos, CA 92069
Dr. Fred S. RobertsDepartment of MathematicsRutgers UniversityNew Brunswick, NJ 08903
. Dr. Uriel G. RothblumRUTCORRutgers UniversityNew Brunswick, NJ 08903
Dr. Ralph WachterCode 1133Office of Naval Research800 N. Quincy St.Arlington, VA 22217
ONR WORKSHOP ON
DISCRETE STRUCTURED CLASSIFICATION
Tuesday 5 May
0900 MARC LIPMAN
0930 BUCK McMORRIS
1030 PIERRE HANSENBRIGITTE JAUMARD
1200 Lunch
1300 BROOKS REID
1400 DON DEARHOLT
1500 Break
1515 Problem session - MARC LIPMAN
1600 Roundtable
1800 Conference Dinner
Wednesday 6 May
0900 MARC LIPMAN
0930 GARY CHARTRAND
1030 break
1100 URIEL ROTHBLUM
1200 Lunch
1300 MEL JANOWITZ
1400 Roundtable
1500 Wrap
Sm
THE FOLLOWING PAGES CONSIST OF ABSTRACTS ANDCOPIES OF THE TRANSPARENCIES USED FOR THEPRESENTATIONS.
- :3 EC'.z :3 4- 4-
I-04:~ 0 2~00oE
0
U).
cn 0) C;) A2
cCO (DcjE 0)
00 4-
.0
0 0 0C
CC ~ 0 C Uc
WI--0 ~ Co c.)
0) ~ c m _
CD "
E~ co
C ~-C
UU E) E
mi 0 . C,
C)
ucw amhA.LU
zz
LU-
0z =zi LIm W
=1of 4 00*Cf) lL LA .
=L=44 71.L
CO ~c O Oui cc CL p
LLCM
zurL(
I. c
WI'. LU 0
LUJ
Im- 2z 0 uCO- z
1mm LUm
CO) -' 0 LU 0CO) _U <L
CM) 2Iz z o- LU Z
CIDCm LLO
2Lm C0 Lcm) ZmLLWZ LUJ
LAi LLU ocA
~~LUJ. O TVC m C O
0 ~ L
LU_U LU
C . CO) C.
0... z0A LU
CM) CO)= c 0 0o I.- ui
Z cc
>0z0 LUg
cn LU 0 ZU aLC~
2 CM
LLJJ
I.. LUL. LU
0U 0
=e L lU
z ca =
2C o L L
0 LU EA
CO L' CUD LU ,O
CD-
cncZ0 N 0um
LA 0zr4-zN
-ooFi- w wAI
wE CO) CO) 2
>00
-- LI. 0 0)
LLi
z CD ~~I..I OZ4 2
Zc.Z
CO) CA~
C1
4n ,-
Im I doo% COL6U L IM
NeJ
LLWLLJ
lLLJ IN=
CM 0=
~~z
Lai 0 0 0 LPon~ ~ ~ -L nI.U O
ZZ TEO) mi= C- AIR 4c 2iL
LLJ~ a L-C)zzL
= Z 0
CL cz
An axiomatic approach to the aggregationof trees and other discrete structures
F.R. McMorrisDepartment of Mathematics
University of Louisville
Suppose several classifications (usually structured as trees) have beenconstructed for the same set of objects. How can we (why should we) forma "consensus" classification that captures the common agreement of theoriginal classifications? During the past twenty years many methods havebeen developed relevant to the comparison and consensus of classifications.One approach based on the central ideas of Social Choice Theory pioneeredby K. Arrow, has been extended to produce a general axiomatic model toaddress the above question. This talk will present an overview of thisapproach and indicate some recent results.
4
P2~P3
P4
Pi
Figure 2.1 aWaveform Segments
PZ (root)
P1 P2
P2 P4
P2 P3
Figure 2.1 bThe Relational Tree
Erich and Foith enumerate several properties of thnse trees [1]. For our work, the
most Interesting is that RT's partition the set of one-dimensional functions into
.equivalence classes. The partitions may be viewed as clusters In a pattern feature
H~ere~ :d n oArun6 er #fV voe r-
eacVN o4 er linearly7 ora&eri
4 he -aLern-atI*
MM-1~l r~jl usedk 49' eactl
pairwist comrrisor% of :W
* Faculty salaries
Y4ew flyiSj4 Uew Music al
New 4 Ic- b6d5
Vl V V 4 _
a a I a. at Ia a at g,a. '- a
a, a. 3: 4 a
00C,4 00
.0000 qr 8
0 00 " .0
0000 % e 00
00 44~-
- r- r-
CqN 00W cW
~0O 00' Go % en -
00q00 0O ' 000 Dr- -I
1. 00 0 n 0 4
00 C- r--a r- r 0% - 0v ON rN
C7 O n NI0 -0
m r--~-q 4 0 0 %C
< z 00 r-- -00IO00 rr-W1 W1
The i\-( vo~es 'in V
,&ttte vcAkr focesS catenaJ 4v ni ar
tn-T 44 4c-m
c~ec,~ic~1~'T~AL. 4we c * cs otre e &IT.
ThtO-M A. -T6.aowzw&4ij x~
a &jwvtna &U dirfb o( T.
X% e 4eI f e.4moma SpecWia ~r
a 16 akIF 61t
60 6Z
**~~ -kr %*h *. ~ I ~ I *
f -
ISincere ~r4~rAy va4 ing 4~a~eAJa
£Z1~41I~S
I
- -.... - - - - --. ~-,. --.. ------.--.-. .- -- ------------. --- ~ -. - ..- - -
.. ~ ~' ~
O1(\~~ IN~yEtve's
I~~~~ levelcj av.- &i;lo at verrVu '~ t
#~v -isbJ ABvmt~alkwiAk ZWk (lejve aO)WJ~-~n
Aie2v ei% d j& &IO
The S tpl en s w
wGiven 6 rnajr41 Outnmrt Toiu4'am
~ ~4* sopiAizAe Jecu irn.
E52.451(
I'
4 1 '24'1'
2* 41 1
1) T 4 I i o is 4t iECi"M XiyIV4o.456
X IS Wbt. S6f WsicA- &eC.
Mowev~r 4Enam~ PiVaIsu WM4
OSub~ynvl e IV) 4 ~wn. .eAA To' swlvl
C, Arise4' ft*ewH1 *
4' Cl
all" ~~ ~C ,s ~ s oo
IL~c, is the ile tc.
~cuSt T 4 r* h bes 4 uM e
Problem.% For w ich awaaiwwM& is
&hereafa &3 nJa ca &Ini~ -an
SIceve &%d +%e 64~4AWsionb T
I2KP~Q'I Majcbrity Tournamarvs:
SIncara~ sekt ba
VrAM,3 clfciiofts uv~catr
Ornend rrlrnt rrceure
I&RtThek A~nL Letw~een
kvr 79oim drz8 ionsv a
so lsi&cataA Majority vot-Irl
toa&t. A r L Mai 31 Cg'iql)z2
*, 40
purple. werices ave .. IeIOCMI
T6 a t N trAxse IC
isabr 141weVA
J G Cr'0xkmk'rii PurIL ~ics ar ce.~da olivs
preference cailrbrv% ff G w)& reA Fte-4to V (4heset ofvev~ce at G, -n this
emam~~ L%)uyAtthe &eA 0cm wI;M
llns)n*
ThisAt3 av is ibis aSai& 4 At 4
Irealiabio" ek G "uibrh s 6IwA
adC.
Lat 6 be~ a csrnecA I ra rh% IB
orAer ni. Lua Yi ang C cerie Aves
(niot nelw cAisjeit&' S'ACdS of
vedi!ces of 6. IV Iv aId CI
Th. rah %an a[ G wit esyfeel -6 V
~~IC is4 i~vMrke &A
u. hat(a .6 is ah rc 1f4
1I x E ahd d (x.4) A(x,,b)I
II
Let b a of ~~no&w wia*arcs xvA we%! vauw dA A .fe A.0
D is (nn#tmareaau4e,,iwiar
*D i's C %nt ra hr
n = ~v% t+22r
WOO
3AW-1 1Lsb
f b-.00"Oirab'
\A~hcIoi&tentd 3vaW6 are raabizlte
by ThEESI
Theoram. Ifn oivti lraf bo is
hentb is transitive.
St-vaeTh e Ilri~ifAa I Jbho order 8'AweI's mit (a. n.1%- ua ba We
~~lir1e
1t~EORH.Let 0 he ah oritnteA 3rar
04 ordr 4's
bi'1s (bi . .T@O~lALI.b
D istas Ia avJ D
CorI.ri hoihlueA uladr3
Let t. 6e a TI,, irrr,
arl let c 6e a %4e in
G
bi* etJ;&(T),. T-tm $A- 1 ) cm.s J1s tkrcts With.i d;ibj --rdd. ,hen the
trmAiv. n-'oorn is
Iby T
ec9*) orwbxA CV
vecV(T)
rian J' T Y.; A~x tEAiu 4 ~aI ll~E
cgr&mic1 jlT = i x;0 6w Cx) I wtorAU iiev(T) I
FUDLA M4WTAL RE uur VcT TEOR7s
~Abraick -we~i4h cat4oj 4ta4~CI&s4 di a ;6ivIC veI4A
Or Iwo aaiace4 ve4tces.
Theofanr 2. C(7civiKs, m68) Tht mne4;an
is4,~ Cw~ am e ay~y. 4wam
ro"Gacoc.Au cs siir
* Wiilot% o cowwfovalrkc
Th~oem !. The %m-cerrlr o'i T cmsowi 1 ove
vetv or~ two aA4j&ctw* VcA"Ces
Them 41. The %ammeAban ol Tmu 4iverkce soi - bin T,
'Mrem S. T~- -meAi-&n of T is containe4
cam r0Frofer.)
L~t4L
~ th {~ri-s ofva4 *ccs o4 a tree 1
EG) = max&A.o, j A . s
6~ ~ v r~o.O mem~r ab
otOfW 4c o~f T-.
9I-awfw%4W, ce4o A crdoiy - Cho
.-r 'oV ayj , . ri 4,,, " ir'i'er
I et ~~~iA
kw-Lev
ii (~ie~~)IIIv I I II I 1 k A t
Tirm .
a~~
'~- _____
Ip
q7 '78 1 -77 110 166
VI49 & ~ 34-7 to"66 3q 21 39 Z3 52 h17
11 0 I 21 303 .1 I
II3 & I1 10 I 21 C.
0 1 3 3 & 16
0 01 3 3 abe
ExMAmptc vt. TO P. 6ATER (A
Ab c C f 9 CCT;&)I 6 77 8 62 z 10 tOt 13 I11 Is Is 6
3 Is 15 15 18 It 21 21 b.c4 7-0 19 zZ 23 27 C
3 " 13 25 29 lo 33 C
35 36 7." a 33 43q c.7 5 34 a% 13 13t3s 4 cJ41
b S 35 34 43 42 51 4# 4'7 41 32 57 47% a16 S1 41 40 Sl so 61
ISIS 47 43 43 54 s4 o cIA
2.4 A 6fi q I11 "71 loo Sr2S 10S JJ -TI IS n 181. S7UP IS 12 % 73 27 75 1o4 SS C27 114 SS -TS Iq 74 16S 11 f2S lb q5 16 51 75 8lf 183
2 1"1 W6 77 T 2 51Iloci S10 lt S-1 U 973 77 1810 16f31ji ~ 5l7 1 7 110 .10
LUT T=LVE) ?A TRE,/ VIZ
bEE. Icor xEN an~d ric--epa-iye Ajvef
t.a I ekr1 BxI U evo 6 4'e L4- (i
16 Y .* kk e... IL -a -1ev, cA 4.kw
c4 NeA ces 'In a largesl- confane 4
.4
T~ h e-. 6all 6wA% we;5V4 ee A$ I d To
Ati - .AW (qT; t comuI~d at -0l verke
* c*~ T o4 rnriwWnvw% 4tAW~ bwwAuiw e;1 4 .
~~pIMtn
-%e s
__ a C..e fjVL~tA
Ir i
. T.
rIr;. We aS' JT
no - 1 x _ 'A l%
6vjt 0) 33 3i. 2.1 36 31I S
1I( 432 1 9 21 16 31 10
___ 21 7 21 30 8RIb A 8 1 7 (61 7 721_ _
63 l -s -)234 G
Lw,4-111, I 1"2 3 L4. 2. b
&O A XMr C- 4RN VVI") a GWeaht C) ,
SOVe center verwc'
CA(T) PCT tCCO, ;WT,
%W0 V... ?a~u cycl *t
-0/2 17a
lar l 4t i~h IWO' ~ol oftiw C3#vs
j~j, Urd exnah i G wilk
p in 4W) a EC 0))IA IA i'i
m&K4m Aorwi c amr Em& ajRiAIiq
b1 4IwI W.4e s b wi r~ V, b4c(
K.ai R ac Ceatro;s 6. center inb Aeesf* I1&TWORKS 21 CLI'q) 11.17,
*Plura1~4y .w4cKenc.. clIrm reahza4
by 4w'.., T.: mc*4avy mv4i 4sat.e+
*Plor-A;4y rr 4reON"a 4; tha reallejby truers M Onv r.ttmaIn nwumvseV8A. 'o a "tar ;fn b; 5ftA. M tbcmaji csOM
Am l PrI "~a %V .caAwk vet4 es in a"to mpari. Grhs &A Cmim-Iw~ (ida
f (wiWniW.k "nvlar) RcA sn al 1 1 lTANly prftuc baWM an clbIst in? 7SWD
ICLUSTER ANALYSIS ALGORITHMS
Pierre HansenGERAD and Ecole des Hautes Etudes Commerciales
D6partement des Mthodes Quantitatives et Syst~mes d'Information5255 avenue Decelles
Montreal, Canada H3T 1V6
Brigitte JaumardGERAD and Ecole Polytechnique de Montreal
D(partement de Math~matiques Appliqu(esSuccursale A, Case Postale 6079
Montr6al, Canada H3C 3A7
Cluster analysis addresses the following very general problem: given a set of entities.find subsets of it, called clusters, which are homogeneous. i.e., such that entities withinthe same cluster resemble one another, and/or well separated. i.e.. such that entities indifferent clusters differ one from the other.
From the late 60's, Mathematical Programming has been applied to cluster analysis.This allowed to:
(i) formulate precisely many problems of cluster analysis as mathematical programs:i.e., optimization problems with an explicit objective function and constraints:
(ii) study the computational complexity of these problems;
(iii) obtain new polynomial algorithms for easy problems, with a low. and sometimeslowest possible, complexity by careful study of each of their steps and of the datastructures necessary to their implementation;
(iv) obtain new and practically efficient algorithms for NP-hard problems;
(v) derive theoretical properties of existing or new algorithms;
(vi) make new clustering methods available to researchers in various fields in the formof computer packages.
)I
Our work in the last few years and planned for follows and extends such lines. We exploretwo avenues of research. On one hand, we define several new problems of cluster analysisin the areas of divisive hierarchical clustering, partitioning, constrained partitioning andclustering with asymmetric dissimilarities. Each time, complexity issues are or will beexplored and new exact algorithms designed (as well as heuristics in the cases where exactsolution is too time-consuming). The main tools used will be graph theory, combinatorialoptimization (mainly nonlinear 0-1 programming) as well as, for the first time in clusteranalysis, exact methods of global optimization.
On the other hand, we complement the study of algorithms by analysis of the stepspreceding and following their use: ways to construct dissimilarity indices, automatedmethods to select "best" partitions among efficient ones, derivation of robustness andsensitivity concepts for clusters and partitions, new and more informative ways to rep-resent partitions and hierarchies. This work will be done in parallel with the analysis ofseveral data sets for real world problems.
The first part of the presentation will review steps of a cluster analysis study and il-lustrate the mathematical programming approach. Bicriterion cluster analysis with thesplit and diameter criteria will be used for this purpose. The second part will reviewrecent work on average linkage divisive hierarchical clustering, maximum split clusteringwith connectivity constraints, the Weber problem on the sphere and using espaliers torepresent results.
2
PARTIT(ONIN6 -PROBLEIISAN6A L6 OR TH~ MS M
CLUSTER ANALYSISi rto cHNN turco. , i
umivE~ttry 6 RA b Js reetv
06 4*4W M. 4I 1*, 4 4 kf PA 444
40
CLUSTV&S (Oft. CL&S) SNVULP Sc'*Nwi~ Irpi
GLUSTCA~ A NA &Y SI S 0 DCP&Jirtsvsreps or 4 ev~ kecl
ID47A AmAa.YSL5 TCCUU6~ji.dS Jr a A
AN*LVSIS.,~ IIUL-ri Of DMDWA*L *7 A
P bsAvt~a* Wt. I Pus
*A~bis cPtfo LLEv *p * L V!5iii1 AF$)S
**si. Lwf--t or ","Tot~
ml~r Mc41
54IouLb Nor Aesems~LEW7
ftaS~oAw D*?U ^V t~
uses Or C4LUSIc& 4Ak'
&X#c#& Qdra. Fa* ST*wV4&tA eS f
f S *.igA (wC-K A~ .~4 SI'C$
* Wv~A i, M tm T~iW74e
cU7x* ( mowca.,&;bA,*I
* Oit(,4hDI ~~ * aoNFri..A (efra.
STEPS OP A CLUST(1k AMJAiLYsts$ TV&?
*t() Sh"IC 0 =4 ag, at t
(M Ob SCktV^T I AeJ bit Mr* uS1I #CAjtS-
WC~ b ISse$I IL * t re r$
d, 4 in a~ c'i~*
*egrA~g * CC444 Mrr ~ioiv
oee', 5 'ro ot pw& 'r A L WS1
two -i
(si'~~~~~ tip~ ~tWe1,v 4oi*~C L rE%-
L 0* db C
4% Cal' a i
beje j A it~vr cIWIV u
(rDos-,bLY *&W*lTW*A ff?*T~1s
y Pass/ 6LC?
IVst/c ARI'rE eO
CMIX I. blAmrvot aw v4 A*A oTieewIa
L*#A6vr Aw-tr St7rWC1J ?we csM?eiu
*'rhKCS146,Lb Soft rrpc VAILOE GeivrU AY*w
blS~t . * ry 04 £*eme..mie
Sun~L Sunp vAwm buA17'E4 byAPA~r.
Soft-sunryp * v~Atwr AeDImis om++L
L)I4~pci'rTU ep hi5Siii&*itm i,,mo eer
WELL - POSEG *Nt*cLPO*P
rxA~ OR ftuatT~c PvLY"D*L &A harO
rX ISTEmetav Pj+ J#+T #t .p
lu(pArr #j OP CLUsUert
J* THRC5&4OLb CY AItCAtA
SPLIT OP A CLuSrr& : SftAL&.Usr wh1sum44ir
Amobrcm~ Ao curbDTv tooftat ir *A#&DU *or s
&erw.'es;j Irwo emwr im *v e "v-s,5VA
dtcc )z "Ai
Ri~bug e A Cb9S~& : DVS *.
CPrI'reEs u'P TICK ef4'r-A g~zl Y 0e4 is 9v^u
&r lm-) *Dr#J
(Okcesirca. a r- c,
Ax)i pogn I Ms:1: 7-fts PIPAAlC 4uG&.I*fC*#J A#SrAC-C
SP'LIT F A PA~TOJPg:niu
SPLIT OF: TfflS MA1I'rIeWS C. &U 7PA
htip It M 4 SS,# i i i.Ati ry As-7VT 'CjEA CoW iti rs I.,
at:... nl ijkIt Ift 1 e'i .D 4 C
LrL F A 4*7IDSJ &)CAVr d mC7
op THIS5 P*A7IT10W 's 0CaLUSvCA (Sot n AtX to 4
* (3..01 J,&.E S1.040ci
TAIS PA1WDM lfs &-t.Irra4
0 tar wA
OF 0 ft ,IUpb($ 5.'( t~P4POCh!
TH~o~rno Np ST)Nu~ff ,t it F# £f."vt& * r
coot PL cdra
A55octArr *"rjtTA--M 6zv WIH#
SPLIT V*LUEi Fve4 *LL C.LVSfTC AMO&77"I d7,,A,S
Oj' 0 Alf L£uAL TO -r.6 wcgt?4 op 7rN
06ES OF A tMigji*l HU P*IJSJDOJO ToEc of a
4*AelLAA'4LY I ( M. C A7mrt C j*P. N. If *e) onve ~I w #s,
S , 1&v9.4A Ir # &.900t 87"0 P40 v I PCs ppj&.v vxmo
Sp IT
(71wcjLeD L 4 L64P it 5.1f
jA" (Cq , Ct WIT" I ~i w@ ~.i
p94- m
W It L6 ft) 0
C.&LV5? (W fteposenDt SPLIT)
~ 4P.,\ 4jaC. C ILI ftmm W C16A.4 du £jUC
cxt4hIPl It
0 o~:14C~~ ?P
Rst 4 'teVa'
4 2. U
SOUPTy PP.Cd-fLg -- a-.p-
ea MIA
I LOWIAL 4 1 M s -64 two.
(.OIJ)4t~~ra L&M DPJCO'I.
TI A.SAO P g5j? L )u tvADp e ;A.-
SIMOILE LIOVK4c.( t-L~gue~ $Ee~i VWOL6 Di~
( 09s k ors i~
P0 Iwirtjd.1 S~ar'e'.
F Iw A A sr op 0 A OLl srovf Pain's AL&
A*&#K 946CS op ofur . cjtc&$ou
V44.E &os~ *P MST P-9f4W
romb CeuVSTCQI ciCh. cowM*4 tggrofPegV or
I" *L C.
aBI os 4 a . a ot.b ~orm e
lox
P~t#S.EM L FlINE 4n ~ Ap UP ~4 t rs f .i
op 0 10Ir CLUS TL$
a lbNe~rn I ( 5&ut~crmt * .i. t F47CI
III I I*PI rr p*itl 1Ou *oi At*O E
*L&ktTt9 64APN 40Le*CMG
m ~rre*& A *'Tv
t~. I~7 F.C eF3
CLTgo GF M &*vltl#l I A#e4 OIcLI 04 s NA sC
"PL.Cn tt
'N *Ail MG EFp'C~'r WNCOJ lSIM6 ..
VEAY bISSIftILAR CJTO To CS *7 r"Ie CA/D$ or
CigAIl OP PAIAWISC I in#L4. C~iTt rir iw
vldl
ou~sc~r..s WI$I(#J U1 C .L.A
CV I - toAlrS~o
4 0p4O
~.iPe~eCP A'4 @CPI&
PRM1 L 0PAL6.DVlrfl IN ft$?VGr&
Impos lisll v*&Uac F04 s Ar )
puslul"6VIA1Dt(I ATr CKTACMtVsE or
tll*ugir cbst or To * sr&.uAh SnALCIT .0
iw F&Acra&f weiLKS Feit 'a-loo c,,1,Ttes
W*&(t or THC is 3001c sTAvGTUv.g
A M &LYSI 15es ArsuLrs 'T .oPc84 ?N4C
~e1CSf-PLIT P1*fIS 4 V*AresC ,,4f
awbi c*r. ON s *9 ST~tcw0vA
* SH'&LC cvI6ccvr P*ATWlaeo ~grost C~
S* SU ag(Ac4:c 4, ms~f rest
v"r O tg A rs C
tP 0r ##1
thtICA ~ K4AIV .NWvti 9i 1~ bAggC Lq...
I A i r i &.1 s.?# ea
RAM~vK fbe C, aV Now - tmoAc*. &~
I Clio fTeV $oU r*G4 FeaP. *rrse* nL *&,&or
S~t. VIr covfe-s'c. PAe44&,h
nEtJi.I f
1A441 11,
a z .. A
Ik. si I~e % c low
WE POO
Figure 3 Diameter split--ap - Uiform data
d
109
0 21,24 s23 #22
97.9
87.1
* 31
76.4 * 32
g 420143 51 .52
65.744 --54 :53
55.0 o .6 3 62
67 4, 66 6 5
.61o .69 .614L1* 3"611
-. 7144.3 o : 782*72 *82 I
86.85o 95 93 :92I "
33.6.101
103 .. 102.1:04
22.9 6. O5. Son e.E
12.1 Is WI(1$ e
1.143 Is
1.53 6.11 10.7 15.3 19.9
'S
259
Table 2. 24 psychlogi~cal tests ( A fA I
Test B-coeff. 61 bicrit. Factorpartition partition associated
1. Visual Perception I2. Cubes I t Spaial3. Paper Form Board 1 Relations
4. Flags 2
5. General Information 2
6. Paragr Compreihension 3
7. Sentence Completion 2 3 Verbal
. ord Classification 2 3
9. Word Maning 2
10. Adition e e
12. Code 3 4 Perceptual
12. Counting Dots3
13. Straight-Cuirved Capitals3
14. Word Reconition 515. Nuer Reco tion Reconiion16. Figure Recognition 4 ( 5C
.17. Object-Number (4 s
18. Nunber-Figure 4scatv
19. Figure-Word
20. Deduction 5
21. Numerical Puzzles 5
22. Problem Reasoning 5
23. Series Completion 5
24. Arithmetic Problems 5
258
Figure 1. Diameter-split map: 24 psychological tests IA4 bi'* r )1.07 d
1.01
0.942 -21
.23 -22
0.875 024 031
0 32
34 * 330. 35 * 41
142 0-51
0.743
52
o.676 7
0.610 *10~4
I
0.543
0.477
0.410 A
SOME .oo i TAI AirD .usr)Z1O4
PkDSLcIS WI'rq TkRe'SHOLO 7-)'p(
*cit IkiP
piteaLCit 5 FIN )A A wea6H'r comsfRA**/w
Il~xmum$PLir P*,t~irt,# os: io h
t% CLU5SE4.i (EU'ir w e* lf'eT
fl&$Oflvf vciaomr Poo. AC'.Psre*a. w )
0&j v*Lvir p SpLi, postA. * iv1A
v*LUE I- THE CDOC$ OF "rff '%Sr WmTkd01U"
T~Ov~g * .4vmw PP~oitI~J or*VT o , j)11sr4r4c'
or A SeLU1em Z MA P*CKIW6 PkestaD1
Epfgicwr A&L6G4PirmQ Po.~ rmc L,*rrc'
(eta41L4.e T0711 st 14*) AL~w ra iestwg
PAe06teftS Wl7Tm Pi: COO s*P JJD*#AA '( #Wfrv
M*X/muf So.. awucrr/OA;
*P04fth (Arevc~aiesw ro SrYtvcq ' , e#ft.EAA
*A'6*g.D w*A.K wir b.stme-eme
3. hAEI4.)su/it TYPr i C rfE4
* SUln..OF. SPL ITS OF A PDARTIrIO&
i4(.*~~ It* 1% C ile *11 Voi£f
SU .~ OF A iAhCTFAS Or- AX~bTDDD
$u pot .~, T A I*8ulne,,
4 PU- (q~) z hC k
Flewb A SUAMI0iua SPLITS
or"COAC(p.m. 0.7AUPW*Ab 0 . A4M( if) 7WEtC
AM &MmAq~L ?A&?1semp POOL 4P)O flAXIhM SVO - or
L 1Ag.ga Co. l tf t114 1-007k PIT f.TCtA,1
*6T'fc v@A'CL LlkV 46 "0LG.AI'$tor
1'If P0 &(S" C*w V*I SOL Ve L a
V'0 rur *uiA. (04*1f OF T'r ~ SiM 0 L 10 A4 ec
G u ascyclic . 141 vertices / amIi-s
11
4 X
Il max ixwn sumn- of- s#6s 7wtst..
"a r awrspw 6
kw~qof t6khi u&Lf acorkFWkVAM43
G1'.%
PAOPLE~ ~d ~CA- MII~uPt Sum OP hAlltarict.
THE& ie~m A uP 0 wAL 1 0 ft) ##SY *it0rs
" 00jo.( CASE f t VP904)
AL 40 9 Tb4I US IPJ~ (o~ rA Aa.6r44* Peuqk Met
IL+, 4 (4-- II 0-Ut V
jt VV Al*V &4t)*T'I)'*
Q0 (II a l@e YISVL £4~P *% I5eLV*MALSO/
As NtPeSSe&LEe VA~~ J.' V ~ ESF0 .L =2 0(AtD1)
EFHEMIM (F.tv.0 .74.V"Ajte I I" Tot e~ c4
Tilt wmar6 op
OT fAit~& rig,, tr v7 O(Pi) &4'*gA-,r v44Lvgs v#*4Af
WIT c t &-rono 6* .O(A' eIA) ddAtfUAO
PA06.E 4) FDPJO A n ~Jhton S Um-F -RA Dit~
V I l 6 (P C. jM.LRbSC(PhcMINDM
"tRE1 To V n'x4E-a.#rCe#C4 CASE)
A L S OF lNd7Clt(ST : 'DI"guUS C4SCI esoi. E~et NTAosAW*P14
'AJD' PACw DA" 4 PA'euI
'r vN c4E i Pcom s rg4iriEe
SThRA OF A e.LLS1CA C,
V VI A LL EfotI~ o F -1gf CLUSTE& 0 F
T~ $nOP ALL &jsSR,,L*trers 4 17 hsfw( TMC ft4Eb I lJ)
TigA1If (RaY r 9TOWD 110 *U* 7 atCSU
I k
CLS~WCA CL-,?C $ V^ ~
ALL *SDULA4DTir*EoesP*A
£tJ~~~e~eC$n I Mc.TCA
/#W-r (D' V CYTC A$7* Thr 0*421csT
SA A
gp)WOW
PLO&Lrn. to Fui~j A PA4t~r'ioj OF D ImT4
1% CLUSITEA WITH, ,'ga'MUP' SUM -OF$744
4=0-r k P )1AA 04L V1
EALEA~kDT~ l,* ,KoAKC1L 94
IN CAUSTEAIW& 6 flPL VVY at
ALSO VLA$1dM 1v7Im CAPAC?7v/ (Su c o~r4
tivi.V(Y 'Ig e-
Pp,&ur 11 . F/N *T7E'O
( INTOe M CLUS'EAS) W ITH MlauaD4Ui
%vt% op - (L IwS A wdIb mi motiolUD%
W 87H i P NVTS S u hVAA $$I ODSrio LsitDt)
CLOSEL'Y RC9L*rro PX*6LgDr w l 4 EA*.
1% 1c-ftAvs *A A CHI AlitGHD I *-W f"
A ve. VLl TIES
PDA P1,9rb 3M . oT,,c~cA.
1RAtri7gu& OfAl nCirm a OF Ccus-resq AJ44.y.Sis
Fr\#j POA.PEA&?cs
$ DU4Vdct*r c*& A*A P 4iA
.(Vivo&f44 p ,.
(4M vaia*",6 i ~
Ao,*~ cAO. + 6A4 ft TC&A y)' A. ALowee ro
SfV Ceutptcqrf
So* rt cs cvAepi #4~s ^ licI7f T~ f4er~*
0~* M*7'e. At~oMG tdr.## oSrda3 Ait * &m
A44-q
*FAC ILITY LOCATION
PROBLEMS
OW' A SPHERE
Pier're H-anserL /HEC - ERRD M1on rga 1
*roit Taumard 6E )?RD ( E7colpe %oecniue-
de lontrial
S te'rbate K6au /Ecole Poy LecIhnicue dA. Hon /I
NA. D) Skudenl
(.Dar ~LRao~, RZ)
A s)qbip has lobIe locater) in a speci Fc reyn i-n)jr
* P)i ppine- Basin vilhin 14w. qua~laera) frmeA b
13 t.O 00 6 < 9 4 150.00o
12.500 < T 18.500
anJ mush be. ready Io intervrene in case df nRae,
a an one OF 15 Pen r& in the. easlemr
6misp~ier fleie dcs m ay occur accortlin, 6 avaidle
Fore casLs w),c), are used as .eu Assum;.
A4e cost as a linear furncthOP OF peoJeic di4ance-,
t c c .. (5 I to MMj' 3 e. wef kicd averd~e
of CrcA C- S.tt O- o kIie va r I.auQS c+
s-.. .52.c
1.,),05
Ports and their locAhons
fzrts 9T~) in de~ree~srehs
Mapohi (t6. oo, 3M.hI )
Rm( it. 5 2 4 S. -t)'1
Mord iShu (.. t , 4s.33)
Rbad Jam (30.33, 4R. 2?-) 0.,5
Mvscs (43 C0.5-8.58)0.
C'ris~mas X~s. (-10.5 S0O 105. u)
* aarl (3o..910 115.25)0.
M~anila (14.59, 120 .3s )I
Ta ipe L (..0 1z.so)15
1UP ~ (35.401 f33. -7 )G ua m (&-47, 14.8)Port PMoresl (-.5 10, 16-4)
i erre~ H RN S E N" G ERR D -HEC Monlria I
Gri,4cJRUMPRD ,Ecole. Po)yLe-chniquede. rHontriaI
Bruno SJMEONE Universi~y off Romn&
Marhn KPOMASSE rl5C. SbVcerztOcoac. Polyecchnicu~ Jc, Monlr(4l
02 C 0 4 ,/ s1
-- 9
Z- 11-5
3 1010# 8(j- /0 &?7 01fc
4j 32 CC ql 5y 0( ((;10
{3 o, fj07 , {Oi,7{/ f047 {07; {o7 oqI~~~~~ .5a ~I ~;eO~7;{5OJf~
o' { 0+1t7 ~OJq; {07 f0qj os {os 87 -
AVEP6 oLINI(ACE b1I/IiV
HIEr'.4PCHICAL CLUSTCAINC
Ecos. E~ OFE~ nOJ..E4
15PL6IT& 74Ulf
6E~R1~ ECOL& POL.YTECH~JLdL'E hOAJrkE*L
ELI $AbTEr D0* StLVA
FOLYTECIMIAVE nO^)rAE.4L
O~%o,/ s..o* i se&- oFr ye,
-Ddo m~rix dF c;ssiniarb'ies:
C= C,, f par~ili-o? 0F 0 into cludtiers
T7z (§0) seb 0F biftribn5 OF Q1
pI
.-Disir Hiera rch 'ca) Ciusterin /or~im
M do
F, accordirj b0 CriLrxiori I
/ T tr -v e &7, CCOrdr ;
ri ri ,jr j r
L~ma Kimi a o F be. czrage Jissimi/ai-4~
0f 1Ae bats ;iarhbn ror ap ll useers:
C40ose j sud, Lhah- maxi-/I t1
CrItera' Q ,raximigabor 0F ke- civerc~ ,%sm/d elvwe pairs of eibfics one. in t,6 r1 ,~
one in C+
' Prodc1fon dE binary varjates z,,t a sso a led
WAe~ eacJ, erA/ 0,~ e9 suJ w t-
I if va)enc i"
fo A ei dh~,(-
mrx =d n; 2*(Cj) is Kn e__ivamt _
ma_2 S
-I o i ALL 9d "k 70*t 61#k >
EWAh KPEA!
EN wH ILL .
110 0 It6 V TbO Ae-eEl-A*#*Tr j41
0 P bA7)IJC& 0PCAec
* /b ite
~ No) +~ (LA.(I
,E CT A LGo t r mpi
* bNKELSACI.''S Li9P1* (0- CSC
CONSDIA * I4YekaoLic 0-oI Poto6AADI
IAX AI(X)C b (K) > Oft *aL
LVT (x*) . THEM, X* IS #P7IPFAL
IF Aa ooiy i.seu&4 beg) -Npcx4)) e
1TCA~hu AVCOiU C JOS Ikt&?*tAIk'c
A AL 0A 1711)? A AL C, (bovisiy AvrAhor-&ug
Cot CtuLV7laIi 6 )
SEL&CT rlA'i&&.C SOLUTIONJX .'
TAb&i
COMEPLTL VALUE PACX)
Xe
L Eo IE A WCU A. Srl L S $e&UTaom
Tht FLOW9(C&t~~p.
VTR E $c IEEKANJ OP711VAL So~urioxor,ro (~ MYV AN EXCT~ f+-0-'~~
l'Ii ORt AAr vCo u-iePL4A/( omf)
I F (xv ()( X) -N~ 4 "joyTP
*TkikvXSE LTEk~t
UP IPAIU 71 pesc CHM& V#~&&C ALI*
*SONEI
generated problems similar to those of the previous series of experiments are presented in Tables 9and 10. It appears that: (i) HTABU gives slightly better values than HDALC; (ii) computing timesof HTABU are smaller than those of HDALC. Therefore, a direct application of Tabu Search tothe hyperbolic Problem (P) appears to be preferable to the Tabu Search heuristic solution of thesequence of quadratic 0-1 problems given by Dinkelbach's lemma (1967).
BJ HDAL.C DALC
Pulkwa D. 77. M". D,, . M" D. MDri.F77, 7N -9 M 7w ?a 1 ,'
Pg 7 764 73 r13 r73 7w 43P 7W asr To Tn = ?l30 77 623
PS 4 .2 7sa S2 732 a W 2 542P6 S32 732 S L12 ,' SMQ W2 7=3 S32PF46S 7*7 46 465 717 4" 4"s 1- "1s 4' Ot"re. 43 -ok 439 436 704 436 4319 IM 4
pic =3 'm2 330 333 OMg 333 = O "033
tclu (- :6_) 0.01 0.02 0.01 0.10 0.11 0.11 0.31 0.32 032
Table 5: Comparison of heuristics and exact algorithm (13 psychological tests of Harman)
BJ HDALCPaition Dun V,, MD,, Don 'Dm MD.
P, 40 2 4062 40 2 4 2 4 J
P2 IM 3516 I3 2664 434 2664Pg 175 303 1755 256 37n2 No IAA6 66
P 130 3M 130 36 713Ps 131313 334 3M 34
P. 1W 304 ISIS 20 3156 264t ION 3116 IM 2216 3196 264
A3 106 3274 I=6 2112 3025 264Pg 1069 3144 1099 2M 25 24 2uPIo M 3001 M8 193 27W 28
Pic 706 2W66 706 1506 2272 223Pic 447 2653 447 719 127 141
0c.2 (0d) o. 1 0.451 0.43 4.96 151.04 51.02
Table 6: Heuristic solutions for Fisher's iris data (N=150)
14
AVERAGE DISSRIALARTIES
BJ HDALC QDAL; HQDLC
- I- 1706W 1 172412 173145 169610 172373 *T2JY 13 921 172711 1T2991 1707993 17028 170654 170257 171972 171967 164696 171990 172063 164472 1 rM66 171329 161924
100 4 19666 170743 16M66 171591 172201 13344 171644 172401 !63719 173036 171449 15756S 169532 N 1637 1635O32 171034 172421 161765 171183 172821 13473 17232 37115 1539
10 167470 ITO407 167470 16640 172107 16027 164 172324 159029 170146 170919 14022115 164597 170316 164557 166965 171470 154504 16546 171527 153028 Ii310 IT79 1249962 1410 103 171489 173W IS:W 7 171659 1YT5 171480 17240, T1OiW 1491763 171456 174515 16615 171462 173350 i474 171458 174515 166151 I7303S 17.1507 14364
200 4 171290 172602 154544 171221 17500 1512 171299 172602 1S4554 172231 17-2115 1358S 171067 I 3663 171552 1=3340 15733 171067 172697 I363 171274 172390 1563910 170031 171616 143491 166725 172011 147302 17001 171686 143491 16977. 171711 14703515 169642 171329 140615 166041 171306 145602 i6642 171329 140815 149154 1#006 1319967 14 1U, 4 11"17 11s" 147231 r117 1T23W8 1-T7W 14347 17126 1676823 1635$7 169594 109357 171417 17237 167100 171027 173107 168273 171600 172122 166424
300 4 169259 1697 1699 170747 171754 163740 170730 17246 16525,4 171410 172369 1500055 16M 169P.41 16066 1703M 172102 183619 170616 172466 155251 171976 171855 IS407210 1662= 16943 168270 19612 171573 153331 Io7 171906 143663 171245 171661 13"79515 16755 169366 167556 16"444 171069 150221 163672 171054 136313 169601 170923 126190
- 1 W8 1W0372 1U!H T0 IB4 ! 144 1"1'W 1723 171i14 122 ITY'1!4 170O489
3 163227 169346 163227 171302 169744 150428 170645 172536 166749 171417 I7076 161514400 4 1660 160324 19110 171275 170496 16SW.2 170452 171111 155441 1-0413 1 70433 161028
S386924 160320 1l24 172823 170460 153301 1702 17099 15726 1700 170296 15306510 148424 169264 166424 16S644 17052 142275 169731 170970 140173 171373 170210 1335515 167737 1 63221 167737 16P28" 170371 1396"6 162 1n693 13729 163422 170171 121299I lTWY -NOW 139113 7T33 715W -Tom 1171M 171917I T 1s 1 U 11"1(138 i0m3 160131 163235 10131 170567 171630 10770 170373 170162 IS96M I7IO I,0044 161369
S0 4 1 = 16 32 160M36 170373 171235 156738 170233 170816 1566 16996# 169353 1525315 1662 140196 1827 170731 171272 149645 170115 171044 I11156 171144 166624 14582110 166485 169170 166485 19M 170748 138304 149532 170244 12037S 166923 1630 135575
15 1 167941 19140 167941 169031 I170404 13756 166GM 170350 125766 169661 169221 126275
AVERAGE TIME
BJ HDALoC QDALC HQDAL
N D. . MD D.. V . MD. D.. 7. MD... D,. Ua. MD..-1UF -- M -- FI -- OrT -ii"Or -io 10.91 31.23 W15 U F
1.43 1.53 1.49 47.75 54.9 54-97 144.36 149.99 149.9S 139.73 144.14 143.592.14 3.37 3.22 67.21 5.94 6.84 233.46 256.61 256.50 211.8 229.12 229-0
,0 3a 6.37 ,S 130.12 163.23 183.17 342.13 384.36 384.24 326.01 364.84 364.78M 1 6.17 9.3 6.42 147.03 242.93 241.M1 602.21 679.18 6-3.06 5.00 661.33 6 0.19
Table 9: Results for large problems
17
S
Image Representation, Generalized Clustering, and Search in
Proximity Graphs and Pathfinder Networks
Donald W. DearholtDepartment of Computer ScienceMississippi State University.MS 39762
e-mail address: [email protected]: 601-325-2756
Abstract
Pathfinder associative networks (PFNETs) were originated to model humansemantic memory, and have proven particularly effective with associativeaspects of the organization of knowledge. Theoretical connections havebeen established with graph theory, path algebras, proximity graphs,computational geometry, clustering, and search procedures. The PFNET paradigmis being used in human-computer interface systems, and in the organization ofdata in a database for an experimental robotic vision system. Characteristicsof PFNETs include (1) the preservation of minimum-distance paths betweenentities, (2) the clustering of similar entities through the edge structure.(3) the consequent support of higher levels of abstraction, and (4) thecapability of generating proximity graphs, such as the relative neighborhoodgraph and the (open lune) gabriel graph. Monotonic search networks (MSNETs).closely related to PFNETs, provide for search in which no backtracking is evernecessary in domains having (objective) distance measures, and also support theclustering features of PFNETs. These studies have motivated investigations ofcluster learning and conceptual clustering from the perspective of primitivetransformations, with representation of the clusterings by means of PFNETs.Models of cluster learning in which the clusters represent either ordered orunordered sets, and which may or may not overlap, are being considered. Thesequential application of transformations on clusters to a graph-generationalgorithm is being considered as a learning paradigm, and this perspectiveappears to support a constructive view of clustering. Co-occurrence ofentities is an essential component of the process.
PROPERTIES AND APPLICATIONS OF
PATHFINDER-BASED ASSOCIATIVE NETWORKS
Don Dearholt
DEPARTMENT OF COMPUTER SCIENCE
MISSISSIPPI STATE UNIVERSITY
Developed under the sponsorship of:
AIR FORCE HUMAN RESOURCES LABORATORY
INSTRUMENTATION DIRECTORATE,
WHITE SANDS MISSILE RANGE
COMPUTING RESEARCH LABORATORY, NMSU
INTERNATIONAL BUSINESS MACHINES, INC.
NATIONAL SCIENCE FOUNDATION
NATIONAL AERONAUTICS AND SPACE ADMINISTRATION
OFFICE OF NAVAL RESEARCH
S
I
PHILOSOPHICAL STANCE: BETTER MODELING OF HUMANINTELLIGENCE WILL LEAD TO BETTER Al
THE NETWORKS WE ARE STUDYING:
DESCRIBE, SUMMARIZE, AND DISPLAY DATA
SUGGEST A PSYCHOLOGICAL MODEL ABOUT
MENTAL REPRESENTATIONS
COMPLEMENT MDS AND CLUSTER ANALYSIS
PROVIDE A PARADIGM FOR:
KNOWLEDGE REPRESENTATION
MODELS OF CLASSIFICATION
ORGANIZATION OF DATABASE SYSTEMS
SPREADING ACTIVATION (SEARCH)
D
p
OUTLINE
I. MOTIVATION, PERSPECTIVE, AND OBJECTIVES
II. PATHFINDER NETWORKS
A. DEFINITIONS AND PROPERTIES
B. APPLICATIONS
* III. CLUSTER LEARNING AND DYNAMIC SYSTEMS
A. MOTIVATIONS AND APPROACH
B. DEFLNITIONS AND PROPERTIES
C. APPLICATIONS
I
* RESEARCH OBJECTIVES
I. THEORETICAL
DEVELOP AND TEST METRICS
RELATIONSHIPS:
GRAPH THEORY
PATH ALGEBRAS
PROXIMITY GRAPHS (RNG, GG, DTG)
LEVELS OF ABSTRACTION
I. EMPIRICAL
SEMANTIC MEMORY
CLASSIFICATION MODELS
PROPOSITIONAL ANALYSIS
KNOWLEDGE EXTRACTION FROM EXPERTS
III. APPLICATION DOMAINS
ORGANIZATION OF CONCEPTS
INTERFACES--INFORMATION RETRIEVAL, HELP SYSTEMS
DATABASE ORGANIZATION
PERCEPTION--OUTLINES OF OBJECTS
p
THE BIGGEST CHALLENGE
FOR Al AND COGNITIVE MODELING:
TO DESIGN A SYSTEM WHICH DOES MANY THINGS WELL,
ALTHOUGH EACH ALGORITHM MIGHT NOT BE OPTIMAL
ASSOCIATIONAL ORGANIZATION
CLUSTERING
"p SEVERAL LEVELS OF ABSTRACTION
CLASSIFICATION
SEARCH
DESCRIPTION OF DECISIONS
p
I
DEFINITION
A PATHFINDER NETWORK (PFNET) IS A GRAPH BASED ON(
PAIRWISE ESTIMATES OR MEASURES OF DISTANCES
BETWEEN ENTITIES.
EACH ENTITY CORRESPONDS TO A NODE.
EACH PAIR OF NODES IN A PFNET IS CONNECTED DIRECTLY
BY AN EDGE WHOSE WEIGHT IS THE DISTANCE BETWEEN
THE TWO ENTITIES,
UNLESS THERE IS A SHORTER ALTERNATIVE PATH.
I
DIRECTED PFNET FOR NINE COUNTRIES
p
14
RUSI
FRANCE
NGLAND
USA CUBA
27 MEXICO 1
17JAMAICA
R-METRIC IS INFINITY
G-PARAMETER IS EIGHT
TOUCHSCREEN DISPLAY FOR EMPIRICAL DATA
HOW SIVLAR IS A TO B?
A
1t
B
OK +
DISTANCE + SIMILARITY --- K
THE PARAMETERS OF A PFNET
R-METRIC:
RULE FOR FINDING THE LENGTH OF A PATH WITH KEDGES
K
L(P) WR]IIR
1=1
R PATH LENGTH DATA SCALE
I SUM OF WEIGHTS RATIO
2 EUCLIDEAN RATIO
00 MAXIMUM WEIGHT RATIO, ORDINAL
0
THE PARAMETERS OF A PFNET
Q-PARAMETER:
"DIMENSION" OF GENERALIZED TRIANGLE INEQUALITIES SATISFIED
B
A1 C* I
D
A <[BR-CR+DR]I lR
0
THE TRIANGLE INEQUALITY
B
Ai E C
D
E B +C
A <E +D B +C +D
THE GENERALIZED TRIANGLE INEQUALITY
-- B
A c
D
A _ [BR+CR+DR]IlR
PURPOSE: TO PRESERVE MINIMAL-DISTANCE PATHS
p
THEORETICAL RESULTS
FOR A GIVEN DISTANCE MATRIX,
PFNET(R ,Q):
IS UNIQUE,
PRESERVES GEODETIC DISTANCES,
LINKS NEAREST NEIGHBORS, AND
CONTAINS THE SAME INFOPMATION AS THE
MINIMUM METHOD OF HIERARCHICAL CLUSTERING
PFNET(R =-,,Q =N-l) IS THE UNION OF ALL MINTREES
PFNET(R 2,Q) IS A SPANNING SUBGRAPH OF PFNET(R I,Q)
IFF R 1_R 2
PFNET(R ,Q 2) IS A SPANNING SUBGRAPH OF PFNET(R ,Q 1)
1FF Q 1<Q 2
MONOTONIC TRANSFORMATIONS PRESERVE
STRUCTURE FOR ALL PFNET(R =oo,Q)
MULTIPLICATIVE TRANSFORMATIONS PRESERVE
STRUCTURE FOR ALL PFNET(R, Q)
S
0 CO 0
A M
-n -n un 'n-
.-- ~ 0
E- ..- v~ LIC)
w- 4)O A2
C)3
on -LLu~ .. ~-.n c
VZ 0T c) r-
o C3C:)W~* II 0
so C.
C _D
co1
1 0am
IW 4w -
00-0
Op" M
*0v
COMPUTER VISION
GOAL: SCAN THE ENVIRONMENT AND MAKE DECISIONS
WITHOUT HUMAN INTERACTION
REQUIRES: KNOWLEDGE REPRESENTATION
CLASSIFICATION
, ABILITY TO DESCRIBE SCENE
RECONSTRUCT SCENE
ENHANCE SCENE
MODIFY SCENE
A HIERARCHY OF
EMPTY NEIGHBORHOOD GRAPHS
PFNET (L 2, oo, N-1)
RNG (L 2 ) = PFNET (L 2' oo 2)
PFNET (L 2, r, 2)
GG (L 2 ) MSNET(L 2 )
MGG (L 2 ) = PENET (L 2' 2, 2)/
DTG (L 2)
PFNET(L 2, 1, 2)
EACH GRAPH IS A SPANNING SUBGRAPH
OF THE GRAPH BELOW IT
MOTIVATIONS FOR STUDY OF CLUSTERINGS
LEARNING VIA CLUSTERINGS
MODEL DYNAMIC SYSTEMS
UTILIZE PATHFINDER NETWORKS
ASSOCIATIVITY
CLUSTERING
LEVELS OF ABSTRACTION
p
CONCEPT/CLUSTER LEARNING MODEL
SHOULD SUPPORT:
SETS OF CLUSTERS AS THE DOMAIN
PROGRESSION THRU CLUSTERINGS AS LEARNING OCCURS
p A DISTANCE MEASURE BETWEEN CLUSTERINGS
INTERPRETATION WITHIN THE PATHFINDER PARADIGM
p
I
MODEL FOR DYNAMIC SYSTEMS
SHOULD SUPPORT:
SETS OF CLUSTERS AS THE DOMAIN
PROGRESSION THRU CLUSTERINGS AS SYSTEM CHANGES
A DISTANCE MEASURE BETWEEN CLUSTERINGS
PATHFINDER OR PROXIMITY GRAPH PARADIGM
I
ADVANTAGES
PSYCHOLOGICAL FOUNDATIONS
REPETITION OR REHEARSAL CAN BE MODELED
REPRESENTATIONS AS ASSOCIATIVE GRAPHS
KNOWN GRAPH-THEORETIC PROPERTIES
VERIDICAL GRAPHICAL DISPLAY
I
ASSUMPTIONS
NOTATION DENOTES CLUSTERS VIA ()s
CO-OCCURRENCES DERIVED FROM ()s NOTATION
FEATURES OF ENTITIES WON'T BE CONSIDERED
S REPETITION INCREMENTS CO-OCCURRENCE
CO-OCC + DISSIM = CONSTANT
I
I
METRIC AXIOMS
REFLEXIVITY
SYMMETRY
TRIANGLE INEQUALITY--INVALID FOR CO-OCCURRENCES
IPFNs IMPOSE THE GENERALIZED TRIANGLE INEQUALITY
I
COMPONENTS OF THE MODEL
DOMAIN D = (A, B, C, D, ... I
SAMPLE CLUSTERING Ci = ((A, B, C), D, E)
PRIMITIVE OPERATIONS
ADD ENTITY TO D
REMOVE ENTITY FROM D
MERGE TWO CLUSTERS INTO ONE CLUSTER
SPLIT ONE CLUSTER INTO TWO CLUSTERS
WHERE AN ENTITY IS A CLUSTER
EXAMPLE: THE LEARNING PARADIGM
CO = [AB, C, DEqF)
so= -000000 -000000 -000000-0 00000 -00 0000-
Cl ( (A, B, C), D, E, F)
S1= -110001 -100011 -00000 0-000000 -0
A 1 B @ 000000-I
EXAMPLE CONTINUED:
C2 =((A, B, C)2, (D, E), F}
S2= -220002-200022-000
I0 0001-0A 0000 0-
C3 =((A, B, C)3, ((D, E), F))
S3= -330 003-300033 -000000-21
A BD E 001
11 33C F
EXAMPLE CONTINUED:
C4 = (A, B, C)4, ((D, E), F)2, (B, D)}
S4= -440004-410044-0000 10-3 2
4 2 000320-
EXAMPLE: THE CONCURRENT PARADIGM
C = [(A, B, C), ((D, E), r'), (B, D)}so= - 11 00 0
1 -1 10 0
0 10-21
A DE 0 00 10-
APPLICATIONS
METRIC FOR DIFFERENCES IN EXPERTISE
IDENTIFICATION OF ERRORS
CONSENSUS
AUTOMATIC SYSTEM ADAPTATION WITH LEARNING
HUMAN-COMPUTER INTERFACE
ROBOTICS VISION DATABASE
TRANSFORMATIONS, DISTANCE,
AND DISTANCE GRAPHS
Gary ChartrandWestern Michigan University
ABSTRACT
Several transformations are described - between graphs and between subgraphs in a graph.Each of these transformations gives rise to a distance (between graphs or between subgraphs in a
graph). The relations between collections of graphs or between the subgraphs of a specified size
within a (connected) graph can be described by graphs themselves, called distance graphs. In.addition to describing these concepts, another distance between induced subgraphs of a specified
order and the corresponding distance graphs are also discussed.
2
Introduction
The distance between two vertices in a connected graph is the length of the shortest path
connecting the vertices. Distance is one of the most fundamental concepts in graph theory.
Algorithms for determining distance in graphs are well known while applications involving
distance in graphs are varied and numerous. Indeed, so much work has been done on this subject
that Buckley and Harary wrote a book in 1990 entirely devoted to distance in graphs. Distance in
graphs has been generalized in several ways, most notably perhaps to Steiner distance.
The combination of distance and graphs occurs in many other ways. One of these concerns
distance between graphs. It would be more accurate to speak of distance between certain pairs of
graphs because in many instances distance is defined between graphs having some specified
properties. In all such distances, the distance between two graphs is 0 if and only if they are
isomorphic. Hence, the distance between two graphs is a measure of the structural difference
between the graphs.
Transformations
Some distances between graphs involve the idea of transformations. Let G and H be
two (p, q) graphs. We say that G can be transformed into H by an edge rotation if Gcontains distinct vertices u, v, and w such that uv r E(G), uw e E(G), and H - G - uv + uw.
More generally, we say that G can be r-transformed into H if there exists a sequence G = G0,
G 1 ... , Gn = H (n > 0) of graphs such that Gi can be transformed into Gi+ by an edge
rotation for i = 0, 1, ..., n - 1.
An edge slide is a restricted version of an edge rotation. A graph G can be transformed
into a graph H by an edge slide if G contains distinct vertices u, v, and w such that uvE
E(G),vw E E(G), uw 0 E(G) and H = G - uv + uw. If a graph H is isomorphic to a graph G
or H can be obtained from a graph G by a sequence of edge slides, we say that G can be s-
transformed into H. For example, the graph H of Figure 1 can be obtained from the graph G
by an edge rotation, but H cannot be obtained from G by an edge slide. On the other hand, the
graph H' can be obtained from G' by an edge slide (as well as by an edge rotation).
3
V W V wG: V2 H: ZU Z U Z
V W V W
U Z U Z
Figure 1
It was shown by Chartrand, Saba, and Zou that every (p, q) graph G can be r-
transformed into any other (p, q) graph H. It was also shown by Johnson that s-transformation
preserves connectedness. Further, a graph G can be s-transformed into a graph H if and only if
G and H have the same number of components and corresponding components of G and H
have the same order and same size.
SMetrics Based on Transformations
Associated with these transformations are two metrics defined on graphs. Let G and H
be two graphs having the same order and same size. The edge rotation distance or, more simply,
the r-distance dr(G, H) between 6 and H is the smallest nonnegative integer n for which
there exists a sequence G 0, G 1, ..... G n of graphs such that G6= G0, H =G, and G i can be
obtained from G_ by an edge rotation for i =1, 2, ..... n. For example, the edge rotation
distance between graphs G and H shown in Figure 2 is dr(G, H) = 3.
G: p OH:C 0
Figure 2
The following properties of edge rotation distance were established by Chartrand, Saba,
and Zou.
S
4
Proposition 1 If G and H are two graphs having the same order and same size, then
dr(G, H) = dr(G, H).
It was shown that every nonnegative integer is the r-distance between some pair of graphs.
Proposition 2 For every nonnegative integer n, there exist graphs G and H such that
dr(G, H) = n.
Prior to presenting an upper bound for the r-distance between two graphs, we introduce
another concept. For nonempty graphs G1 and G2, a greatest common subgraph of G1 and G2
is defined as any graph G of maximum size without isolated vertices that is (isomorphic to) a
subgraph of both G1 and G2.
Proposition 3 Let G and H be two (p, q) graphs with q > 1, and let s be the size of a
greatest common subgraph of G and H. Then dr(G, H) < 2(q - s). Moreover, this bound is
sharp.
Another distance between graphs is associated with edge slide and was discussed by
Johnson and by Benadd, Goddard, McKee, and Winter. Let G be a graph with components Gi,
* 1 i < k, and H a graph with components Hi, 1 < i < k, such that Gi and Hi have the same
order and same size. We define the edge slide distance or, simply, the s-distance ds(G, H)
between G and H as the smallest nonnegative integer n for which there exists a sequence GG0, G1 = .... Gn H of graphs such that, for i = 1, 2, ... , n, Gi can be obtained from Gi 1 by
an edge slide. If G and H are the graphs presented in Figure 3, then the edge slide distance
between G and H is ds(G, H) = 2.
G: H:
<XFigure 3
Note that dr(G, H) = 1 for the graphs G and H of Figure 3. It is straightforward to
show that dr(G, H):5 ds(G, H) for every pair G, H of connected graphs having the same order
and same size. The following result is perhaps less obvious.
5
* Proposition 4 For every pair m, n of positive integers with m < n, there exist graphs G andH such that dr(G, H) = m and ds(G, H) = n.
Distance Graphs
Let S be a set of (nonisomorphic) (p, q) graphs. Then we define the edge rotation
distance graph Dr(S) of S as the graph with the vertex set S such that two vertices G and H
of Dr(S) are adjacent if and only if dr(G, H) = 1. A graph F is an edge rotation distance graphif F = Dr(S) for some set S of graphs.
Let S' be a set of (nonisomorphic) graphs having the same number of components,
labeled in such a way that the ith components of all graphs have the same order and same size.Then we define the edge slide distance graph Ds(S') of S' analogously.
It was shown by Chartrand, Goddard, Henning, Lesniak, Swart, and Wall that every
graph is an edge slide distance graph and it was conjectured that all graphs are edge rotationdistance graphs. A number of classes of graphs are known to be edge rotation distance graphs.
The next two results are due to Chartrand, Goddard, Henning, Lesniak, Swart, and Wall.
* Proposition 5 Complete graphs, cycles and trees are edge rotation distance graphs.
Proposition 6 Every line graph is an edge rotation distance graph.
Proposition 7 (Faudree, Schelp, Lesniak, Gyrfds, and Lehel) The complete bipartite graphsK3,3 and K2,p (p _ 1) are edge rotation distance graphs.
Proposition 8 (Jarrett) For every pair m, n of positive integers, the graph Kmn is an edge
rotation distance graph.
F-Transformations
Let G and H be two (p, q) graphs, both containing a subgraph isomorphic to a givengraph F of order at least 2. We say that G can be transformed into H by an F-rotation (orsimply, G can be F-rotated into H) if there exist distinct vertices u, v, and w of G and asubgraph F' of G isomorphic to F such that u e V(F'), (v, w) 9 V(F'), uv e E(G), uw eE(G), and H = G - uv + uw. For example, if F - K1,3, then the graph G of Figure 4 can beKi, 3-rotated into H and H'.
G: H:v W
x x x
U ZU Z U Z
Figure 4
More generally, we say that a graph G can be F-transformed into H if either (1) G =_H
or (2) there exists a sequence G = Go, GI, ..., Gn = H of graphs such that, for i = 0, 1, ... ,
n - 1, the graph Gi can be F-rotated into Gi+1 . For instance, the graph G of Figure 5 cannot
be K1,4-rotated into H, but G can be K1,4 -transformed into H.
G( = Go): G1• H(= G2):
Figure 5
Observe that K 2-rotation and K2 -rotation are edge rotation and edge slide, respectively.
Clearly, if a graph G can be F-transformed into a graph H, then G and H have the same
order, same size, and both contain a subgraph isomorphic to F. Unfortunately, the converse is
not true, in general. For instance, the graphs G and H of Figure 6 have the same order andsame size, and both G and H contain a subgraph isomorphic to C4 , but G cannot be C4 -
transformed into H. In fact, G can be C4 -transformed only into itself.
p
7
G: ~H
:C
Figure 6
One may ask the question: What are necessary and sufficient conditions for one of two
graphs G and H to be F-transformed into the other? We have already seen the answer to this
question if F = K2 or F = K2. The following results are due to Jarrett.
Proposition 9 Let F be any nontrivial connected graph. If a connected graph G can be F-
transformed into a graph H, then H is connected.
Corollary 9a Let F be a nontrivial connected graph. A graph G can be F-transformed into agraph H if and only if the graph G has components G1, G2, ..., Gk, the graph H has
components H1, H2, ..., Hk, and Gi can be F-transformed into Hi for every i (1 <i <k).
Proposition 10 Let F be a connected graph of order p' with 8(F) = I, and let G and H
be two (nonisomorphic) connected (p, q) graphs, each containing an induced subgraph
isomorphic to F. Then G can be F-transformed into H.
Next we show that if 8(F) > 1 or F is not an induced subgraph of G or H, the result
does not necessarily hold. For example, for the graph F of Figure 7 we have 8(F) = 2 > 1.
Although the graphs G and H contain F as an induced subgraph, G cannot be F-transformed
into H. In fact, G cannot be F-transformed into any graph different from G.
F: G: H:
Figure 7
For the graph F of Figure 8 we have 8(F) = 1, but F is not an induced subgraph of G,
and H cannot be obtained from G by an F-transformation. Indeed, G can be F-transformedP only into itself and graphs G' and G" of Figure 8.
. °<88ID ~~~F: o>
G ( H:C>
Figure 8
With each F-transformation another metric can be defined. Let F be a graph of orderp' : 2 and let S be a set of (p, q) graphs such that for every pair G, H of graphs in S, the
graph G can be F-transformed into H. The F-distance F-d(G, H) between G and H is
defined as the minimum number of F-rotations necessary to transform G into H. For the graphs
F, G, and H of Figure 9, we have F-d(G, H) = 2.
l* F: G: H:
Figure 9
If a graph G cannot be F-transformed into a graph H we set F-d(G, H) = 00. It is
obvious that K2-d(G, H) : F-d(G, H).
Proposition 11 Let F be a graph of order p ( 2) and let n be a nonnegative integer. Then
there exists a pair G1, G2 of graphs such that F-d(G 1, G2) = n.
Let F be a graph of order p' (> 2) and let S be a set of (p, q) graphs, each containinga subgraph isomorphic to F. Then the F-distance graph D(S) of S is that graph whose vertex
set is S and in which two vertices G and H are adjacent if and only if F-d(G, H) = 1. Forexample, if F - K3 and S is the set of graphs G1, G2, G3, and G4 shown in Figure 10, then
S(S) =- K4 - e.S
G2G:
G3
Figure 10
Proposition 12 Let F be a nontrivial connected graph. Then every graph is an F-distancegraph.
Subgraph Distance
Let G, and G2 be edge-induced subgraphs of the same size in a graph G. The subgraphG2 can be obtained from G1 by an edge rotation if there exist distinct vertices u, v, and w suchthat uv = E(G1), uw O E(G1), and G2 = G1 - uv + uw. More generally, G1 can be r-
transformed into G2 if GI = G2 or G2 can be obtained from G1 by a sequence of edge
rotations. It was shown by Chartrand, Johns, Novotny, and Oellermann that every edge-inducedsubgraph of a connected graph G can be r-transformed into any edge-induced subgraph of Ghaving the same size. The edge rotation distance dr(GI, G2) between G1 and G2 is theminimum number of edge rotations required to r-transform G1 into G2. For the graph G ofFigure 11, the subgraph G3 can be obtained from G1 by an edge rotation so that dr(Gl,G3) "-1.On the other hand, G3 cannot be obtained from G2 by an edge rotation, but G3 can be obtained
from G2 by an r-transformation and dr(G2, G3) = 3.
10
z z v
x Y u t yU
G2 : z v G3:
x y y u
Figure 11
One can also define a subgraph transformation based on edge slide. Let G1 and G2 be
two edge-induced subgraphs of the same size in G. We say that G2 can be obtained from G,by an edge slide if there exist distinct vertices u, v, and w of G such that uv E E(G1 ), uw 4
E(G1), vw E E(G), and G2 = GI - uv + uw. For example, for the graph G of Figure 12, the
subgraph G2 can be obtained from G, by an edge slide. More generally, we say that G, can
be s-transformed into G2 if either (1) G, = G2 or (2) G2 can be obtained from G 1 by a
sequence of edge slides.
w wG: GI: G2:
v U v u v U
z z z
Figure 12
As we mentioned earlier, for every pair H, H' of edge-induced subgraphs of the same size
in a connected graph G, the subgraph H can be r-transformed into H'. Unfortunately, this is
not the case for s-transformations. For example, if G is the graph of Figure 13, then H cannot
be s-transformed into H'. In fact, H can be s-transformed only into itself.
I
11
G: H. H': V\ X
u w y u w w y
Figure 13
Let e and f be edges of a graph G. A triangular e-f walk of G is a finite, alternating
sequence e = e0 , T 1 , e l , T 2, ..., en_ 1, Tn , en = f of edges and triangles such that ei_1 and e i
belong to Ti (1 < i < n). A triangular e-f path is a triangular e-f walk in which no edges or
triangles are repeated. The number n of triangles in the triangular path is called its length. In the
graph G of Figure 14 there exists a triangular e-f path (with Ti = ({ei_ 1, el}), i = 1, 2, 3), but
there is no triangular e-g path.
e = e el e2 e3 =f g
Figure 14
It is straightforward to show that every triangular e-f walk in a graph contains a triangular
e-f path.
Observe that for every two edges e and e' of a triangle T, the subgraph ({e')) can be
obtained from (fe)) by an edge slide. Therefore, if in a graph G there exists a triangular e-f
path, then G1 = ({e)) can be s-transformed into G2 = ({f).
Whenever edges e and f belong to a 3-cycle in G, we denote this triangle by T(e, f)
and call it a slide induced triangle. With every edge slide there is associated a unique triangle T,
namely, if G2 = G, - e + f, then T = T(e, f). These observations are useful in proving the
following result by Jarrett.
Proposition 13 Let G be a connected graph of size q a 1, and let q' be an integer with
1 < q' < q. For every pair G1, G2 of subgraphs of G having size q', the subgraph G, can be
s-transformed into G2 if and only if every two edges of G are connected by a triangular path.
I
12
Triangular Line Graphs
For a given graph G, we define its triangular line graph 7(G) as that graph with vertex set
E(G) such that two vertices e and f of '2(G) are adjacent if and only if T(e, f) is a triangle of
G. For G = K4 - e, the graph 7(G) is shown in Figure 15.
G: e3 "(G): e2 e3
eee
el el e4
Figure 15
It follows from the definition that '7(G) is a spanning subgraph of the line graph L(G).
The next result is perhaps less obvious. These are due to Jarrett.
Proposition 14 Let G be a connected graph of order p > 2. Then '1(G) = L(G) if and only
if G = Kp.
Proposition 15 Let G be a connected nontrivial graph. For every pair G1, G2 of edge-
induced subgraphs of G having size 1, the subgraph G, can be s-transformed into G2 if and
only if '(G) is connected.
Corollary 15a Let G be a nontrivial connected graph. Then for every pair G 1, G2 of edge-
induced subgraphs of G having the same size, G, can be s-transformed into G2 if and only if
'7(G) is connected.
For integers n > 2, the nth iterated triangular line graph T'n(G) of a graph G is defined
to be '7(Tn-l(G)), where T'(G) denotes 7(G) and Tqn-(G) is assumed to be nonempty.
Clearly, Tn(G) is a subgraph of the nth iterated line graph Ln(G) of G. In fact, for n =
1, 'T1(G) = "7(G) is a spanning subgraph of L1 (G) = L(G).
Note that every triangle T in G gives rise to a triangle T' in '2(G) with a one-to-one
correspondence between the edges of T and the vertices of T'. Moreover, if T1 and T2 are two
triangles of G, then the corresponding triangles T' and Tj of '1(G) are edge-disjoint. For
suppose, to the contrary, that T' and Tj have an edge in common or, equivalently, T' and T'
13
have two common vertices, say e and f. Necessarily, e and f are common edges of T1 and
T2, which implies that T1 = T2 . Thus '7(G) has at least as many triangles as G has. We show
that only for K4 -free graphs G are the number of triangles in G and '7(G) equal.
Proposition 16 Let t(G) denote the number of triangles in a graph G. Then t(G) = t(7(G))
if and only if G is K4-free.
Proposition 17 Let G be a K4-free graph. Then Tn(G) = T 2 (G). for n > 2.
The previous result does not hold for a graph G = K4 . However, T 3(K4 ) = 8K3 and,
therefore, for n > 3 we have Tn(K4 ) = T 3(K 3). The graphs G = K4 and Ti(K4 ), 1 < i < 3,
are shown in Figure 16.
Conjecture For every graph G containing at least one triangle, there exists an integer k > 0,
such that for n > k, Tn(G) = Tk(G).
G'7(G):
T 2 (G): T 3(G):
Figure 16
n-Subgraph Distance Graphs
The n-subgraph distance graphs were introduced by Chartrand, Hevia, Jarrett, Saba, and
VanderJagt. Let G be a graph of size q (> 1) and let n be an integer with 1 < n < q. The n-
subgraph distance graph Ln(G) of G is that graph whose vertices correspond to the edge-
14
induced subgraphs of size n in G and where two vertices of Ln(G) are adjacent if and only if. the edge rotation distance between corresponding subgraphs is 1. It is convenient to label thevertices of Ln(G) by the edge sets of the corresponding subgraphs or simply by listing the edges.
Each edge in a vertex label is called a coordinate. Since the coordinates are elements of a set, the
order in which the coordinates of a vertex are listed is irrelevant. For example, if a vertex ofLn(G) corresponds to the subgraph of G induced by the edge set (e l , e2, ..., er}, then we may
label this vertex as e l , e2, ..., en or ei u X, where X = (ejI 11 _j < n,j * i, or simply as eiX.
For the graph G K1 + (K1 u K2) of Figure 17, the graphs Li(G), i = 1, 2, 3, 4, are shown.
aa
G: 0L()
bb c
dd
L2b L3 (G): bcd
a, acd Cabd
abc
bd cd
L4(G): abcd0
Figure 17
Observe that L4 (G) = K1 for the graph G of Figure 17. In general, Lq(G) = K1 for a
graph G of size q. Furthermore, LI(G)= L3 (G) for the graph G of Figure 17. This fact
illustrates the following result.
Proposition 18 Let G be a graph of size q, and let n be an integer with 1 < n < q. ThenLn(G ) = Lq-n(G ).
15
The graphs Ln(G), 1 < n < q = E(G), are also called generalized line graphs since the 1-
subgraph distance graph LI(G) is the line graph of G. We shall also refer to these graphs as n-
subgraph rotation distance graphs to distinguish them from the n-subgraph slide distance graphs,
which we are about to describe. We begin with the definition of n-subgraph slide distance.
Let G be a graph of size q (> 1), and let G1 and G2 be two edge-induced subgraphs of
G having the same size n (1 < n < q). We define the n-subgraph slide distance ds(G 1, G2)
between G, and G2 as the smallest nonnegative integer k for which there exists a sequence
H0 , H I, .... Hk of subgraphs of G such that G, = H0,G 2 = Hk and, for i= 1,2 ... k,H i
can be obtained from Hi_1 by an edge slide. If no such k exists, we define ds(G1, G2) = 00. If
G = K4 - e, and G1 and G2 are two subgraphs of G shown in Figure 18, then
ds(GI, G 2 ) = 2.
G: b GI" b G2:
a c
Z I 0____d d
Figure 18
We define the n-subgraph slide distance graph Sn(G) of G as the graph whose vertices
correspond to the edge-induced subgraphs of size n and where two vertices G1 and G2 of
Sn(G) are adjacent if and only if ds(G 1, G2 ) = 1. It is straightforward to see that SI(G) = !(G),
and, therefore, SI(G) is a spanning subgraph of LI(G). In general, for n >_ 1, Sn(G) is a
spanning subgraph of Ln(G). For the graph G = K4 - e, the graphs Si(G), 1 < i ! 5, are shown
in Figure 19.
16
G: b SIG: a b
azcZ
d d c
ceabd ade acd
bd ace
S4(G): S5(G) =-K1 :
bcde acdeabcde
abee abde
Figure 19
Observe that for the graph G =-K4 - e, we have S2(G) S3 (G) and SI(G)- S4 (G).
This fact can be generalized as follows.
Proposition 19 Let G be a graph of size q Q: 1) and let n be an integer with 1 n < q.
Then Sn(G) =-Sq-n(G).
17
Subgraph Distance for Subgraphs of the Same Order
For a connected graph G of order p and an integer n such that 1 < n < p, let F and H
be induced subgraphs of G of order n. We define a pairing 7C from the set V(F), say {v1, v2 ,
.... vn }, to the set V(H) as a one-to-one correspondence that associates a vertex of V(F) with
one of V(H). The distance induced by 7c between F and H is defined as
n
dg(F, H) = d(vi, it(vi))i= I
and the subgraph distance between F and H is
d(F, H) min 4(F, H).
This concept was introduced by Chartrand, Johns, Novotny, and Oellermann. Observe that if F
consists of a single vertex, say u, and H consists of a single vertex, say v, then d(F, H) = d(u,
v). Thus d(F, H) is a generalized distance defined in terms of subgraphs. As an example, Figure
20 gives a connected graph G, two induced subgraphs F and H of G, a listing of all pairings.from V(F) to V(H), and d(F, H).
18
Vi Ul Vi
G: F: _H: 0u1 x y
U2 (
U3 V2 V3
U3 W V2 V3
Pairings ui Vi d(u i, vi) dnk(F, H)u1 v1 2
U u2 V2 3 8
U1 V1 2It2 U2 V3 4 8
U3 V2 2Ul V2 3
7C3 U2 V 1 3 9
_ _u3 v3 3
Ul V2 3714 u2 v3 4 10
u3 Vl 3
uI v3 45 u2 Vl 3 9
U3 v 2 2uI v3 4
6 u2 v2 3 10
I u3 V1 3
d(F, H) = 8
Figure 20
19
n-Vertex Graphs
Let G be a connected graph of order p and let F and H be subgraphs of order n with
1 _< n _ p - 1. Then d(F, H) = 1 if and only if there exist adjacent vertices u r V(F) and v E
V(H) such that V(F - u) = V(H - v). We define the n-vertex graph of G as that graph Gn
whose vertices are the induced subgraphs of order n in G and two vertices F and H of Gn are
adjacent if and only if d(F, H) = 1. A graph and its 2-vertex graph are shown in Figure 21.
G j: G2
Figure 21
01,
The optimality of clustering and monotone optimal assembliesbyUriel G. Rothblum
The purpose of the study is to present recent resultswhich provide sufficient conditions for the optimality ofclustered partitions. In particular, we Introduce a formaldefinition of clustering by calling a partition clustered ifconvex hulls of the sets of the partition are disjoint* It isshown that if given vectors Al , so. , An In Rk are to bpartitioned into m groups, of predetermined sizes, so as tomaximize an objective which is a quasi-convex function ofthe sums of the vectors in each set, then a clustered optimalpartition exists. Further, if quasi-convexity is replaced bystrict quasi-convexity, every optimal partition is clustered.Computational implications of the results are discussed.
The techniques we developed enabled us todetermine sufficient conditions for the optimality ofmonotone assemblies. Here we consider the probles ofS identifying multipartitions which occur when items ofdifferent types are to be partitioned into sets, for example,in system assembly, components of different type areassigned to modules which compose the system. We showthat if the system is coherent and if the components in eachmodule are In series, there Is a reliability maximizingassembly which is monotone, i.e., it has a single modulewhich gets the best parts of each type (according to Itsspecification), there is another module which gets thesecond best parts of each type (according to itsspecification), and so on, till finally, there is one modulewhich gets the worst parts of each type (according to itsspecification).
p
CLASS(FICRTONJ AWL CLUQSTERIWN&o
PROSLEMS NRME TA FOLLOW3I4 STRQCTURFE:
PARRTITION Ft NUMER OF ENTITIES INTO
6 ROU K
ok E 9c4 ~TTty HARS SEYERL NUMiEdL
o ~ t~ PARTITIvON VsS I~t~~
C-F:XMPLES.' SThRICS - kAY'PoFS(s TESTJV~t
INheNrcky 6 ROMP I N (
~4lY;eY oF PAMs it4 R I~gC
LccTION% PRoBLEMS
comMON?RJ RcTnE - CLUX~IN6t
S 6,POU5 TO6ET~iER Er4TMTES ~~
SIMHiL Aj
?ARTI*UON(N6& FO%I.EIS - bECFIWITON.
Vd"(DISTINCT oECTORrivI
6-nORL IS To HNIMUZ
RZITMCE ?ota~u. Isi GVEN~ f~RmoR
UNRESTRkcTECD MLEIA (s UNR\\CTM
MAI1N TWEOWEtA
IS C JLLU WASI CONflE I
PN % IS Q Uf\S- CoICAVCRE IN ~ikV~PBE
Y\GTHEA >I TESTING
O'TCII OF EM1N
PART7Tlttj: ?PRTITIOtN INTO FR(CEPT VA
?,%b UtIEcT Vic) (MROUPS
gotC -V N! ?%. O 1LITY OF~ F IMT TYS't U teWl'
~ (j~ ~~PA 4 CZ( ~'~
EYzc E FA R
S GQS- COWScVE TH-EN TWE
RESTRICTETb ?ROLEM Mc-RW ~ OPTI!IRL ?RRYMTOM4
VJ(TH (lIIN ONVE2( VAJLLS
IFI S-ONCS\VC T IEN TAE
\)EKTRTEb PROBLEM~ AS RN O)?TIiFU. 'PRR71TION
*1 7ti S-SolNT Q-oNic HULL&
CO~s I b FI THE' POLY~TOPC? INJ
IT v~?RoVEtb TFKAT T"AE 1ZTE1 otJ
C1~('P OLYTCPE CPof\F..?Nb TO
IF C> STRICTj,\ GOA1 C0O4CRNF4
E\IE'y cp~mhNL ?fRRT114F
COMPUTATIONAL Sl %NI'\CRNCE
T4E<E FRE" CLLZD CotE(UTIVE PI R'TITION
-- OF (oNsUTI'V" PRRTITIoN.-"
-.: ,.,t- CfNT CoN \c .kLS
U
° .
-J; oP RLL WFITlOMS
-* OF" 1RU ?WT~Toc-. I ?DENTjIi N
o cA.,),-rRJC'tulJkE' PR-TW ?OLY4Ot4I1L It
~Nt EFFICiFNT METh~bS FOR SYSTEMATic-
e NU VA ? foK M
L('/"NT1ON ?Ro$LEMS-ClKCL L 5L Kfi-t{1
SPPOSE ccloo AR ~TO BE PR11TIoWT\CT Sc fS S TO N(NINMIZE
&Ll1 ~~ L4ES'WCkl 1
PRkbt PPLY -THeCREMi
C? OE\T IVE/ JI oPC~fAMVE
SNR ~ES e S RU C-TUR E
R~ATRAZRY COREF IET '&TRUCTJf
"s E ON) sTT o'F
gNO RMUtNV VA 0lb X-
LN ORIDIM, APPROACH TO CoNsmlsus VUNCTIONS
0 G. D. Crown and M. F. Janowitz
Abstract. Let G be a finite nonempty set. A consensus function
can be viewed as a mapping F:Gk - G. For n e Gk, F(n) is often
constructed from certain building blocks. For a family of
relations on a set X, these might be the pairs (xy) that
constitute the output relation. In its most general setting, one
can work with a triple (G,S,X) where S is the set of building
blocks and :G --)P(S) is a one-one function. A neutral consensus
function F is characterized by the existence of a family VF of
subsets of Vk = {1,2,...,k) having the property that for any k-
tuple n4 Gk, s e z(F(W)) if and only if {i: s e ' (gi)} DF-
Assuming that Vk e DF, and o * VF, we investigate conditions on
that are equivalent to: A e VF implies Vk\A E VF; A VF impliesVk\A e DF; VF is an order filter (principal filter, ultrafilter)
of Vk. The work is in progress, but here are some sample results:
Theorem. Assume G has elements g,h such that 7(g)vy(h) is not of
the form ,(v), while ?(g)c(h) - 7 (w) for some w e G. Than forany neutral F, it is true that A,B EDF implies AB *o.
(Included here would be any nondistributive lattice, as well as
any meet semilattice that is not a lattice). There are also some
general results that imply the following: If G is a lattice
having a pair of comparable sup-irreducibles, then DF is an order
filter of Vk. If G is an atomistic lattice having distinct atoms
a,b,c such that c s avb, then 2F is a principal filter (so any
neutral consensus method is an oligarchy). If G is an atomistic
join semilattice that is not a lattice having distinct atoms
a,b,c such that c s avb < 1, or if G is a nondistributive join
semilattice that is not a lattice having at least 4 atoms, such
that every line c-ntains exactly 2 points, then any neutral
consensus method is a dictatorship.
0
*ORDINAL CONSENSUS METHODS
G. D. Crown and M. F. Janowitz
(Preliminary Report)
What is a consensus method?
G = finite set of objects
(the objects to be summarized)
V = indexing set (the "voters")
(fixed, possibly infinite)
P = G = set of profiles.
Thus for R E P,
= (go) CEV or n:VH--G.
Consensus method: F:Pi -G.
Kon )ceT Rw.j,#- \J.,-Ue)
\Jjer- Or A A-V . -eL.'& FT'r- -c
~ Q~e-T
c,,- C oSeL- TejZc4..
6L-' oI-a r-j t 41C e
*For a profile n, will approach the
construction of F(n) by means of a'stability" family.
Pair (S,T), where S is a set and
: :Gi->P(S) is a one-one
mapping.
Neutral consensus method F:There is a family D = DF of
subsets of V such that
(i) o i D, but V E D.(ii) s E 7 (F (n)) if f s E 2),I
where s = {a: s E (n (a)}.
Goal: Relate properties of D with
properties of -.
* Examples of stability families
1. G = a set of binary relations
on a set X. (weak orders,
linear orders, partial orders,
quasiorders, etc.) S = XxX, and
(R)= {(x, y): xRy}
or
(R)= {(x, y): xRy fails}
2. G = finite join (or meet)
semilattice.
S = sup-irreducibles in that
a,b < s = avb < s.
T(g) = {s E S: s < g}, or
T(g) = {s E S: s $ g}.
(or the dual of this example)
3. Trees and Pyramids. X = set
with Ixi a: 5. A set 9 of
subsets of X is a trE-, if
(a) X E T, and 0 o i.
(b) {x} E 9T for all x E X.
(c) For A, B E T, AnB E {o,, A, B}
YTi s a pyramid i f (c) i s
replaced by
(d) For A,B E Y, AnB E T.
Take S = proper subsets of X,
and z (Y) = {A: A E Y} . There
are many similar examples.
) I~c~~ elA &zE cA %3 3
Cc-F-cr,- c44 c- 1 - 3 -T
Dho1 T,,4Ier i- V
9. A? ==:-P A~ n
~~r-Aur-c,i ( ~ A-&~s
(TT -r_
*Some Sample Results
F = neutral consensus method withF"
(S, -) is a stability family.
Theorem 1. If IVi a 2, (a) * (b)
(a) For any such F,
A E D implies A' i D.
(b) 3g,h E G such that y(g)uy(h)
is not of the form x'(v) for
any V E G.
I f I V 3, (c) * (d)
(c) For any such F,
B 4 D implies B' E D.
(d) There exist g, h E G such that
z(g) n (h) is not of the form
*Z(w) for any w E G.
Theorem 2. If there exist g,h E G
such that
(g) uz (h) is not of the form (v) ,
but
(g) n (h)= y(w) for some w E G,
then AB E D implies AnB 0 o.
Corollary. Let G be any non-
distributive lattice, or any join
*semilattice that is not a lattice.
Take S = sup-irreducibles, and
T(g) = {s e S: s -< g}.
Then A, B E D implies AnB # 0.
-- A VERSION OF ARRow's THEOREM
a1 ,a2, a3 E S have the following
properties:
(1) 3glg 2 ,g3 E G I (i, j,k) any
permutation of (1,2,3), then
a. E y (g.) {a a k } n (gi) 0
(2) 3z E G 9 ui,(ai) _; z(z).
i (3) 3w E G 9 {al,a 2 ,a 3 }nz(w) = 0-.
(4) {a I ,a 2 , a 3 } is transitive in
the sense that a 1 ,a 2 E 'Y (g)
implies that a 3 E T(g).
Theorem 3. If the above conditions
are satisfied and F is a neutral
consensus method, then D is a
*- lattice filter of V.
Remark. The above notion of
transitivity of course comes from
transitivity of relations:
xRy, yRz implies xRz.
But what makes it of further
interest is the fact that in a
join semilattice, c -5 avb makes
(a,b,c) a transitive triple.
SFor relations, can talk about
a triple { (a,b) , (c,d) , (x,y)} being
transitive if
aRb,cRd implies xRy.
In general, if T _ S and S E S,
can define (T,s) to be transitive
if T _; T(g) implies s E 'Y(g)
Theorem 4. Let a, b E S; x, y, z E G.
Assume:
(1) a e (g) b e (g).
(2) a E (x).(3) a (y), b r= y(y).
(4) b (z).
Then F neutral implies ) an order
filter.
S Theorem 5. Assume x, y,g,z E G, and(1) (x)uT(y) # z (w)
(2) (x) nry(y) = 0.
(3) (x) uy (y) c x (g)
(4) -(z) = 0.
Then F neutral implies D is closed
under intersections.
Theorem 6. Assume x, y, g E G, and
(1) (x) uv (y) z (v) for any v.
(2) z(x)rnz(y) - (w) for some w.(3) y (x) ux (y) c y (g).
(4) D is an order filter.
Then D is a lattice filter.
Apply these and similar results to
lattices having more than 1
*member:
Theorem 7. G is a lattice, and
S = sup-irreducibles. These are
equivalent:
(1) G is not distributive.
(2) For any neutral F, ) is a
lattice filter.
Defn: A join-semilattice G is
distributive if every principal
filter of G is a distributive
lattice.
Theorem 8. G is a nondistributive
join semilattice not a lattice,
with S its sup-irreducibles.
For any neutral consensus method F
*on G, V is an ultrafilter.
In progress: Work with
distributive join-semilattices.
The setting: A stability
family (S, X) such that:
(1) x(a)u?-(b) = z(v).
(2) Y(a)nz(b) is either z(w) or 0.