algorithmic facets of human centricity in computing with fuzzy sets isda-2009, pisa, italy, november...

Algorithmic Facets of Human Centricity in Computing with Fuzzy

Sets

ISDA-2009, Pisa, Italy, November 30-December 2, 2009

[email protected]

Agenda

Human centricity and information granules

Design of information granules – approaches of knowledge-basedclustering

Granular representation of computing with fuzzy sets

Human Centricity and information granules

Information granules as conceptual entities inherently associated with human pursuits (decision-making, perceptioncontrol, prediction)

Interaction with and processing in intelligent systems realized at the level of information granules (fuzzy sets, rough sets, intervals…)

Emergence of Human-Centric computing (HC2)

Knowledge sharing and collaboration in distributed systems

Human Centricity and fuzzy sets

Two fundamental quests:

Construction of information granules (fuzzy sets);use of existing experimental evidence and its interpretationCast in the framework of users/designer

Qualitative, user-centric interpretation of results of computing with fuzzy sets

Clustering as aconceptual and algorithmic framework of information

granulationData information granules (clusters) abstraction of data

Formalism of: set theory (K-Means) fuzzy sets (FCM) rough sets

shadowed sets

Main categories of clustering

Graph-oriented and hierarchical (single linkage, complete linkage, average linkage..)

Objective function-based clustering

Diversity of formalisms and optimization tools(e.g., methods of Evolutionary Computing)

Key challenges of clustering

Data-driven methods

Selection of distance function (geometry of clusters)

Number of clusters

Quality of clustering results

The dichotomy and the shift of paradigm

Human-centricityGuidance mechanisms

Fuzzy Clustering: Fuzzy C-Means (FCM)

Given data x1, x2, …, xN, determine its structure byforming a collection of information granules – fuzzy sets

Objective function

2ik

N

1k

mik

c

1i||||uQ vx

Minimize Q; structure in data (partition matrix and prototypes)

Fuzzy Clustering: Fuzzy C-Means (FCM)

Vi – prototypes

U- partition matrix

FCM – optimization

2ik

N

1k

mik

c

1i||||uQ vx

Minimize

subject to

(a) prototypes

(b) partition matrix

Domain Knowledge:Category of knowledge-

oriented guidance

Context-based guidance: clustering realized in a certain contextspecified with regard to some attribute

Viewpoints: some structural information is provided

Partially labeled data: some data are provided with labels (classes)

Proximity knowledge: some pairs of data are quantified interms of their proximity (resemblance, closeness)

Clustering with domain knowledge

(Knowledge-based clustering)

Data

Information granules (structure)

CLUSTERING

Domain knowledge

Data-driven Data- and knowledge-driven

Data

Information granules (structure)

CLUSTERING

Context-based clustering

Clustering : construct clusters in input space X

Context-based Clustering : construct clusters in input space X given some context expressed in output space Y

Active role of the designer [customization of processing]

Context-based clustering:Conmputational considerations

•computationally more efficient,•well-focused, •designer-guided clustering process

Data

structure

Data

structure

context

Context-based clustering:focus mechanism

Determine structure in input space given the output is high

Determine structure in input space given the output is medium

Determine structure in input space given the output is low

Input space (data)

Context-based clustering:examples

Find a structure of customer data [clustering]

Find a structure of customer data considering customers making weekly purchases in the range [$1,000 $3,000]

Find a structure of customer data considering customers making weekly purchases at the level of

around $ 2,500

Find a structure of customer data considering customers making significant weekly purchases who

are young

no context

context

context

context(compound)

Context-oriented FCM

Data (xk, targetk), k=1,2,…,N

Contexts: fuzzy sets W1, W2, …, Wp

wjk = Wi(targetk) membership of j-th context for k-th data

c

1i

N

1kikjkikikj iNu0andk wu|0,1u)(WU

Context-driven partition matrix

Context-oriented FCM:Optimization flow

Objective function

Iterative adjustment of partition matrix and prototypes

2ik

c

1i

N

1k

mik ||||uQ vx

c

1j

1m

2

jk

ik

jkik

wu

vx

vx

N

1k

mik

N

1kk

mik

i

u

u xv

Subject to constraint U in U(Wj)

Viewpoints: definition

Description of entity (concept) which is deemed essential in describing phenomenon (system) and helpful in castingan overall analysis in a required setting

“external” , “reinforced” clusters


-150

-100

-50

0

50

100

150

200

0 100 200 300 400 500

x1

x2

a

b

x1

x2

a

viewpoint (a,b) viewpoint (a,?)


Description of entity (concept) which is deemed essential in describing phenomenon (system) and helpful in castingan overall analysis in a required setting

“external” , “reinforced” clusters

Viewpoints in fuzzy clustering

x1

x2

a

b

otherwise 0,

viewpointby the determined is B of rowth -i theof featureth -j theif 1,b ij

0

0

1

0

0

1

B

0

0

b

0

0

a

F

B- Boolean matrix characterizing structure: viewpoints prototypes (induced by data)

Viewpoints in localization of “extreme” information granules

specification of viewpoints through evolutionary/population-basedoptimization

Viewpoints in fuzzy clustering

Q = 2ijkj

n

1:bji,1j

mik

c

1i

N

1k

2ijkj

n

0:bji,1j

mik

c

1i

N

1k

)f(xu)v(xu

ijij

1b if f

0bif vg

ijij

ijijij

2ijkj

n

1j

mik

c

1i

N

1k

)g(xuQ

Labelled data and their description

Characterization in terms of membership degrees:

F = [fik] i=12,…,c , k=1,2, …., N

supervision indicator b = [bk], k=1,2,…, N

Augmented objective function

€

Q =i=1

c

∑ uik2

k=1

N

∑ || xk − vi ||2 +β∑ (uik − fik )2bk || xk − vi ||2∑

> 0

Proximity hints

Characterization in terms of proximity degrees:

Prox(k, l), k, l=1,2, …., N

and supervision indicator matrix B = [bkl], k, l=1,2,…, N

Prox(k,l)

Prox(s,t)

Proximity measure

Properties of proximity:

(a)Prox(k, k) =1

(b)Prox(k,l) = Prox(l,k)

Proximity induced by partition matrix U:

€

Prox(k,l) = min(uik

i=1

c

∑ ,uil )

Linkages with kernel functions K(xk, xl)

Augmented objective function

€

Q =i=1

c

∑ uik2

k=1

N

∑ || xk − vi ||2 +βi=1

c

∑k1=1

N

∑ [Prox(k1,k2) − Prox(U)(k1,k2)]2b(k1, k2) || xk1 − xk2 ||2

k2=1

N

∑

> 0

Two general development strategies

SELECTION OF A “MEANINGFUL” SUBSET OF INFORMATION GRANULES


(1) HIERARCHICAL DEVELOPMENT OF INFORMATION GRANULES (INFORMMATION GRANULES OF HIGHER TYPE)

Information granulesType -1



(2) HIERARCHICAL DEVELOPMENT OF INFORMATION GRANULES AND THE USE OF VIEWPOINTS



viewpoints


(3) HIERARCHICAL DEVELOPMENT OF INFORMATION GRANULES – A MODE OF SUCCESSIVE CONSTRUCTION

Fuzzy Computing:Interpretability

Interpretation of fuzzy sets - departure from pure numeric quantification of membership grades

A= [0.11 0.19 0.34 0.45 1.00 0.98 0.821 0.447…]

Granulation of fuzzy sets

Granulation of membership grades

low, high, medium membership of alternative x

Granulation of membership grades and universe of discourse

low membership for a collection of alternatives….

Granulation of membershipgrades

A= [0.11 0.19 0.34 0.45 1.00 0.98 0.821 0.447…]

A= [L L L M M L L M…]

Granulation of membershipgrades: optimization

A= [L L L M M L L M…]

Entropy minimization

G= {G1, G2, …, Gc}

€

x

∑ H(G i

i=1

c

∑ (x))⇒ MinG


A= [L M L M…]

Granulation of fuzzy sets:optimization

G1

Gi

Gc

1

i c

€

Vol = Vol(G i

i=1

c

∑ ,Wi) ⇒ MinG

Interpretability of fuzzy set computing

Fuzzy set computing

Interpretability layer


Interpretability of fuzzy set computing

Equivalence sought with respect with assumed levelinterpretability:•stability•Equivalence of models

distinguishability

Non-distinguishability

Fuzzy set computing: a retrospective

interpretability

accuracy

~1970

after ~1990

neurofuzzy

evolutionary

Rule-based

Conclusions

Leitmotiv of human-centricity and its underlying reliance on information granules

Design of information granules – shift from data to knowledge-enhanced clustering

Revisiting the practice of fuzzy computing and its interpretabilitycapabilities

algorithmic facets of human centricity in computing with fuzzy sets isda-2009, pisa, italy, november...

Documents

fuzzy setsclustering

fuzzy setsisda

fuzzy setshuman centricity

structural information

prototypesfuzzy clustering

clustersquality of clustering

structure of c

structure byforming