02 newman randomgraphs social scalefree slides
TRANSCRIPT
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
1/21
Random Graph Models of Social
Networks
Paper Authors: M.E. Newman, D.J. Watts,
S.H. Strogatz
Presentation presented by Jessie Riposo
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
2/21
This Paper Focuses on New Techniques for
Generating Social networks
This paper focuses on how to generate random
graphs that will give degree distributions of real
world networks and how to calculate properties ofthe generated networks by using their degree
distributions
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
3/21
Paper Has Two Main Parts
Modeling graphs with arbitrary degree
distribution
Modeling affiliation networks and bipartite
graphs
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
4/21
Modeling Graphs with Arbitrary Degree
Distributions Using Random Graphs to model real world
networks has some serious short-comings
Specifically the fact that the natural degreedistribution of a random graph is unlike that of
real-world networks.
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
5/21
Known Degree Distributions
A large random graph has a Poisson Degree
distribution
Scientific Collaboration Networks, Movie ActorCollaboration Networks, and Company Director
Networks all have highly skewed degree
distributions that cannot be modeled with the
Poisson.
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
6/21
Why the Random Graph if it does not have the
correct degree distribution for real-worldnetworks?
The Random Graph Has Desirable Properties
Many features of its behavior can be calculated
exactly
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
7/21
Is it possible to create a model that
matches real-world networks better
than a random graph, but is still
exactly solvable?
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
8/21
An Algorithm that Generates a Random Graph
with the Desired Degree Distribution
Given (normalized) probabilities p that a randomlychosen vertex in the network has degree k
Take N verticesAssign to each a number k of ends (k is a
random number drawn independently ofprobability of k)
Chose ends randomly in pairs and connect withan edge
If number of ends is odd throw one edge awayand generate a new one from distribution,
repeating until number of ends is even.
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
9/21
Properties of the Network Model are
Exactly Solvable in the limit of large N
The trick is to use the generating function instead
of working directly with the degree distribution
Generating Function = SUM (p*x^k) (k=0 to 100)For example:
Avg. Degree of a vertex = Derivative of the
GF evaluated at 1.
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
10/21
From Experimentation in Social
Networks There are Two Regimes Depending upon the exact probability distribution
of the degrees there are two different regimes:
Many small clusters of vertices connectedtogether by edges
A giant cluster of connected vertices whose size
scales up with the size of the whole network
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
11/21
If Degree Distribution is Known,
Moment Functions are Used to Calculate
Size of Giant Cluster
Generating function is used to calculate the sizes
of the giant component and average components.
The fraction of the networks which is filled by
the giant component, is given by S=1-G(u)
Where u is the smallest non-neg. real solution
of G(1)u=G(u)
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
12/21
The Existence (or not) of a Giant Component is
Important in Social Networks If there is no giant component then
communication can only take place within small
groups of people If there is a giant component then a large fraction
of network can all communicate with one another
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
13/21
A Sample Problem was Derived to Test
the Models The distribution used was a power-law distribution
characterized by
P= CK^(-t)e^(-K/k)Exponent t
Cutoff length k
C is a constant fixed by the requirement to benormalized
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
14/21
The Results Show that Giant Components
Exist Only at Specific t and k When k is below.9102 a giant component can
never exist regardless of the value of t.
For values of t larger than 3.4788 a giantcomponent cannot exist regardless of the value of
k.
Almost all networks found in society and nature
appear to be well inside these limits.
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
15/21
Why Affiliation Networks and Bipartite
Graphs Affiliation networks can be used to avoid
problems of:
Hard to solicit unbiased data in social networkexperiments.
Data is usually limited
Affiliation network is a network in which actors
are joined together by common membership of
groups
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
16/21
For an Affiliation Network There are
Two Different Degree Distributions For example if looking at directors and boards the
distributions would be:
The number of boards that directors sit onThe number of directors who sit on a boards
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
17/21
Mathematically the Networks are
Generated as Random Graphs, But There are now two moment functions
One for each distribution
Let probability that a director sits on j boards equal pjand probability that a board has k members equal qk.
f(x)=Sum (pj(x^j)), g(x)=sum(qk(x^k))
j k
Clustering coefficient is different from that of therandom graph
C = 3* Number of triangles on the graph
Number of connected triples of vertices
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
18/21
Network C Theory C Actual Avg. Degree
Theory
Avg. Degree
Actual
Companydirectors .59 .588 14.53 14.44
Movie
actors
.084 .199 125.6 113.4
Physics .192 .452 16.74 9.27
Biomedicine .042 .088 18.02 16.93
Results of Experimentation
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
19/21
How Does the Theory Measure Up?
The clustering coefficient is remarkably precise
for boards of directors
For the other networks the clustering coefficientseems to be underestimated by a factor of about
two by the theory
For the other networks the average number of
collaborators is moderately accurate.
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
20/21
What Does This Mean?
Remember that the graphs were created withdegree distributions the same as real networks, butthe connections between the nodes were generatedrandomly.
Agreement between model and reality wouldindicate that there is no statistical difference
between the real-world network and an equivalentrandom network.
Differences in the models and real-world networksmay be indicating some potential sociological
phenomenon
-
7/27/2019 02 Newman Randomgraphs Social Scalefree Slides
21/21
The Main Contributions of This Paper
Were:
A set of Models that allow for the fact that thedegree distributions of real-world social networksare often highly skewed
The Statistical Properties of the networks areexactly solvable, once the degree distribution isspecified
A generalized theory in the case of bipartite
random graphs which serve as models foraffiliation networks
Models can be applied not only to SocialNetworks, but to communications, transportation,
distribution, and other networks