network science class 5: ba model (sept 15, 2014) albert-lászló barabási with roberta sinatra

56
Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra www.BarabasiLab.com

Upload: peter-shadbolt

Post on 14-Dec-2015

215 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Network Science

Class 5: BA model

(Sept 15, 2014)

Albert-László BarabásiWith

Roberta Sinatra

www.BarabasiLab.com

Page 2: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Introduction

Section 1

Page 3: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 1

Hubs represent the most striking difference between a random and a scale-free network. Their emergence in many real systems raises several fundamental questions:

• Why does the random network model of Erdős and Rényi fail to reproduce the hubs and the power laws observed in many real networks?

• Why do so different systems as the WWW or the cell converge to a similar scale-free architecture?

Page 4: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Growth and preferential attachment

Section 2

Page 5: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

networks expand through the addition of new nodes

Barabási & Albert, Science 286, 509 (1999)

BA MODEL: Growth

ER model: the number of nodes, N, is fixed (static models)

Page 6: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

New nodes prefer to connect to the more connected nodes

Barabási & Albert, Science 286, 509 (1999) Network Science: Evolving Network Models February 14, 2011

BA MODEL: Preferential attachment

ER model: links are added randomly to the network

Page 7: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Barabási & Albert, Science 286, 509 (1999) Network Science: Evolving Network Models February 14, 2011

Section 2: Growth and Preferential Sttachment

The random network model differs from real networks in two important characteristics:

Growth: While the random network model assumes that the number of nodes is fixed (time invariant), real networks are the result of a growth process that continuously increases.

Preferential Attachment: While nodes in random networks randomly choose their interaction partner, in real networks new nodes prefer to link to the more connected nodes.

Page 8: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

The Barabási-Albert model

Section 3

Page 9: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Barabási & Albert, Science 286, 509 (1999)

P(k) ~k-3

(1) Networks continuously expand by the addition of new nodes

WWW : addition of new documents

GROWTH:

add a new node with m links

PREFERENTIAL ATTACHMENT:

the probability that a node connects to a node with k links is proportional to k.

(2) New nodes prefer to link to highly connected nodes.

WWW : linking to well known sites

Network Science: Evolving Network Models February 14, 2011

Origin of SF networks: Growth and preferential attachment

jj

ii k

kk

)(

Page 10: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 4

Page 11: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra
Page 12: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 4 Linearized Chord Diagram

Page 13: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Degree dynamics

Section 4

Page 14: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

A.-L.Barabási, R. Albert and H. Jeong, Physica A 272, 173 (1999) Network Science: Evolving Network Models February 14, 2011

All nodes follow the same growth law

Use: During a unit time (time step): Δk=m A=m

β: dynamical exponent

Page 15: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

SF model: k(t)~t ½ (first mover advantage)

time

Deg

ree

(k)

All nodes follow the same growth law

Page 16: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 5.3

Page 17: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Degree distribution

Section 5

Page 18: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

γ = 3

A.-L.Barabási, R. Albert and H. Jeong, Physica A 272, 173 (1999)Network Science: Evolving Network Models February

14, 2011

Degree distribution

A node i can come with equal probability any time between ti=m0 and t, hence:

Page 19: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

γ = 3

A.-L.Barabási, R. Albert and H. Jeong, Physica A 272, 173 (1999)Network Science: Evolving Network Models February

14, 2011

Degree distribution

(i) The degree exponent is independent of m.

(ii) As the power-law describes systems of rather different ages and sizes, it is expected that a correct model should provide a time-independent degree distribution. Indeed, asymptotically the degree distribution of the BA model is independent of time (and of the system size N) the network reaches a stationary scale-free state.

(iii) The coefficient of the power-law distribution is proportional to m2.

Page 20: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

The mean field theory offers the correct scaling, BUT it provides the wrong coefficient of the degree distribution.

So assymptotically it is correct (k ∞), but not correct in details (particularly for small k).

To fix it, we need to calculate P(k) exactly, which we will do next using a rate equation based approach.

Network Science: Evolving Network Models February 14, 2011

Page 21: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Number of nodes with degree k at time t.

Nr. of degree k-1 nodes that acquire a new link, becoming degree k Preferential

attachment

Since at each timestep we add one node, we have N=t (total number of nodes =number of timesteps)

2m: each node adds m links, but each link contributed to the degree of 2 nodes

Number of links added to degree k nodes after the arrival of a new node:

Total number of k-nodes

New node adds m new links to other nodes

Nr. of degree k nodes that acquire a new link, becoming degree k+1

# k-nodes at time t+1 # k-nodes at time t

Gain of k-nodes via

k-1 k

Loss of k-nodes via

k k+1

MFT - Degree Distribution: Rate Equation

Page 22: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

# m-nodes at time t+1 # m-nodes at

time t

Add one m-degeree

node

Loss of an m-node via

m m+1

We do not have k=0,1,...,m-1 nodes in the network (each node arrives with degree m) We need a separate equation for degree m modes

# k-nodes at time t+1 # k-nodes at time t

Gain of k-nodes via

k-1 k

Loss of k-nodes via

k k+1

Network Science: Evolving Network Models February 14, 2011

MFT - Degree Distribution: Rate Equation

Page 23: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

k>m

We assume that there is a stationary state in the N=t∞ limit, when P(k,∞)=P(k)

k>m

Network Science: Evolving Network Models February 14, 2011

MFT - Degree Distribution: Rate Equation

Page 24: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

...m+3 k

Krapivsky, Redner, Leyvraz, PRL 2000Dorogovtsev, Mendes, Samukhin, PRL 2000 Bollobas et al, Random Struc. Alg. 2001

for large k

Network Science: Evolving Network Models February 14, 2011

MFT - Degree Distribution: Rate Equation

Page 25: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Its solution is:

Start from eq.

Dorogovtsev and Mendes, 2003Network Science: Evolving Network Models February

14, 2011

MFT - Degree Distribution: A Pretty Caveat

Page 26: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

γ = 3

Network Science: Evolving Network Models February 14, 2011

Degree distribution

(i) The degree exponent is independent of m.

(ii) As the power-law describes systems of rather different ages and sizes, it is expected that a correct model should provide a time-independent degree distribution. Indeed, asymptotically the degree distribution of the BA model is independent of time (and of the system size N)

the network reaches a stationary scale-free state.

(iii) The coefficient of the power-law distribution is proportional to m2.

for large k

Page 27: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

NUMERICAL SIMULATION OF THE BA MODEL

Page 28: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

absence of growth and preferential attachment

Section 6

Page 29: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

growth preferential attachment

Π(ki) : uniform

MODEL A

Page 30: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

tN

CttNN

Ntk

Nt

k

N

N

NkA

t

k

N

N

i

ii

i

2~

)2(

)1(2)(

1

21

1)(

)1(2

growth preferential attachment

pk : power law (initially)

Gaussian Fully Connected

MODEL B

Page 31: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Do we need both growth and preferential attachment?

YEP.

Network Science: Evolving Network Models February 14, 2011

Page 32: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Measuring preferential attachment

Section 7

Page 33: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 7 Measuring preferential attachment

t

kk

t

k ii

i

~)(

Plot the change in the degree Δk during

a fixed time Δt for nodes with degree k.

(Jeong, Neda, A.-L. B, Europhys Letter 2003; cond-mat/0104131)

No pref. attach: κ~k

Linear pref. attach: κ~k2

kK

)K()k( To reduce noise, plot the integral of Π(k) over k:

Network Science: Evolving Network Models February 14, 2011

Page 34: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

neurosci collab

actor collab.

citation network

1 ,)( kAk

kK

)K()k(

Plots shows the integral of Π(k) over k:Internet

Network Science: Evolving Network Models February 14, 2011

Section 7 Measuring preferential attachment

No pref. attach: κ~k

Linear pref. attach: κ~k2

Page 35: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Nonlinear preferenatial attachment

Section 8

Page 36: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 8 Nonlinear preferential attachment

α=0: Reduces to Model A discussed in Section 5.4. The degree distribution follows the simple exponential function.

α=1: Barabási-Albert model, a scale-free network with degree exponent 3.

0<α<1: Sublinear preferential attachment. New nodes favor the more connected nodes over the less connected nodes. Yet, for the bias is not sufficient to generate a scale-free degree distribution. Instead, in this regime the degrees follow the stretched exponential distribution:

Page 37: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 8 Nonlinear preferential attachment

α=0: Reduces to Model A discussed in Section 5.4. The degree distribution follows the simple exponential function.

α=1: Barabási-Albert model, a scale-free network with degree exponent 3.

α>1: Superlinear preferential attachment. The tendency to link to highly connected nodes is enhanced, accelerating the “rich-gets-richer” process. The consequence of this is most obvious for , when the model predicts a winner-takes-all phenomenon: almost all nodes connect to a single or a few super-hubs.

Page 38: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 8 Nonlinear preferential attachment

The growth of the hubs. The nature of preferential attachment affects the degree of the largest node. While in a scale-free network the biggest hub grows as (green curve), for sublinear preferential attachment this dependence becomes logarithmic (red curve). For superlinear preferential attachment the biggest hub grows linearly with time, always grabbing a finite fraction of all links (blue curve)). The symbols are provided by a numerical simulation; the dotted lines represent the analytical predictions.

Page 39: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra
Page 40: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

The origins of preferential attachment

Section 9

Page 41: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 9 Link selection model

Link selection model -- perhaps the simplest example of a local or random mechanism capable of generating preferential attachment.

Growth: at each time step we add a new node to the network.

Link selection: we select a link at random and connect the new node to one of nodes at the two ends of the selected link.

To show that this simple mechanism generates linear preferential attachment, we write the probability that the node at the end of a randomly chosen link has degree k as

Page 42: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 9 Copying model(a) Random Connection: with probability p the new node links to u. (b) Copying: with probability we randomly choose an outgoing link of node u and connect the new node to the selected link's target. Hence the new node “copies” one of the links of an earlier node

(a) the probability of selecting a node is 1/N. (b) is equivalent with selecting a node linked to a randomly selected link. The probability of selecting a degree-k node through the copying process of step (b) is k/2L for undirected networks. The likelihood that the new node will connect to a degree-k node follows preferential attachment

Social networks: Copy your friend’s friends.Citation Networks: Copy references from papers we read.Protein interaction networks: gene duplication,

Page 43: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 9 Optimization model

Page 44: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 9 Optimization model

Page 45: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 9 Optimization model

Page 46: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 9 Optimization model

Page 47: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 9

Page 48: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Diameter and clustering coefficient

Section 10

Page 49: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 10 Diameter

Bollobas, Riordan, 2002

Page 50: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 10 Clustering coefficient

What is the functional form of C(N)?

Reminder: for a random graph we have:

Konstantin Klemm, Victor M. Eguiluz,Growing scale-free networks with small-world behavior,Phys. Rev. E 65, 057102 (2002), cond-mat/0107607

Page 51: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

1

2

Denote the probability to have a link between node i and j with P(i,j)The probability that three nodes i,j,l form a triangle is P(i,j)P(i,l)P(j,l)

The expected number of triangles in which a node l with degree kl participates is thus:

We need to calculate P(i,j).

Network Science: Evolving Network Models February 14, 2011

CLUSTERING COEFFICIENT OF THE BA MODEL

Page 52: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Calculate P(i,j).

Node j arrives at time tj=j and the probability that it will link to node i with degree ki already in the network is determined by preferential attachment:

Where we used that the arrival time of node j is tj=j and the arrival time of node is ti=i

Let us approximate:Which is the degree of node l at current time, at time t=N

There is a factor of two difference... Where does it come from?Network Science: Evolving Network Models February

14, 2011

CLUSTERING COEFFICIENT OF THE BA MODEL

Page 53: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 10 Clustering coefficient

What is the functional form of C(N)?

Reminder: for a random graph we have:

Konstantin Klemm, Victor M. Eguiluz,Growing scale-free networks with small-world behavior,Phys. Rev. E 65, 057102 (2002), cond-mat/0107607

Page 54: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

The network grows, but the degree distribution is stationary.

Section 11: Summary

Page 55: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

The network grows, but the degree distribution is stationary.

Section 11: Summary

Page 56: Network Science Class 5: BA model (Sept 15, 2014) Albert-László Barabási With Roberta Sinatra

Section 11: Summary