scientific american web

11
7/31/2019 Scientific American Web http://slidepdf.com/reader/full/scientific-american-web 1/11 60 SCIENTIFIC AMERICAN MA Scientists have recently discovered that various complex systems hav an underlying architecture governed by shared organizing pr This insight has important implications for a host of applications, from drug development to Internet security BY ALBERT-LÁSZLÓ BARABÁSI AND Scale - Free Networks THE INTERNET, mapped on the opposite page, is a scale-free network in t some sites (starbursts and detail above ) have a seemingly unlimited number of connections to other sites. This map, made on February 6, 2 traces the shortest routes from a test Web site to about 100,000 othe using like colors for similar Web addresse COPYRIGHT 2003 SCIENTIFIC AMERICAN INC

Upload: solidteacher

Post on 05-Apr-2018

219 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Scientific American Web

7/31/2019 Scientific American Web

http://slidepdf.com/reader/full/scientific-american-web 1/1160 SCIENTIFIC AMERICAN MA

Scientists have recently discovered that various complex systems havan underlying architecture governed by shared organizing prThis insight has important implications for a host of applications, from drug development to Internet security

BY ALBERT-LÁSZLÓ BARABÁSI AND

Scale -FreeNetworks

THE INTERNET,mapped on the opposite page, is a scale-free network in tsome sites (starbursts and detail above) have a seemingly unlimited

number of connections to other sites. This map, made on February 6, 2traces the shortest routes from a test Web site to about 100,000 othe

using like colors for similar Web addresse

COPYRIGHT 2003 SCIENTIFIC AMERICAN INC

Page 2: Scientific American Web

7/31/2019 Scientific American Web

http://slidepdf.com/reader/full/scientific-american-web 2/11

BURCHCHESW

CKMAP

C O U R T E S Y O F L U M E T A C O R P O R A T I O N

COPYRIGHT 2003 SCIENTIFIC AMERICAN INC

Page 3: Scientific American Web

7/31/2019 Scientific American Web

http://slidepdf.com/reader/full/scientific-american-web 3/11

The brain is a network of nerve cells con-nected by axons, and cells themselves arenetworks of molecules connected by bio-chemical reactions. Societies, too, are net-works of people linked by friendships,

familial relationships and professionalties. On a larger scale, food webs and eco-systems can be represented as networksof species. And networks pervade tech-nology: the Internet, power grids andtransportation systems are but a few ex-amples. Even the language we are usingto convey these thoughts to you is a net-work, made up of words connected bysyntactic relationships.

Yet despite the importance and per-vasiveness of networks, scientists have

had little understanding of their structureand properties. How do the interactionsof several malfunctioning nodes in a com-plex genetic network result in cancer?How does diffusion occur so rapidly incertain social and communications sys-tems, leading to epidemics of diseases andcomputer viruses? How do some net-works continue to function even after thevast majority of their nodes have failed?

Recent research has begun to answersuch questions. Over the past few years,investigators from a variety of elds havediscovered that many networks —fromthe World Wide Web to a cell’s metabol-ic system to actors in Hollywood —aredominated by a relatively small numberof nodes that are connected to many oth-er sites. Networks containing such im-portant nodes, or hubs, tend to be whatwe call “scale-free,” in the sense thatsome hubs have a seemingly unlimited

number of links and no node is typical of the others. These networks also behave incertain predictable ways; for example,they are remarkably resistant to acciden-tal failures but extremely vulnerable to

coordinated attacks.Such discoveries have dramaticallychanged what we thought we knew aboutthe complex interconnected world aroundus. Unexplained by previous network the-ories, hubs offer convincing proof thatvarious complex systems have a strict ar-chitecture, ruled by fundamental laws —laws that appear to apply equally to cells,computers, languages and society. Fur-thermore, these organizing principles havesignicant implications for developing

better drugs, defending the Internet fromhackers, and halting the spread of deadlyepidemics, among other applications.

Networks without ScaleF O R M O R E T H A N 40 Y E A R S , sciencetreated all complex networks as beingcompletely random. This paradigm has itsroots in the work of two Hungarian math-ematicians, the inimitable Paul Erd ″os andhis close collaborator Alfréd Rényi. In1959, aiming to describe networks seen incommunications and the life sciences,Erd ″os and Rényi suggested that such sys-tems could be effectively modeled by con-necting their nodes with randomly placedlinks. The simplicity of their approach andthe elegance of some of their related theo-rems revitalized graph theory, leading tothe emergence of a eld in mathematicsthat focuses on random networks.

An important prediction of random-

network theory is that, despite the ran-dom placement of links, the resulting sys-tem will be deeply democratic: mostnodes will have approximately the samenumber of links. Indeed, in a random net-

work the nodes follow a Poisson distrib-ution with a bell shape, and it is extreme-ly rare to nd nodes that have signicant-ly more or fewer links than the average.Random networks are also called expo-nential, because the probability that anode is connected to k other sites de-creases exponentially for large k.

So in 1998, when we, together withHawoong Jeong and Réka Albert of theUniversity of Notre Dame, embarked ona project to map the World Wide Web,

we expected to nd a random network.Here’s why: people follow their uniqueinterests when deciding what sites to linktheir Web documents to, and given the di-versity of everyone’s interests and thetremendous number of pages they canchoose from, the resulting pattern of con-nections should appear fairly random.

The measurements, however, deedthat expectation. Software designed forthis project hopped from one Web pageto another and collected all the links itcould. Although this virtual robot reachedonly a tiny fraction of the entire Web, themap it assembled revealed somethingquite surprising: a few highly connectedpages are essentially holding the WorldWide Web together. More than 80 per-cent of the pages on the map had fewerthan four links, but a small minority, lessthan 0.01 percent of all nodes, had morethan 1,000. (A subsequent Web surveywould uncover one document that hadbeen referenced by more than two millionother pages!)

Counting how many Web pages haveexactly k links showed that the distribu-tion followed a so-called power law: theprobability that any node was connectedto k other nodes was proportional to 1 ⁄ k n .The value of n for incoming links was ap-proximately 2, so, for instance, any nodewas roughly four times as likely to havejust half the number of incoming links asanother node. Power laws are quite dif-ferent from the bell-shaped distributionsthat characterize random networks.

62 SCIENTIFIC AMERICAN MA

■ A variety of complex systems share an important property: some nodes havea tremendous number of connections to other nodes, whereas most nodes have just a handful. The popular nodes, called hubs, can have hundreds, thousandsor even millions of links. In this sense, the network appears to have no scale.

■ Scale-free networks have certain important characteristics. They are,for instance, robust against accidental failures but vulnerable tocoordinated attacks.

■ Understanding of such characteristics could lead to new applications inmany arenas. For example, computer scientists might be able to devisemore effective strategies for preventing computer viruses from cripplinga network such as the Internet.

Overview/ Scale-Free Networks

Networks are everywhere.

COPYRIGHT 2003 SCIENTIFIC AMERICAN INC

Page 4: Scientific American Web

7/31/2019 Scientific American Web

http://slidepdf.com/reader/full/scientific-american-web 4/11

Specically, a power law does not have apeak, as a bell curve does, but is instead de-scribed by a continuously decreasing func-tion. When plotted on a double-logarith-mic scale, a power law is a straight line[see illustration above ]. In contrast to thedemocratic distribution of links seen inrandom networks, power laws describesystems in which a few hubs, such as Ya-hoo and Google, dominate.

Hubs are simply forbidden in randomnetworks. When we began to map theWeb, we expected the nodes to follow abell-shaped distribution, as do people’sheights. Instead we discovered certainnodes that deed explanation, almost asif we had stumbled on a signicant num-ber of people who were 100 feet tall, thusprompting us to coin the term “scale-free.”

Scale-Free Networks AboundO V E R T H E P A S T several years, re-searchers have uncovered scale-free struc-tures in a stunning range of systems.When we studied the World Wide Web,we looked at the virtual network of Webpages connected to one another by hy-perlinks. In contrast, Michalis Faloutsosof the University of California at River-side, Petros Faloutsos of the University of Toronto and Christos Faloutsos of Car-negie Mellon University analyzed the

physical structure of the Internet. Thesethree computer-scientist brothers investi-gated the routers connected by optical orother communications lines and foundthat the topology of that network, too, isscale-free.

Researchers have also discovered that

some social networks are scale-free. A col-laboration between scientists from BostonUniversity and Stockholm University, forinstance, has shown that a network of sexual relationships among people inSweden followed a power law: althoughmost individuals had only a few sexualpartners during their lifetime, a few (thehubs) had hundreds. A recent study ledby Stefan Bornholdt of the University of Kiel in Germany concluded that the net-work of people connected by e-mail islikewise scale-free. Sidney Redner of Boston University demonstrated that thenetwork of scientic papers, connectedby citations, follows a power law as well.And Mark Newman of the University of Michigan at Ann Arbor examined col-laborations among scientists in several

www.sciam.com SCIENTIFIC AMERI63

SLMFLMS

RANDOM VERSUS SCALE-FREE NETWO

Number of Links

N u m

b e r o

f N o

d e s Typical node

Number of Links

N u m

b e r o

f N o

d e s

Number of Links (log scale)

N u m b

e r o

f N o

d e s

( l o g s c a

l e )

RANDOM NETWORKS,which resemble the U.S. highway system(simplified in left map), consist of nodes with randomly placedconnections. In such systems, a plot of the distribution of nodelinkages will follow a bell-shaped curve (left graph), with mostnodes having approximately the same number of links.

In contrast, scale-free networks, which resemble the U.S.airline system (simplified in right map), contain hubs (red)—

nodes with a very high number of links. In such networks, tdistribution of node linkages follows a power law (center graph)in that most nodes have just a few connections and some haa tremendous number of links. In that sense, the system has“scale.” The defining characteristic of such networks is thatdistribution of links, if plotted on a double-logarithmic scal(right graph), results in a straight line.

Bell Curve Distribution of Node Linkages Power Law Distribution of Node Linkages

Random Network Scale-Free Network

COPYRIGHT 2003 SCIENTIFIC AMERICAN INC

Page 5: Scientific American Web

7/31/2019 Scientific American Web

http://slidepdf.com/reader/full/scientific-american-web 5/11

disciplines, including physicians and com-puter scientists, and found that those net-works were also scale-free, corroboratinga study we conducted focusing on math-ematicians and neurologists. (Interesting-ly, one of the largest hubs in the mathe-

matics community is Erd″

os himself, whowrote more than 1,400 papers with nofewer than 500 co-authors.)

Scale-free networks can occur in busi-ness. Walter W. Powell of Stanford Uni-versity, Douglas R. White of the Univer-sity of California at Irvine, Kenneth W.Koput of the University of Arizona, and

Jason-Owen Smith of the University of Michigan studied the formation of al-liance networks in the U.S. biotechnolo-gy industry and discovered denite hubs —

for instance, companies such as Genzyme,Chiron and Genentech had a dispropor-tionately large number of partnershipswith other rms. Researchers in Italy tooka deeper look at that network. Using datacollected by the University of Siena’s Phar-maceutical Industry Database, which nowprovides information for around 20,100R&D agreements among more than 7,200organizations, they found that the hubsdetected by Powell and his colleagues wereactually part of a scale-free network.

Even the network of actors in Holly-wood —popularized by the game Six De-grees of Kevin Bacon, in which playerstry to connect actors to Bacon via themovies in which they have appeared to-gether —is scale-free. A quantitative analy-

sis of that network showed that it, too,is dominated by hubs. Specically, al-though most actors have only a few linksto others, a handful of actors, includingRod Steiger and Donald Pleasence, havethousands of connections. (Incidentally,

on a list of most connected actors, Baconranked just 876th.)On a more serious note, scale-free

networks are present in the biologicalrealm. With Zoltán Oltvai, a cell biologistfrom Northwestern University, we founda scale-free structure in the cellular meta-bolic networks of 43 different organismsfrom all three domains of life, includingArchaeoglobus fulgidus (an archaebac-terium), Escherichia coli (a eubacterium)and Caenorhabditis elegans (a eukary-

ote). In such networks, cells burn food bysplitting complex molecules to release en-ergy. Each node is a particular molecule,and each link is a biochemical reaction.We found that most molecules participatein just one or two reactions, but a few (thehubs), such as water and adenosine tri-phosphate, play a role in most of them.

We discovered that the protein-inter-action network of cells is scale-free as well.In such a network, two proteins are “con-nected” if they are known to interact witheach other. When we investigated Baker’syeast, one of the simplest eukaryotic (nu-cleus-containing) cells, with thousands of proteins, we discovered a scale-free topol-ogy: although most proteins interact withonly one or two others, a few are able to

attach themselves physically to a hugenumber. We found a similar result in theprotein-interaction network of an organ-ism that is very different from yeast, a sim-ple bacterium called Helicobacter pylori.

Indeed, the more that scientists stud-

ied networks, the more they uncoveredscale-free structures. These ndings raisedan important question: How can systemsas fundamentally different as the cell andthe Internet have the same architectureand obey the same laws? Not only arethese various networks scale-free, theyalso share an intriguing property: for rea-sons not yet known, the value of n in thek n term of the power law tends to fall be-tween 2 and 3.

The Rich Get RicherP E R HA P S A M O R E B A S I C question iswhy random-network theory fails to ex-plain the existence of hubs. A closer ex-amination of the work of Erd ″os and Rén-yi reveals two reasons.

In developing their model, Erd ″os andRényi assumed that they had the full in-ventory of nodes before they placed thelinks. In contrast, the number of docu-ments on the Web is anything but con-stant. In 1990 the Web had only one page.Now it has more than three billion. Mostnetworks have expanded similarly. Hol-lywood had only a handful of actors in1890, but as new people joined the trade,the network grew to include more thanhalf a million, with the rookies connect-ing to veteran actors. The Internet hadonly a few routers about three decadesago, but it gradually grew to have mil-lions, with the new routers always linkingto those that were already part of the net-work. Thanks to the growing nature of real networks, older nodes had greateropportunities to acquire links.

Furthermore, all nodes are not equal.When deciding where to link their Webpage, people can choose from a few billionlocations. Yet most of us are familiar withonly a tiny fraction of the full Web, andthat subset tends to include the more con-nected sites because they are easier to nd.By simply linking to those nodes, peopleexercise and reinforce a bias toward them.This process of “preferential attachment”occurs elsewhere. In Hollywood the more

64 SCIENTIFIC AMERICAN MA

NETWORK NODES LINKS

Cellular metabolism Molecules involved in Participation in the sameburning food for energy biochemical reaction

Hollywood Actors Appearance in the same movie

Internet Routers Optical and otherphysical connections

Protein regulatory Proteins that help to Interactions amongnetwork regulate a cell’s activities proteins

Research collaborations Scientists Co-authorship of papers

Sexual relationships People Sexual contact

World Wide Web Web pages URLs

Examples of Scale-Free Networks

COPYRIGHT 2003 SCIENTIFIC AMERICAN INC

Page 6: Scientific American Web

7/31/2019 Scientific American Web

http://slidepdf.com/reader/full/scientific-american-web 6/11

Page 7: Scientific American Web

7/31/2019 Scientific American Web

http://slidepdf.com/reader/full/scientific-american-web 7/11

tion. This is certainly true for random net-works: if a critical fraction of nodes is re-moved, these systems break into tiny,noncommunicating islands. Yet simula-tions of scale-free networks tell a differentstory: as many as 80 percent of random-

ly selected Internet routers can fail and theremaining ones will still form a compactcluster in which there will still be a pathbetween any two nodes. It is equally dif-cult to disrupt a cell’s protein-interactionnetwork: our measurements indicate thateven after a high level of random muta-tions are introduced, the unaffected pro-teins will continue to work together.

In general, scale-free networks dis-play an amazing robustness against ac-cidental failures, a property that is root-

ed in their inhomogeneous topology. Therandom removal of nodes will take outmainly the small ones because they aremuch more plentiful than hubs. And theelimination of small nodes will not dis-rupt the network topology signicantly,because they contain few links comparedwith the hubs, which connect to nearlyeverything. But a reliance on hubs has aserious drawback: vulnerability to attacks.

In a series of simulations, we found

that the removal of just a few key hubsfrom the Internet splintered the systeminto tiny groups of hopelessly isolatedrouters. Similarly, knockout experimentsin yeast have shown that the removal of the more highly connected proteins has a

signicantly greater chance of killing theorganism than does the deletion of othernodes. These hubs are crucial —if muta-tions make them dysfunctional, the cellwill most likely die.

A reliance on hubs can be advanta-geous or not, depending on the system.Certainly, resistance to random break-down is good news for both the Internetand the cell. In addition, the cell’s relianceon hubs provides pharmaceutical re-searchers with new strategies for selecting

drug targets, potentially leading to curesthat would kill only harmful cells or bac-teria by selectively targeting their hubs,while leaving healthy tissue unaffected.But the ability of a small group of well-in-formed hackers to crash the entire com-munications infrastructure by targetingits hubs is a major reason for concern.

The Achilles’ heel of scale-free net-works raises a compelling question: Howmany hubs are essential? Recent research

suggests that, generally speaking, the si-multaneous elimination of as few as 5 to15 percent of all hubs can crash a system.For the Internet, our experiments implythat a highly coordinated attack —rst re-moving the largest hub, then the next

largest, and so on —could cause signi-cant disruptions after the elimination of just several hubs. Therefore, protectingthe hubs is perhaps the most effectiveway to avoid large-scale disruptionscaused by malicious cyber-attacks. Butmuch more work is required to deter-mine just how fragile specic networksare. For instance, could the failure of sev-eral hubs like Genzyme and Genentechlead to the collapse of the entire U.S. bio-tech industry?

Scale-Free EpidemicsK N O W L E D G E A B O U T scale-free net-works has implications for understandingthe spread of computer viruses, diseasesand fads. Diffusion theories, intensivelystudied for decades by both epidemi-ologists and marketing experts, predict acritical threshold for the propagation of a contagion throughout a population.Any virus, disease or fad that is less in-fectious than that well-dened thresholdwill inevitably die out, whereas thoseabove the threshold will multiply expo-nentially, eventually penetrating the en-tire system.

Recently, though, Romualdo Pastor-Satorras of the Polytechnic University of Catalonia in Barcelona and AlessandroVespigniani of the International Centerfor Theoretical Physics in Trieste, Italy,reached a disturbing conclusion. Theyfound that in a scale-free network thethreshold is zero. That is, all viruses, eventhose that are weakly contagious, willspread and persist in the system. This re-sult explains why Love Bug, the mostdamaging computer virus thus far (it shutdown the British Parliament in 2000),was still one of the most pervasive virus-es a year after its supposed eradication.

Because hubs are connected to manyother nodes, at least one hub will tend tobe infected by any corrupted node. Andonce a hub has been infected, it will passthe virus to numerous other sites, eventu-ally compromising other hubs, which will

66 SCIENTIFIC AMERICAN MA

Computing■ Computer networks with scale-free architectures, such as the World Wide Web,

are highly resistant to accidental failures. But they are very vulnerable to deliberateattacks and sabotage.

■ Eradicating viruses, even known ones, from the Internet will be effectively impossible.

Medicine■ Vaccination campaigns against serious viruses, such as smallpox, might be most

effective if they concentrate on treating hubs—people who have many connectionsto others. But identifying such individuals can be difficult.

■ Mapping out the networks within the human cell could aid researchers in uncoveringand controlling the side effects of drugs. Furthermore, identifying the hub moleculesinvolved in certain diseases could lead to new drugs that would target those hubs.

Business■ Understanding how companies, industries and economies are interlinked could help

researchers monitor and avoid cascading financial failures.■ Studying the spread of a contagion on a scale-free network could offer new ways for

marketers to propagate consumer buzz about their products.

The Potential Implicationsof Scale-Free Networks for . . .

COPYRIGHT 2003 SCIENTIFIC AMERICAN INC

Page 8: Scientific American Web

7/31/2019 Scientific American Web

http://slidepdf.com/reader/full/scientific-american-web 8/11

then spread the virus throughout the en-tire system.

The fact that biological viruses spreadin social networks, which in many casesappear to be scale-free, suggests that sci-entists should take a second look at thevolumes of research written on the inter-play of network topology and epidemics.Specically, in a scale-free network, the

traditional public health approach of ran-dom immunization could easily fail be-cause it would very likely neglect a num-ber of the hubs. In fact, nearly everyonewould have to be treated to ensure thatthe hubs were not missed. A vaccinationfor measles, for instance, must reach 90percent of the population to be effective.

Instead of random immunizations,

though, what if doctors targeted the hubs,or the most connected individuals? Re-search in scale-free networks indicatesthat this alternative approach could be ef-fective even if the immunizations reachedonly a small fraction of the overall popu-lation, provided that the fraction con-tained the hubs.

But identifying the hubs in a social

HOW ROBUST ARE RANDOM AND SCALE-FREE THE ACCIDENTAL FAILUREof a number of nodes in a randomnetwork (top panels) can fracture the system into non-communicating islands. In contrast, scale-free networks are

more robust in the face of such failures (middle panels).But they are highly vulnerable to a coordinated attack againtheir hubs (bottom panels).

www.sciam.com SCIENTIFIC AMERI67

SLMFLMS

Random Network, Accidental Node Failure

Scale-Free Network, Accidental Node Failure

Scale-Free Network, Attack on Hubs

Before After

Before After

Before After

Hub

Hub

Node Failed node

Failed node

Attacked hub

COPYRIGHT 2003 SCIENTIFIC AMERICAN INC

Page 9: Scientific American Web

7/31/2019 Scientific American Web

http://slidepdf.com/reader/full/scientific-american-web 9/11

network is much more difcult than de-tecting them in other types of systems.Nevertheless, Reuven Cohen and ShlomoHavlin of Bar-Ilan University in Israel, to-gether with Daniel ben-Avraham of Clarkson University, have proposed a

clever solution: immunize a small fractionof the random acquaintances of arbitrar-ily selected individuals, a procedure thatselects hubs with a high probability be-cause they are linked to many people.That approach, though, leads to a num-ber of ethical dilemmas. For instance,even if the hubs could be identied,should they have priority for immuniza-

tions and cures? Such issues notwith-standing, targeting hubs could be themost pragmatic solution for the futuredistribution of AIDS or smallpox vaccinesin countries and regions that do not havethe resources to treat everyone.

In many business contexts, peoplewant to start, not stop, epidemics. Viralmarketing campaigns, for instance, oftenspecically try to target hubs to speed theadoption of a product. Obviously, such astrategy is not new. Back in the 1950s, astudy funded by pharmaceutical giantPzer discovered the important role thathubs play in how quickly a community of

doctors begins using a new drug. Indeed,marketers have intuitively known forsome time that certain customers outshineothers in spreading promotional buzzabout products and fads. But recent workin scale-free networks provides the scien-

tic framework and mathematical tools toprobe that phenomenon more rigorously.

From Theory to PracticeA LT HO U G H S C A L E - F R E E networksare pervasive, numerous prominent ex-ceptions exist. For example, the highwaysystem and power grid in the U.S. are notscale-free. Neither are most networks

68 SCIENTIFIC AMERICAN MA

IN 1967 STANLEY MILGRAM,a social psychologist at HarvardUniversity, sent hundreds of letters to people in Nebraska,asking them to forward the correspondence to acquaintanceswho might be able to shepherd it closer to a target recipient: astockbroker in Boston. To track each of the different paths,Milgram asked the participants to mail a postcard back to himwhen they passed the letter to someone else. Milgram found thatthe letters that eventually arrived at the final destination hadpassed through an average of six individuals—the basis of thepopular notion of “six degrees of separation” between everyone.

Although Milgram’s work was hardly conclusive—most of theletters never made their way to the stockbroker—scientists haverecently learned that other networks exhibit this “small world”property. We have found, for instance, that a path of just threereactions will connect almost any pair of chemicals in a cell. And

on the World Wide Web, which contains more than three billiodocuments, Web pages are typically 19 clicks from one another

The small-world property does not necessarily indicate thepresence of any magic organizing principle. Even a large netwwith purely random connections will be a small world. Considthat you might have about 1,000 acquaintances. If each of thosindividuals also knows another 1,000, then a million people wbe just two handshakes away from you, a billion will be just thaway, and the earth’s entire population will be well within fouGiven that fact, the notion that any two strangers in the world aseparated by an average of six degrees seems almost trivial. Bufurther investigation reveals some deeper insights.

Our simple calculation assumes that the people you know aall strangers to one another. In reality, there is much overlap.Indeed, society is fragmented into clusters of individuals havinsimilar characteristics (such as income or interests), a featurethat has been widely discussed in the sociology literaturefollowing the seminal work in the 1970s of Mark Granovetter,then a graduate student at Harvard. Clustering is also a generalproperty of many other types of networks. In 1998 Duncan Waand Steven Strogatz, then both at Cornell University, foundsignificant clustering in a variety of systems, from the U.S. powgrid to the neural network of theCaenorhabditis elegansworm.

At first glance, isolated clusters of highly interconnectednodes appear to run counter to the topology of scale-freenetworks, in which a number of hubs radiate throughout thesystem, linking everything. Recently, however, we have shownthat the two properties are compatible: a network can be bothhighly clustered and scale-free when small, tightly interlinkedclusters of nodes are connected into larger, less cohesive group(left). This type of hierarchy appears to exist in a number of systems, from the World Wide Web (in which clusters aregroupings of Web pages devoted to the same topic) to a cell(in which clusters are teams of molecules responsible fora specific function). — A.-L.B. and E.B.

It’s a Small World, After All

HIERARCHICAL CLUSTERS,shown schematically, could include, say, Webpages on the Frank Lloyd Wright home Fallingwater ( yellow ), which couldbe linked to other clusters ( green) focusing on Wright, famous homes orPennsylvania’s attractions. Those sites, in turn, could be connected toclusters (red) on famous architects or architecture in general.

Clusters in differenthierarchical levels

COPYRIGHT 2003 SCIENTIFIC AMERICAN INC

Page 10: Scientific American Web

7/31/2019 Scientific American Web

http://slidepdf.com/reader/full/scientific-american-web 10/11

seen in materials science. In a crystal lat-tice, for instance, atoms have the samenumber of links to their neighbors. Withother networks, the data are inconclusive.The relatively small size of food webs,which show predator-prey relationships,has prevented scientists from reaching aclear conclusion regarding that network’stype. And the absence of large-scale con-nectivity maps of the brain has kept re-searchers from knowing the nature of that important network as well.

Determining whether a network isscale-free is important in understandingthe system’s behavior, but other signi-cant parameters merit attention, too.One such characteristic is the diameter,or path length, of a network: the largestnumber of hops required to get from onenode to another by following the shortestroute possible [ see box on opposite page ].

Finally, knowledge of a network’s gen-eral topology is just part of the story in un-derstanding the overall characteristics and

behavior of such systems. There might besteep costs, for instance, with the additionof each link to a given node that couldprevent certain networks (such as the U.S.highway system) from becoming scale-free. In food chains, some prey are easierto catch than others, and that fact has aprofound effect on the overall ecosystem.With social networks, ties among house-hold members are much stronger thanconnections to casual acquaintances, sodiseases (and information) are more like-ly to spread through such linkages. Fortransportation, transmission and commu-nications systems (such as the Internet),

congestion along specic links is a majorconsideration: too much trafc on a par-ticular link can cause it to break down,leading to the potential failure of otherlinks that must then handle the spillover.And the nodes themselves might not be

homogeneous —certain Web pages havemore interesting content, for instance —which could greatly alter the preferential-attachment mechanism.

Because of these and other factors, sci-entists have only begun to uncover the be-havior of scale-free systems. Immunizinghubs, for instance, might not be sufcientto stop the spread of a disease; a more ef-fective solution might be found by con-sidering not just the number of connec-tions a person has but also the frequency

and duration of contact for those links.In essence, we have studied complexnetworks rst by ignoring the details of their individual links and nodes. By dis-tancing ourselves from those particulars,we have been able to better glimpse someof the organizing principles behind theseseemingly incomprehensible systems. Atthe very least, knowledge from this en-deavor has led to the rethinking of manybasic assumptions. In the past, for exam-ple, researchers modeled the Internet as arandom network to test how a new rout-ing protocol might affect system conges-tion. But we now know that the Internet isa scale-free system with behavior that isdramatically different from a random net-work’s. Consequently, investigators suchas John W. Byers and his colleagues atBoston University are revamping the com-puter models they have been using to sim-ulate the Internet. Similarly, knowledge of the properties of scale-free networks willbe valuable in a number of other elds, es-pecially as we move beyond network to-pologies to probe the intricate and oftensubtle dynamics taking place within thosecomplex systems.

www.sciam.com SCIENTIFIC AMERI69

COURTESYOFHAWOONGJEONG

All the World’s a Net. David Cohen inNew Scientist,Vol. 174, No. 2338, pages 24–29; April 13Statistical Mechanics of Complex Networks. Réka Albert and Albert-László Barabási inReviewsof Modern Physics,Vol. 74, pages 47–97; January 2002.Linked: The New Science of Networks. Albert-László Barabási. Perseus Publishing, 2002.Evolution of Networks: From Biological Nets to the Internet and WWW. J.F.F. Mendes and SergeDorogovtsev. Oxford University Press, 2003.Find links to papers on scale-free networks atwww.nd.edu/ ~ networks

M O R E T O E X P L O R E

MAP OF INTERACTING PROTEINSin yeast highlights the discovery that highly linked, or hub, proteinstend to be crucial for a cell’s survival. Red denotes essential proteins (their removal will cause the cellto die). Orange represents proteins of some importance (their removal will slow cell growth). Greenand yellow represent proteins of lesser or unknown significance, respectively.

COPYRIGHT 2003 SCIENTIFIC AMERICAN INC

Page 11: Scientific American Web

7/31/2019 Scientific American Web

http://slidepdf.com/reader/full/scientific-american-web 11/11