the power-law of top500 - university of chicagopeople.cs.uchicago.edu/~matei/p2p/pl-top500.doc ·...

2
The Power-Law of Top500 Matei Ripeanu ([email protected]) Most natural and technical phenomena are characterized by highly unbalanced distributions: there are few powerful, devastating earthquakes and countless unnoticeable ones; there are few machines with a peak FLOPS rate larger than 1 TFLOPS while millions of machines work at around 1 MFLOPS. Many of these events (city sizes, incomes, word frequency [1, 2]) fit power-law distributions: the number of events of a certain size is proportional to the size of the event to a negative constant power. There are many dimensions of variation for entities participating in the Internet: from the obvious ones like CPU speed, available disk space and network bandwidth, to more elaborate ones such as inter-failure time, node trustworthiness, or reliability. We conjecture that these follow similar, highly heterogeneous distributions. Preliminary results support our intuition: Internet’s autonomous system size [3], node bandwidth for nodes in Gnutella network [4, 5] or CPU power for machines in Top500 list [6], all follow power-law distributions (or at least highly variable distributions that can be well approximated as power-laws). Figure 1: Peak processing rate (GFLOPS) for world’s fastest supercomputers in Top500 list from 1995 to 2001. Each series of

Upload: dangthu

Post on 18-Jul-2018

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Power-Law of Top500 - University of Chicagopeople.cs.uchicago.edu/~matei/P2P/pl-top500.doc · Web viewMany of these events (city sizes, incomes, word frequency [1, 2]) fit power-law

The Power-Law of Top500Matei Ripeanu

([email protected])

Most natural and technical phenomena are characterized by highly unbalanced distributions: there are few powerful, devastating earthquakes and countless unnoticeable ones; there are few machines with a peak FLOPS rate larger than 1 TFLOPS while millions of machines work at around 1 MFLOPS. Many of these events (city sizes, incomes, word frequency [1, 2]) fit power-law distributions: the number of events of a certain size is proportional to the size of the event to a negative constant power. There are many dimensions of variation for entities participating in the Internet: from the obvious ones like CPU speed, available disk space and network bandwidth, to more elaborate ones such as inter-failure time, node trustworthiness, or reliability. We conjecture that these follow similar, highly heterogeneous distributions. Preliminary results support our intuition: Internet’s autonomous system size [3], node bandwidth for nodes in Gnutella network [4, 5] or CPU power for machines in Top500 list [6], all follow power-law distributions (or at least highly variable distributions that can be well

approximated as power-laws).We use an example to depict the characteristics of this distribution: peak MFLOPS rate of the world’s most powerful supercomputers follows a power-law distribution for all years for which data is available (see Figure 1). If we make the assumption that the same distribution extends to machines beyond the Top500, even more interesting is perhaps the heavy-tail property. As a result, aggregating computers in the tail results into a powerful machine comparable with the top ones. By analyzing data for the last seven years, one can notice a fascinating trend: the heavy-tail property becomes more accentuated

(Figure 2 shows that the power constant of these distributions gets closer to zero). If this

Figure 1: Peak processing rate (GFLOPS) for world’s fastest supercomputers in Top500 list from 1995 to 2001. Each series of points represents one year on this log-log plot.

Figure 4: Evolution of power-law distribution coefficient k over time. As k gets closer to 0 the distribution is more heavy-tailed. Note that k decreases on average 2% per year.

Page 2: The Power-Law of Top500 - University of Chicagopeople.cs.uchicago.edu/~matei/P2P/pl-top500.doc · Web viewMany of these events (city sizes, incomes, word frequency [1, 2]) fit power-law

trend persists, the interest will continue to shift from building large machines to large-scale integrations of less powerful systems.

References