sigcomm - john s. otto, mario a. sanchez, david r. choffnes*,...

26
http://aqualab.cs.northwestern.edu John S. Otto, Mario A. Sanchez, David R. Choffnes*, Fabián E. Bustamante, Georgos Siganos** Northwestern, EECS * U. Wash, CSE ** Telefónica Research

Upload: others

Post on 14-Mar-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

http://aqualab.cs.northwestern.edu

John S. Otto, Mario A. Sanchez, David R. Choffnes*, Fabián E. Bustamante, Georgos Siganos** Northwestern, EECS * U. Wash, CSE ** Telefónica Research

Page 2: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 2 On Blind Mice and the Elephant

Page 3: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 3

! A large, global peer-to-peer system ! Millions of users exchanging content ! Virtually every country in the world

On Blind Mice and the Elephant

Page 4: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 4

! System’s measured network impact depends on measurement vantage point –  How much of network traffic is from BitTorrent?

On Blind Mice and the Elephant

Eastern Europe

57% (ipoque)

South America

20% (ipoque)

No, Germany is 9-15% (Maier et al. IMC’09)

Germany

37% (ipoque)

Page 5: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 5

! A view from a broad set of end users –  To sample its overall network traffic –  Understand where it flows –  Who pays for it (and how expensive it is)

! This work –  Relies on end users as vantage points

•  Captures a sample of all BitTorrent traffic •  Reveals traffic’s path through the network

–  Public view is not sufficient to map most BitTorrent traffic

–  ISP data provides context to understand cost of BitTorrent traffic

On Blind Mice and the Elephant

Page 6: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 6

! Representative sample of users –  500,000 users, 3,300 networks, 169 countries

! Running extensions (Ono & NEWS) for Vuze BitTorrent client –  Anonymously report statistics –  Provide application-level data

•  e.g. session length, per-connection transfer volumes •  Log 13 TB of traffic per day

–  Conduct active measurements to reveal traffic paths •  With public view alone, we can map 25% of traffic •  Supplemented with traceroutes, we can map 89%

On Blind Mice and the Elephant

Page 7: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 7

! How BitTorrent is being used –  Who is using BitTorrent? –  When do people run BitTorrent? –  How much traffic does it generate? –  Study data from Nov. 2008 to Nov. 2010

! Where the generated traffic flows ! Who pays for it and how much

On Blind Mice and the Elephant

Page 8: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 8 On Blind Mice and the Elephant

EU 52%

NA 20%AS 19%

SA 4%OC 3%AF 2%

EU 61%

NA 19%AS 13%

SA 4%OC 2%AF 1%

2009 2010

! Decrease in Europe ! Increase in Asia, Africa and Oceania

Connected peers by continent

! Overall population reduced by 10% ! Locations of users change over time

Page 9: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 9 On Blind Mice and the Elephant

Rate of growth of connected users per continent relative to Nov. 2008

! Europe continues to drop ! N. America, S. America remain stable since 2009 ! 76% growth in Africa and 47% in Asia

Page 10: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 10

06 08 10 12 14 16 18 20 22 00 02 04 060

20

40

60

80

100

Nor

mal

ized

%of

hour

lype

ers

seen

2009

! Shift away from overnight use ! Peak usage aligns with evening hours, local time

–  Potential impact on ISPs’ costs under burstable billing

On Blind Mice and the Elephant

European peers seen on weekdays Normalized number of peers seen per hour in Europe, depending on time of day

Page 11: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 11

06 08 10 12 14 16 18 20 22 00 02 04 060

20

40

60

80

100

Nor

mal

ized

%of

hour

lype

ers

seen

2009

! Shift away from overnight use ! Peak usage aligns with evening hours, local time

–  Potential impact on ISPs’ costs under burstable billing

On Blind Mice and the Elephant

European peers seen on weekdays Normalized number of peers seen per hour in Europe, depending on time of day

06 08 10 12 14 16 18 20 22 00 02 04 060

20

40

60

80

100

Nor

mal

ized

%of

hour

lype

ers

seen

20092010

Page 12: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 12

! 25% increase in per-peer hourly download volume ! Despite a 20% drop in total connections,

a 12% increase in overall system traffic

On Blind Mice and the Elephant

Nov ’09Jan ’10

Mar ’10May ’10

Jul ’10Sep ’10

Nov ’100

20

40

60

80

100

120

140

160

Hou

rlyPe

erD

ownl

oad

Volu

me

(MB

)

Per-peer hourly download volume (in MB) over the last year

Page 13: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 13

! Overall population reduced by 10% –  But large increase in Africa and Asia

! Peak usage aligns with evening hours ! 12% increase in overall system traffic

–  25% increase in per-peer hourly download volume

So where’s the traffic?

On Blind Mice and the Elephant

Page 14: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 14

! How “deep” does traffic go in the network? ! Who is paying for it? ! Traffic path analysis to see which networks carry

most BitTorrent traffic –  Tier 1: Well-known networks –  Tier 2: Large transit providers –  Tier 3: Small transit providers –  Tier 4: Content/access/hosting providers

Enterprise customers

On Blind Mice and the Elephant

Tiers based on Dhamdhere and Dovrolis, IMC 2008

Page 15: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 15

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

1

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

CD

F [

X !

x]

Proportion of Traffic

Tier 3

! Most traffic stays at or below Tier 3 ! Significant fraction of traffic never reaches Tiers 1 or 2

–  Typically missed by in-network monitoring studies from the core

On Blind Mice and the Elephant

Fraction of each peer’s traffic that reaches Tier X

Smaller fraction of traffic

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

1

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

CD

F [

X !

x]

Proportion of Traffic

Tier 1 Tier 2 Tier 3 Tier 4

Page 16: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 16

! Most traffic stays at or below Tier 3 ! Significant fraction of traffic never reaches Tiers 1 or 2

–  Typically missed by in-network monitoring studies from the core

On Blind Mice and the Elephant

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

1

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

CD

F [

X !

x]

Proportion of Traffic

Tier 1 Tier 2 Tier 3 Tier 4

Fraction of each peer’s traffic that reaches Tier X

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

1

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

CD

F [

X !

x]

Proportion of Traffic

Tier 1 Tier 2 Tier 3 Tier 4

Page 17: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 17

! Traffic generally stays in the originating tier ! Tier 2 networks do not provide “intermediate” level of

connectivity between Tiers 1 & 3

On Blind Mice and the Elephant

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

1

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

CD

F [X

! x

]

Proportion of Traffic

Tier 1Tier 2Tier 3

Traffic from Tier 2 to Tier 2 Traffic from Tier 3 to Tier 3

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

1

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

CD

F [X

! x

]

Proportion of Traffic

Tier 1Tier 2

Page 18: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 18

! How BitTorrent is being used ! Where the generated traffic flows

–  Most traffic is handled at or below Tier 3

! Who pays for it and how much

On Blind Mice and the Elephant

Page 19: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 19

! Determine BitTorrent cost relative to other traffic –  ISP X’s data provides context to interpret traffic sample

! Study at granularity of individual network links ! Consider common burstable billing model

–  e.g. 95th-percentile billing

! Data for several of ISP X’s links over 1 week

On Blind Mice and the Elephant

ISP X

ISP A ISP B

ISP C ISP G

Providers

Customers

Page 20: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 20

! Aggregate link volume for each 5 minute bin ! Cost is based on 95th-percentile bin’s value ! Under burstable billing model, not all bytes may have the

same cost –  Peak-hour bytes are more expensive than off-peak

On Blind Mice and the Elephant

95th-percentile value

When value is defined

All Traffic

Page 21: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 21 On Blind Mice and the Elephant

Other BT

Other

BT

! BitTorrent at peak hour is more expensive ! Use Shapley value to determine relative cost of BitTorrent

–  Shapley value gives the cost contribution of BitTorrent traffic –  Compare to other traffic on the network –  Is BitTorrent’s cost more than its “fair share” by volume?

BitTorrent peaks at 3AM BitTorrent peaks at 9PM

Page 22: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 22 On Blind Mice and the Elephant

Other BT

Other

BT

! BitTorrent at peak hour is more expensive ! Use Shapley value to determine relative cost of BitTorrent

–  Shapley value gives the cost contribution of BitTorrent traffic –  Compare to other traffic on the network –  Is BitTorrent’s cost more than its “fair share” by volume?

BitTorrent peaks at 3AM BitTorrent peaks at 9PM BitTorrent’s contribution

to cost

BT BT

Other Other

Page 23: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 23 On Blind Mice and the Elephant

! BitTorrent traffic is generally more expensive than other traffic

! What traffic characteristics result in high relative cost?

Additional cost of BitTorrent traffic, percent above relative cost of 1

ISP X

ISP A ISP B

ISP C ISP G

Page 24: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 24

! High relative cost of BitTorrent –  Large coefficient of variation (“C.V.”, size of peaks in BitTorrent traffic) –  Small cross-correlation offset (“X-corr”, alignment with overall traffic)

On Blind Mice and the Elephant

Out-of-phase peaks Aligned peaks

Small peaks

Large peaks

Page 25: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 25

! High relative cost of BitTorrent –  Large coefficient of variation (“C.V.”, size of peaks in BitTorrent traffic) –  Small cross-correlation offset (“X-corr”, alignment with overall traffic)

On Blind Mice and the Elephant

Out-of-phase peaks Aligned peaks

ISP B X-corr: 3.2 hours C.V.: 188% Relative cost: 50%

ISP A X-corr: -7.1 hours C.V.: 130% Relative cost: 13%

ISP F X-corr: 7.4 C.V.: 325% Relative cost: 52%

ISP E X-corr: 1.6 C.V.: 158% Relative cost: 83%

Small peaks

Large peaks

Page 26: SIGCOMM - John S. Otto, Mario A. Sanchez, David R. Choffnes*, …conferences.sigcomm.org/sigcomm/2011/slides/s110.pdf · 2011-08-17 · Otto, Sánchez, Choffnes, Bustamante & Siganos

Otto, Sánchez, Choffnes, Bustamante & Siganos 26

! BitTorrent is still alive and costly –  Most traffic stays at the edge of the network –  It is moving into prime-time –  Logically, it is relatively more expensive

! A broad view from the edge of the network is required to see the system’s full usage spectrum

! Approach is general to understanding other distributed systems –  Video streaming –  Peer-to-peer CDNs

On Blind Mice and the Elephant