classifying internet one-way traffic - caida...classification scheme one-way traffic composition...

29
Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz, Xenofontas Dimitropoulos ETH Zurich May 15, 2012 Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Upload: others

Post on 31-Mar-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Classifying Internet One-way Traffic

Eduard Glatz,Xenofontas Dimitropoulos

ETH Zurich

May 15, 2012

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 2: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Overview

I Classification scheme for dissecting one-way traffic that reliessolely on flow-level data

I Observation on one-way traffic based on a massive dataset of457 billion flows

I Show how one-way flows are useful for service availabilitymonitoring

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 3: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Preliminaries

I Study incoming one-way traffic at the network level:connections that do not receive a reply.

I Example causes of one-way traffic:

I Failures & PoliciesI AttacksI Special application behavior

I Sampling and asymmetric routing can result in artificialone-way traffic

I One-way traffic can be measured in edge networks

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 4: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Preliminaries

I Study incoming one-way traffic at the network level:connections that do not receive a reply.

I Example causes of one-way traffic:

I Failures & PoliciesI AttacksI Special application behavior

I Sampling and asymmetric routing can result in artificialone-way traffic

I One-way traffic can be measured in edge networks

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 5: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Classification Scheme

I Associate each one-way flow with a number of signs

I Introduce 18 signs exploiting in 4 cases techniques from theliterature

I Classify flows based on their signsI Classes:

I Unreachable servicesI P2P applicationsI ScanningI BackscatterI Suspected BenignI Bogon

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 6: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Signs: Host pair behavior

a) b) c) d)

Figure: Mixture of incoming one- and two-way flows exchanged betweena host pair. Hosts are represented by nodes and the presence ofinflow/outflow/biflows by arrows.

I End-hosts-communicating: One-way flow between productivehost pair

I Limited dialog: One-way flows between unproductive host pair

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 7: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Signs: Host pair behavior

a) b) c) d)

Figure: Mixture of incoming one- and two-way flows exchanged betweena host pair. Hosts are represented by nodes and the presence ofinflow/outflow/biflows by arrows.

I End-hosts-communicating: One-way flow between productivehost pair

I Limited dialog: One-way flows between unproductive host pair

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 8: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Signs: Local host behavior

I Unused local address: Unpopulated local IP address

I Service unreachable: Unanswered request to local service

I Peer-to-peer1: Flow towards local P2P host

1W. John and S. Tafvelin. Heuristics to classify internet backbone traffic based on connection patterns.

International Conference on Information Networking (ICOIN), 2008

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 9: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Signs: Remote host behavior

I Service sole reply: no biflow on srcIP ∧ dstPort≥1024 ∧srcPort < 1024

I Remote scanner 12: TRW algorithm (suspected scanner)

I Remote scanner 23: Host classification (suspected scanner)

I Remote non-scanner: TRW algorithm (suspected regular host)

2J. Jung, V. Paxson, A. Berger, and H. Balakrishnan. Fast portscan detection using sequential hypothesis

testing. In Proceedings of the IEEE Symposium on Security and Privacy, 20043M. Allman, V. Paxson, and J. Terrell. A brief history of scanning. In Proceedings of the 7th ACM

SIGCOMM IMC, 2007

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 10: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Signs: Flow feature

I Artifact: UDP/TCP flow with both port numbers=0

I Single packet: Flow contains one packet only

I Large flow: Flow carries ≥ 10 packets or ≥ 10240 bytes

I Bogon: Source IP belongs to bogon space

I Protocol: IP protocol type of flow

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 11: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Classification Rules

Final classifier includes 17 classification rules

Class Name Rule # Flow Membership Rules

Malicious 1 {TRWscan,HCscan,PotOk} ⇒ Scanner

Scanning 2 {HCscan,TRWscan,TRWnom,PotOk} ⇒ Scanner

3 {TRWscan,HCscan,PotOk} ⇒ Scanner

4 {TRWnom,HCscan} ⇒ Scanner

5 {GreyIP,Onepkt,TRWscan,HCscan,Backsc , ICMP,UDP, bogon} ⇒ Scanner

6 {GreyIP,TRWscan,HCscan,Onepkt, ICMP,Backsc , bogon} ⇒ Scanner

7 {Onepkt,GreyIP, ICMP,TRWscan,HCscan,TRWnom, bogon,P2P,Unreach,PotOk ,Backsc , Large} ⇒ Scanner

8 {GreyIP,Onepkt,TRWscan,HCscan,Backsc , ICMP,TCP, bogon} ⇒ Scanner

9 {ICMP,TRWscan,TRWnom,HCscan, InOut, bogon,PotOk} ⇒ Scanner

Backscatter 10 {Backsc ,TRWscan,HCscan,P2P, InOut,PotOk} ⇒ Backscatter

Service 11 {Unreach,TRWscan,HCscan, bogon,P2P} ⇒ Unreachable

Unreachable

Benign P2P 12 {P2P,TRWscan,HCscan, bogon} ⇒ P2P

Scanning

Suspected 13 {PotOk ,Unreach,P2P,TRWnom, bogon} ⇒ Benign

Benign 14 {Large,GreyIP,TRWscan,HCscan,P2P,Unreach,PotOk , ICMP,Backsc , bogon,TRWnom} ⇒ Benign

15 {TRWnom,GreyIP,HCscan,P2P,Unreach, bogon,Backsc} ⇒ Benign

16 {ICMP, InOut,TRWscan,HCscan,TRWnom, bogon,PotOk} ⇒ Benign

Bogon 17 {bogon,TRWscan,HCscan,Backsc} ⇒ Bogon

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 12: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Data-Sets

I Use data from the Swiss academic backbone network(SWITCH)

I Analyze the first 400 hours of each Feb and Aug between2004 and 2011

I The studied traffic data correspond to:I 457 billion flowsI 7.41 petabytesI cover 9% of the total number of flows

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 13: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Data Sanitization

I Double-counting elimination reduces total traffic volume by32.3%

I Defragmentation reduces the number of flows by a fractionranging between 20.6% and 39.6% for different years

I Bi-flow Pairing:I For TCP and UDP based on standard 5-tupleI For other protocols based on 3-tuple

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 14: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Evolution of One- and Two-way Traffic

I One-way flows are a largefraction of all flows:

I In 2004, 2 out of every 3flows were one-way

I From 2007 to 2010, 1 out ofevery 3 flows were one-way

I The number of one-way flowsin 2011 is almost equal to 2004

I The fraction of one-way flowshas declined

Period

Mea

n F

low

s/24

h

'4.2 '5.2 '6.2 '7.2 '8.2 '9.2 '0.2 '1.20e

+00

4e+

068e

+06

Inbound One−Way Two−WayInbound One−Way Two−Way

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 15: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Evolution of One- and Two-way Traffic

I One-way flows are a largefraction of all flows:

I In 2004, 2 out of every 3flows were one-way

I From 2007 to 2010, 1 out ofevery 3 flows were one-way

I The number of one-way flowsin 2011 is almost equal to 2004

I The fraction of one-way flowshas declined

Period

Mea

n F

low

s/24

h

'4.2 '5.2 '6.2 '7.2 '8.2 '9.2 '0.2 '1.20e

+00

4e+

068e

+06

Inbound One−Way Two−WayInbound One−Way Two−Way

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 16: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Composition of One-way Traffic

Class % of flows % of pkts pkts/flow

Scanning 83.5% 62.6% 1.6P2P applications 6.7% 13.0% 6.8Unreach services 4.8% 10.1% 4.1

Suspected Benign 2.6% 9.1% 12.1Other 2.2% 4.7% 4.6

Backscatter 0.3% 0.5% 3.3

I The top sources of one-waytraffic are scanning, P2Pprotocols, and unreachableservices

0e+

001e

+08

2e+

083e

+08

4e+

08Period

One

−W

ay F

low

s/24

h

2004

.01

2004

.07

2005

.01

2005

.07

2006

.02

2006

.07

2007

.01

2007

.07

2008

.02

2008

.07

2009

.01

2009

.07

2010

.01

2010

.07

2011

.01

2011

.08

SuspBenign SrvUnreach other

MalScan

Bogon BenignP2P Backscat

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 17: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Service Availability Monitoring

I One-way flows are very useful for service availabilitymonitoring

I Traditional service availability monitoring is based on activeprobing

I Advantages of flow-based approach:

I Provides a tangible assessment of the impact of disruptionsI Discovers running services without requiring manual

configurationI Exploits passive measurements

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 18: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Service Availability Monitoring

I One-way flows are very useful for service availabilitymonitoring

I Traditional service availability monitoring is based on activeprobing

I Advantages of flow-based approach:

I Provides a tangible assessment of the impact of disruptionsI Discovers running services without requiring manual

configurationI Exploits passive measurements

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 19: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Outages and Misconfigurations in ETH Zurich

I Examine a week of NetFlowdata from the EE Departmentof ETH Zurch

I Found 32 main services(> 99% availability) and 11transient services

I Identified a coinciding globaloutage

I During the identified interval287,583 unique IPaddresses failed to accesstarget services!

0

0.2

0.4

0.6

0.8

1

20 21 22 23 24 25 26 27

Fra

ctio

n of

Fai

led

Clie

nts

Day of June 2011

993/tcp (tardis.ee)25/tcp (tranquility.ee)80/tcp (yosemite.ee)

25/tcp (smtp.ee)

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 20: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Outages and Misconfigurations in ETH Zurich

I Examine a week of NetFlowdata from the EE Departmentof ETH Zurch

I Found 32 main services(> 99% availability) and 11transient services

I Identified a coinciding globaloutage

I During the identified interval287,583 unique IPaddresses failed to accesstarget services!

0

0.2

0.4

0.6

0.8

1

20 21 22 23 24 25 26 27F

ract

ion

of F

aile

d C

lient

s

Day of June 2011

993/tcp (tardis.ee)25/tcp (tranquility.ee)80/tcp (yosemite.ee)

25/tcp (smtp.ee)

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 21: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Outages and Misconfigurations in ETH Zurich

I Examine a week of NetFlowdata from the EE Departmentof ETH Zurch

I Found 32 main services(> 99% availability) and 11transient services

I Identified a coinciding globaloutage

I During the identified interval287,583 unique IPaddresses failed to accesstarget services!

0

0.2

0.4

0.6

0.8

1

20 21 22 23 24 25 26 27F

ract

ion

of F

aile

d C

lient

s

Day of June 2011

993/tcp (tardis.ee)25/tcp (tranquility.ee)80/tcp (yosemite.ee)

25/tcp (smtp.ee)

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 22: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Conclusions

I Classification scheme for one-way traffic that relies on 18signs derived from flow data

I Observations based on a very large data-set:

I One-way flows are a large fraction of all flowsI In terms of flows, the share of one-way traffic has declined

since 2004I The top sources of one-way traffic are scanning, P2P

protocols, and unreachable services

I One-way traffic is very useful for assessing the impact offailures

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 23: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Questions?Contact: [email protected]

E. Glatz and X. Dimitropoulos. Classifying Internet One-wayTraffic. TIK-Report 336, ETH Zurich, May 2012

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 24: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Validation

I Collect packet traces from a small campus networkI Exploit additional information:

I Extended host profilesI ICMP types and codesI TCP flags (Check protocol state machine)I DPI-based application identification4

I Precise timestamps

Class Name Recall [%] Precision [%]Malicious Scanning 99.9 99.8Service Unreachable 99.6 96.1Benign P2P Scanning 95.3 95.5Backscatter 62.4 88.4Suspected Benign 85.1 75.0Bogon 40.4 100.0

4H. Kim, K. Claffy, M. Fomenkov, D. Barman, M. Faloutsos, and K. Lee. Internet traffic classification

demystified: myths, caveats, and the best practices. ACM CoNEXT, 2008

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 25: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Validation

I Collect packet traces from a small campus networkI Exploit additional information:

I Extended host profilesI ICMP types and codesI TCP flags (Check protocol state machine)I DPI-based application identification4

I Precise timestamps

Class Name Recall [%] Precision [%]Malicious Scanning 99.9 99.8Service Unreachable 99.6 96.1Benign P2P Scanning 95.3 95.5Backscatter 62.4 88.4Suspected Benign 85.1 75.0Bogon 40.4 100.0

4H. Kim, K. Claffy, M. Fomenkov, D. Barman, M. Faloutsos, and K. Lee. Internet traffic classification

demystified: myths, caveats, and the best practices. ACM CoNEXT, 2008

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 26: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Outages and Misconfigurations in ETH Zurich

I Found server that was not reachable during the studied weekin total by 2.2 million unique clients!

I What was this server? Hint: Switzerland is famous forchocolate, banking, swiss army knifes, and watches

I Popular NTP server swisstime.ee.ethz.ch preconfigured inNTP clients and used in NTP “hello world” examples

I It was not reachable to 12.9% of its clients cause by invalidCRC checksums and a filtering policy

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 27: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Outages and Misconfigurations in ETH Zurich

I Found server that was not reachable during the studied weekin total by 2.2 million unique clients!

I What was this server? Hint: Switzerland is famous forchocolate, banking, swiss army knifes, and watches

I Popular NTP server swisstime.ee.ethz.ch preconfigured inNTP clients and used in NTP “hello world” examples

I It was not reachable to 12.9% of its clients cause by invalidCRC checksums and a filtering policy

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 28: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Impact of the Interval Size

Doubling the interval size:

I decreases absolute countmetrics by 3-5%.

I decreases relative volumemetrics by 1.2% and does not

I decrease further with anincreasing interval size.

0.95

1.00

1.05

Interval Size [s]

Var

iatio

n w

rt. 6

00 s

inte

rval

siz

e300 450 600 720 900 1200

Two−way flowsOne−way flowsOne−way/total flowsTotal flows

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic

Page 29: Classifying Internet One-way Traffic - CAIDA...Classification Scheme One-way Traffic Composition Service Availability Monitoring Classifying Internet One-way Traffic Eduard Glatz,

Classification SchemeOne-way Traffic Composition

Service Availability Monitoring

Signs

Sign Type Sign Name Detection Criterion/Algorithm

Host pair behavior End-hosts-communicating One-way flow between productive host pairLimited dialog One-way flows between unproductive host pair

Remote host behavior Service sole reply no biflow on srcIP ∧ dstPort≥1024 ∧ srcPort < 1024Remote scanner 1 TRW algorithm (suspected scanner)Remote scanner 2 Host classification (suspected scanner)Remote non-scanner TRW algorithm (suspected regular host)

Local host behavior Unused local address Unpopulated local IP addressService unreachable Unanswered request to local servicePeer-to-peer Flow towards local P2P host

Flow feature Artifact UDP/TCP flow with both port numbers=0Single packet Flow contains one packet onlyLarge flow Flow carries ≥ 10 packets or ≥ 10240 bytesBogon Source IP belongs to bogon spaceProtocol IP protocol type of flow

Eduard Glatz, Xenofontas Dimitropoulos Classifying Internet One-way Traffic