Strata San Jose 2016 - Reduce False Positives in Security
TRANSCRIPT
Powerball Predictor
Photo Credit: Sean McGrath
A crystal ball tells me with 99% accuracy whether a Powerball ticket is a winner.
Powerball Predictor
● ~300 million samples
● ~3 million false positives
● 1 true positive
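The slide's arithmetic can be checked directly. A minimal sketch, assuming the 99% figure is both the true-positive and true-negative rate:

```python
# Base-rate arithmetic for the 99%-accurate Powerball crystal ball.
total = 300_000_000   # tickets scored
winners = 1           # actual winners: the base rate is ~1 in 300 million
accuracy = 0.99       # assumed true-positive and true-negative rate

false_positives = (total - winners) * (1 - accuracy)  # ~3 million
true_positives = winners * accuracy

# Precision: when the ball says "winner", how often is it right?
precision = true_positives / (true_positives + false_positives)
print(f"{false_positives:,.0f} false positives")
print(f"precision: {precision:.2e}")
```

Even at 99% accuracy, a positive prediction is almost certainly wrong because the base rate is so low; the same arithmetic applies to intrusion alerts.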
Powerball Predictor
The overwhelming majority of tickets are not winners.
Failing to recognize this is falling victim to the base rate fallacy.
Security Crystal Ball
The overwhelming majority of log entries and data points do not represent fraud and intrusions.
Failing to recognize this is falling victim to the base rate fallacy.
[Figure: Fraud / Intrusion Detection System]
Source: MXLabs
Base Rate Fallacy
Why False Positives?
Case Study: Outlier Detection
Using an outlier detection system to identify fraudsters within the environment.
For a set of generating mechanisms, find the unusual ones.
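The talk does not name a specific algorithm; a minimal sketch of the idea, flagging points far from an account's usual behavior with a z-score cutoff (the data and threshold are illustrative):

```python
import statistics

def zscore_outliers(values, threshold=3.0):
    """Flag points more than `threshold` standard deviations from the mean."""
    mean = statistics.fmean(values)
    stdev = statistics.stdev(values)
    return [v for v in values if abs(v - mean) / stdev > threshold]

# Daily transaction counts for one account; the spike is the outlier.
counts = [12, 11, 13, 12, 10, 11, 12, 13, 11, 12, 95]
print(zscore_outliers(counts))  # [95]
```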
Example Time Series
Photo credit SuperCar-RoadTrip.fr under Creative Commons Attribution 2.0
The data changes over time in unforeseen ways.
Concept Drift
Solution: Feedback Loop
Explicit Feedback Loop
Photo credit Alan Levine under Creative Commons Attribution 2.0
Implicit Feedback Loop
Fraud: Takeaways
- Concept drift is a shift in behavior.
- Feedback combats concept drift.
- Implicit feedback > explicit feedback.
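One way to capture implicit feedback, sketched with hypothetical names: treat an analyst escalating an alert as a positive label and closing it without action as a negative one, so fresh labels keep arriving as behavior drifts:

```python
from collections import defaultdict

class FeedbackStore:
    """Collect implicit labels from analyst actions, per alert type."""

    def __init__(self):
        self.labels = defaultdict(list)

    def record(self, alert_type, escalated):
        # Implicit signal: escalation = useful alert, no action = noise.
        self.labels[alert_type].append(1 if escalated else 0)

    def useful_rate(self, alert_type):
        seen = self.labels[alert_type]
        return sum(seen) / len(seen) if seen else None

store = FeedbackStore()
for escalated in (False, False, True, False):
    store.record("rare_login_country", escalated)
print(store.useful_rate("rare_login_country"))  # 0.25
```

The point of the takeaway above is that these labels arrive for free from the analyst's normal workflow, unlike explicit feedback that asks for extra work.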
IDS: Anatomy of Successful Detection
Context: Security Analyst
Red team Kill Chain
Blue team Kill Chain
False positives: Lose Ability to Triage
Fact: You cannot salvage a false positive with Contextual Info or Visualization
What is a Successful detection?
Properties + Frameworks
A successful detection captures adversary TTPs from sensor data while ignoring expected activity.
Source: @MSwannMSFT
Properties of a Successful Detection
Adaptable
Credible
Interpretable
Actionable
[Chart: usefulness of alerts (less useful → more useful) vs. sophistication of algorithms (basic → advanced), with security domain knowledge as a third dimension]
Framework for a Successful Detection
[Chart: same axes; "Outlier" plotted as basic and less useful]
[Chart: same axes; increasing algorithmic complexity moves from "Outlier" to "Anomaly"]
[Chart: same axes; increasing domain knowledge moves from "Anomaly" to "Security Interesting Alerts"]
Successful detections incorporate domain knowledge.
How to encode Domain Knowledge: Embrace Rules
• Business heuristics to filter out the "security-interesting anomalies"
• Rules can take many forms:
  • TI feeds
  • IOCs, IOAs
  • TTPs
• Rules are awesome:
  • Credible, interpretable, adaptable (to some extent), actionable!
  • Highest precision
  • Highest recall
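As one concrete form a rule can take, a TTP-style check expressed directly in code; the field names and the `svc_` account-naming convention are illustrative, not from the talk:

```python
def is_suspicious_logon(event):
    """TTP-style rule: interactive logons by service accounts are suspicious."""
    return (
        event["logon_type"] == "interactive"
        and event["account"].startswith("svc_")  # hypothetical naming convention
    )

print(is_suspicious_logon({"account": "svc_backup", "logon_type": "interactive"}))  # True
print(is_suspicious_logon({"account": "alice", "logon_type": "interactive"}))       # False
```

A rule like this is credible and interpretable in exactly the sense the slide lists: an analyst can read it and see why an alert fired.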
Three ways to combine ML and Rules
1. Above machine learning systems
   a. Business heuristics to filter alerts
      i. "For account _foo_, only raise sev 2 alerts until March 28th, 2016"
Work by Dan Mace et al., Microsoft
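A heuristic layer above the ML system might look like the following sketch, mirroring the slide's example; the account name, the severity scale (1 = most severe), and the suppression table are illustrative assumptions:

```python
from datetime import date

# Hypothetical per-account suppression rules applied *above* ML alerts.
SUPPRESSIONS = {"foo": {"until": date(2016, 3, 28), "max_sev": 2}}

def passes_heuristics(alert, today):
    """Return True if the alert should be raised to the analyst."""
    rule = SUPPRESSIONS.get(alert["account"])
    if rule and today <= rule["until"]:
        return alert["severity"] <= rule["max_sev"]  # only raise sev 1-2
    return True

print(passes_heuristics({"account": "foo", "severity": 4}, date(2016, 3, 20)))  # False
print(passes_heuristics({"account": "foo", "severity": 4}, date(2016, 4, 1)))   # True
```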
2. Below machine learning systems
   a. Featurization: "If IP address present in list of malicious IPs, flag 1"
   b. Utilizes threat intel feeds (Cymru, VirusTotal, FireEye)
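The featurization in (a) might look like this sketch; the feed contents and field names are illustrative (the IP is an RFC 5737 documentation address):

```python
# TI-feed membership becomes a binary feature the model consumes.
MALICIOUS_IPS = {"203.0.113.7", "198.51.100.23"}  # illustrative feed contents

def featurize(event):
    """Turn a raw event into model features, including the TI-feed flag."""
    return {
        "in_malicious_ip_list": 1 if event["src_ip"] in MALICIOUS_IPS else 0,
        "bytes_out": event["bytes_out"],
    }

print(featurize({"src_ip": "203.0.113.7", "bytes_out": 4096}))
# {'in_malicious_ip_list': 1, 'bytes_out': 4096}
```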
3. Combining rules and machine learning together using Markov Logic Networks
Initial Ideas given by Vinod Nair, MSR
Intuition
• Rules alone place a set of hard constraints on the set of possible worlds
• Let's make them soft constraints: when a world violates a formula, it becomes less probable, not impossible
• Give each formula a weight (higher weight ⇒ stronger constraint)
Source: Lectures by Pedro Domingos
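The soft-constraint intuition can be made concrete: in a Markov logic network, a world's unnormalized probability is exp(Σᵢ wᵢnᵢ), where nᵢ counts the true groundings of formula i. A sketch with illustrative weights and counts:

```python
import math

def world_weight(weights_and_counts):
    """Unnormalized MLN weight of a world: exp(sum of w_i * n_i)."""
    return math.exp(sum(w * n for w, n in weights_and_counts))

# World 1 satisfies both groundings of a weight-1.5 formula;
# world 2 violates one of them, so it is less probable, not impossible.
w1 = world_weight([(1.5, 2)])
w2 = world_weight([(1.5, 1)])
print(w1 / w2)  # e^1.5 ≈ 4.48
```

Each violated grounding divides the world's weight by e^w, which is exactly the "higher weight ⇒ stronger constraint" bullet above.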
Interactive logons from service accounts indicate an attack.
Similar service accounts tend to have similar logon behavior
Example: Service Accounts
Domain Knowledge
Example: Service Accounts
Encode as First Order Logic
Example: Service Accounts
1.5  InteractiveLogon(x) ⇒ Attack(x)
1.1  Similar(x,y) ⇒ (InteractiveLogon(x) ⇔ InteractiveLogon(y))
Example: Service Accounts
Associate each rule with the learned weight
Example: Service Accounts
[Figure: ground network over Attack(A), InteractiveLogon(A), InteractiveLogon(B), and Attack(B), with rule weights 1.5 and 1.1]
Example: Service Accounts
Consider two service accounts: A and B.
Example: Service Accounts
[Figure: ground network adding Similar(A,B), Similar(B,A), Similar(A,A), and Similar(B,B) to Attack(A), Attack(B), InteractiveLogon(A), and InteractiveLogon(B), with rule weights 1.5 and 1.1]
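Grounding the example's predicates for the two constants reproduces the atoms in the slide's network diagram; a quick enumeration:

```python
from itertools import product

# Ground atoms for the two service accounts A and B.
constants = ["A", "B"]
atoms = (
    [f"Attack({c})" for c in constants]
    + [f"InteractiveLogon({c})" for c in constants]
    + [f"Similar({x},{y})" for x, y in product(constants, repeat=2)]
)
print(atoms)
# ['Attack(A)', 'Attack(B)', 'InteractiveLogon(A)', 'InteractiveLogon(B)',
#  'Similar(A,A)', 'Similar(A,B)', 'Similar(B,A)', 'Similar(B,B)']
```

With 8 ground atoms there are 2⁸ = 256 possible worlds; the weighted formulas define a probability distribution over them.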
• How to learn the structure?
  • Begin with hand-coded rules
  • Use Inductive Logic Programming, but need to infer arbitrary clauses
• How to learn the weights?
  • For generative learning, depend on pseudo-likelihood
• Check out Alchemy -- http://alchemy.cs.washington.edu/
Call for Action - After the Conference
• One week
  • Review @CodyRioux's IPython Notebook and @Ram_ssk's follow-up material
  • Think comprehensively about rules
• One month
  • Ask your data scientists to review the literature section
  • Implement the rules on top of ML systems
• One quarter
  • Implement a feedback system to capture training data
  • Implement all TI feeds within an ML system
  • Play with Alchemy
Literature
● Axelsson, S. (1999). "The Base-Rate Fallacy and its Implications for the Difficulty of Intrusion Detection."
● Didona, D., et al. (2015). "Enhancing Performance Prediction Robustness by Combining Analytical Modeling and Machine Learning."
● Richardson, M., and Domingos, P. (2006). "Markov Logic Networks." Machine Learning 62(1-2): 107-136.