dangerous minds: the art of guerrilla data mining - def … · mark ryan del moral talabis...

43
Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

Upload: phungkhanh

Post on 28-Jul-2018

218 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

Mark Ryan del Moral Talabis

Dangerous Minds: The Art of Guerrilla Data Mining

Page 2: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 20092

Background

“Security Analytics”:

Concept of using data mining and AI in security

Presented techniques and theories that we could use

This Talk:

Move from theory to practical applications

Provide scenarios and examples to leverage these techniques for your research

Page 3: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 20093

Introduction

Traditional warfare vs. Information Security

Very similar

Reconnaissance, information gathering, and espionage play an important part in battle tactics

Page 4: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 20094

This is Sparta!

Anyone watched 300?

Spartans: they knew and understood the terrain

Persians: They did not win because of overwhelming numbers, they actually won because someone told them about a hidden pass.

Page 5: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 20095

Information Security

In information security:

Not only in “hacking” systems

The more information you have, you’ll have a better chance to protect you organization

Drafting good policies and procedures as well as picking the correct tools and techniques based on the information that you have.

Page 6: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 20096

Words of Wisdom

It is said that if you know your enemies and know yourself, you will not be imperiled in a hundred battles - Sun Tzu

Page 7: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 20097

Information Warfare

Information Warfare

The use and management of information in pursuit of a competitive advantage over an opponent

Information are just 1’s and 0’s if not used properly

Analysis makes information meaningful - INTELLIGENCE

Page 8: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 20098

The Business of Information Warfare

People who are into the Information Warfare Business:

CIA

FBI

NSA

Information Awareness

Office

Foreign Governments

Page 9: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 20099

Projects

Government Projects:

ECHELON

TALON

ADVISE

MATRIX

Able Danger

Large endeavors!

Page 10: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200910

Challenges

Amount of data: there’s just too much

Resources: way too little

Intelligence

Data

DataData

Challenge

Page 11: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200911

The Veritas Project

Veritas is latin for “Truth”

The Veritas Project

Modeled in the same general threat intelligence premise

Primarily based on community sharing approach and using tools, technologies, and techniques that are freely available.

Hawaii Honeynet Project and Secure-DNA

Page 12: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200912

An Analogy

Information Warfare

Field Agents

HQ

Analysts

Decision Makers

Page 13: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200913

Framework

Framework

Data Collection

Data Storage

Data Analysis

Decision Making

Page 14: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200914

Data Collection

Sources of Data

Depends on what you want to research Forums

Bulletins

Chat logs

News

Articles

Blogs

Word documents

The more you can gather, the better results

It’s not as easy, unless you’re Google

Page 15: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200915

Data Storage

Information can be stored in:

Relational databases

Flat files

Possibly the easiest part of all this

Page 16: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200916

Analysis

Possible the most important aspect of the framework

Crunching large amount of data.

Making data and information meaningful

Some Data Mining and Artificial Intelligence Concepts

K-Means

Neural Networks

SVM

A lot more

Not too easy but there are a lot of tools out there

Page 17: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200917

Data Analysis Tools

Some very useful tools that are free

Text Garden

Ontogen

Weka

Rapid Miner

Tanagra

Orange

MEAD

Page 18: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200918

Human Factor

Why do we need humans?

Interpretation of Results and Analysis = Intelligence

Page 19: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200919

Demo Scenarios

Let’s look at the scenarios that you can use as templates for your own research

Page 20: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200920

Applications

Examples

Trends Research

Malware Taxonomy

Monitoring – Persons of Interest

Corporate Intelligence - Strategy

Opinion Polls – What people are thinking about

Page 21: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200921

Trends Research

Track increases in chatter across time

Gives researchers focus

Find relationships between topics

Framework

Data Collection Crawlers (News articles, Forums)

Data Storage MySQL

Analysis Text Garden (html2txt, txt2bow, bowkmeans)

Decision Making Me!

Page 22: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200922

China Activity

Page 23: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200923

China Computer Assisted

Page 24: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200924

China Nuclear / Power

Page 25: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200925

Thought Cloud

Page 26: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200926

Defcon

Page 27: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200927

Defcon Sumbissions

Page 28: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200928

Defcon Crime

Page 29: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200929

Conclusion

Defcon = Crime

Page 30: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200930

iRobot!

Page 31: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200931

Skynet?

Page 32: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200932

Malware Taxonomy

Grouping similar malwares together

Framework

Data Collection Notes from Malware Analysts

Data Storage Flat files

Analysis Ontogen

Decision Making Depends

Page 33: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200933

Thousands of malware descriptions

Page 34: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200934

“Unsupervised Learning”

Page 35: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200935

Monitoring – Persons of Interest

Monitoring of chat logs and finding “persons of interest” and who they talk to. “Cells”.

Page 36: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200936

Monitoring – Persons of Interest

Page 37: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200937

Monitoring – Persons of Interest

Page 38: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200938

Corporate Intelligence

Using data mining to profile companies to determine strategy

Page 39: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200939

What the Public is Thinking About

Obama Town Hall Meeting

Data mining of over 100,000 questions to get a “pulse” of what people are concerned about

Healthcare

Page 40: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200940

Our Love Affair with Marijuana

Page 41: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200941

Future

Contributors

Sentiment Analysis

Good or Bad?

We need more data!

Page 42: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200942

Acknowledgements

Howard Van de Vaarst

Chris Potter

Secure DNA management

University of Santo Tomas (Philippines)

Blaz Fortuna (Ontogen)

Jozef Stefan Institute, Slovenia (Text Garden)

Page 43: Dangerous Minds: The Art of Guerrilla Data Mining - DEF … · Mark Ryan del Moral Talabis Dangerous Minds: The Art of Guerrilla Data Mining

DEFCON 200943

Mahalo

The Veritas Project