automating intelligence creation & analysis& analysis sonal.pdf · 2018-09-02 · mobile...

31
Automating Intelligence Creation & Analysis & Analysis 30 Jan 2020 Ashish Sonal CEO © 2010 - Orkash Services Pvt. Ltd. 1 …ensuring Assurance in complexity and uncertainty CEO Orkash Services Pvt Ltd

Upload: others

Post on 22-May-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Automating Intelligence Creation & Analysis& Analysis

30 Jan 2020

Ashish SonalCEO

© 2010 - Orkash Services Pvt. Ltd.1…ensuring Assurance in complexity and uncertainty

CEO Orkash Services Pvt Ltd

Page 2: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

About Orkash

Orkash is an international management-consulting and high-technology services company g g g gy p yoperating in the following areas:

― Investment & Transaction Advisory Support Operational Risk and Security Risk Management ― Operational Risk and Security Risk Management

― Strategic Business & Market Intelligence― SaaS-based Technology Solutions

Cutting edge technology expertise across AI systems, very large data mining, BI, cyber forensics, and parallel computing clusters & cloud computing.

Orkash is dedicated to turn knowledge and information into intelligence as a value driver for rapid response to new opportunities, competitive forces and risks. The greater the uncertainty, p p pp , p g y,complexity and risk in an operating environment, the greater is the potential of the value that Orkash delivers.

The company prides itself in measuring the value that it creates for its clients on strategic business metrics such as efficiencies in execution time-to-market and returns

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty

business metrics such as efficiencies in execution, time-to-market, and returns.

Page 3: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Key Areas

Automating Intelligence Creation

Semantics & GIS Integration - Bridging g g gTechnical Intelligence & Human Intelligence

Investigative Intelligence - CyberInvestigative Intelligence Cyber Forensics

Decision Support - Granularity & VisualizationVisualization

Backend Technologies - HPC Clusters and Clouds, AI Expert Systems, Large , p y , gScale and Real Time Data Mining

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty

© 2009 - Orkash Services Pvt. Ltd. 3

Page 4: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Patterns ..13 September 2008

Ne Delhi27 September 2008

Ne Delhi13 May 2008Jaipur, Rajasthan(Eight devices)

New Delhi(Six devices)

29 September 2008

30 October 2008Guwahati, Assam

(18 devices)

New Delhi(One device)

01January 2008Rampur U P

21 October 2008Imphal, Manipur

(One device)26 July 2008

29 September 2008Modasa, Gujarat

(One device)

( )Rampur, U.P(Two devices)

yAhmedabad, Gujarat

(21 devices)

29-30 July 2008Surat, Gujarat(18 d i )

01 October 2008Tripura, Agartala

(Five devices)

25 J l 2008

(18 devices)

29 September 2008Malegaon, Maharashtra

(Two devices)

(Five devices)

Date of attackLocation

25 July 2008Bengaluru, Karnataka

(Eight Devices)26-29 November 2008Mumbai, Maharashtra

(Multiple Attacks & Blasts)

(Scope of the attack)

© 2010 - Orkash Services Pvt. Ltd.4…ensuring Assurance in complexity and uncertainty

( )LEGEND KEY

Page 5: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Major Naxalites Areas

120

140

118

Naxalites attack in last five years

60

80

100

7160

No. of attacksNo. of people killed

umbe

r

0

20

40

2 17 3

88 8

Nu

2005 2006 2007 2008 20090

Year

© 2010 - Orkash Services Pvt. Ltd.5…ensuring Assurance in complexity and uncertainty

Page 6: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Information Asymmetry

• From the organized 26/11 Mumbai attacks it is clear that the attack was • From the organized 26/11 Mumbai attacks, it is clear that the attack was planned long ago

• Terrorist groups used the internet and the mobile networks, for communicating messages collecting information money transactions and othermessages, collecting information, money transactions and other

[email protected]

[email protected]

Internet accountAccount: Hazi AlPassword:

[email protected]

IP address

Banking and Card Usage

Hazi_123

Multiple Sims

IP addressMAC addressMetadataEmail tracing

Multiple HandsetsTelephone

Historical data analysis can train artificial intelligence expert-

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty 6

y g pengines to raise triggers and red flags. Real time data mining of mobile traffic generate trends and patterns in the usage

Page 7: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

The Intelligence Stack

Intelligence Consumption Intelligence Creation

TacticalStrategic

Tactical

Intelligence

Operational Intelligence

Intelligence

Operational Intelligence

Operational Intelligence

Strategic Intelligence Tactical Intelligence

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty

Page 8: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Intelligence Needs

Technical Intelligence + Human Intelligence

Predictive and Investigative Intelligence

G l it D t C bGranularity - Data Cubes

WHEN

WHO

WHEREWHERE

WHAT

WHY

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty

WHY

Page 9: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

The FOUR Pillars of Automated Intelligence Creation

i i &Information Extraction and

Monitoring

Data Mining & Forensics

Monitoring

Semantic A l i

Geospatial Analysis•Extraction of data

from varioussources including

Data Mining onreal time basis

Analysis of trendsAnalysissources including websites, blogs, mobile phone, Emails and couriers Use of semantics

to decipher the

Analysis ingeospatial contextto have betterdata visualization

Analysis of trendsand patterns ofactivities

Internet & Cyber • Bankingtransactions, creditcard usage, travelrecords

to decipher the content

Using domain specific ontologies

Geospatial intelligence for effective decisionmaking

Forensics

Target Centric

S i l N t k• Monitoring of webfor any unusual communication andred flags

specific ontologies

Network analysisTools and analytics

making

Locationintelligence for better interpretation

Social NetworkAnalysis

Pattern Tracing and Tracking

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty 9

red flags y pof network

Tracking

Page 10: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Automating Intelligence Creation - The Process Layers Where is the Intelligence Gap

d i f i f i• access and extraction of information• language interpretation for the context, and removal of NOISE• search vs scan .

• pattern tracking granularity & visualization

Leverage‘collective int’

pattern tracking, granularity & visualization

Geospatial analysis

collective int

Semantic

analysis

Data mining

Information

analysis

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty

© 2010 - Orkash Services Pvt. Ltd. 10

generation/extraction

Page 11: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Monitoring & Data Mining - Scale of the Problem

Global Internet Audience (in ,000's)

416 281

282,651

( )

Increased Email Communications416,281

48 783 185 109

74,906

10001200

1400

1600 1480

Text Messages Sent

ons)60000

80000

100000 90000

72900

Total number of emails (in billions)Spam emails (in billions)

48,783 185,109

Asia Pacific

Middle East & Africa

North America

Latin Amer-ica

Europe200400

600800

1000772 SM S sent

( bi l l i ons

Mess

ages

sen

t (in

billio

2008 20090

20000

40000

210 147

72900 (in billions)

Active users: 1.3 mnD il T t 27 3

2008 20090

Year

60000000

70000000

Internet users in India Internet Users

Daily Tweets: 27.3 mnTweets per hour: 1.8mn

“80 tweets per second 10000000

20000000

30000000

40000000

50000000

60000000

© 2010 - Orkash Services Pvt. Ltd.11…ensuring Assurance in complexity and uncertainty

were posted during 26/11Mumbai attacks” 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009

0

10000000

Page 12: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Real Time Data Mining

• Requirement for massive high-performance parallel-processing clusters/cloud Requirement for massive high performance parallel processing clusters/cloud computing - consisting of hundreds of servers

• GPU processors - 1000 cores and above• Massive data streams very specialized database architecture

[email protected]

[email protected]

• Massive data streams - very specialized database architecture

Internet accountAccount: Hazi AlPassword:

[email protected]

IP address

Banking and Card Usage

Multiple Sims

Password: Hazi_123

IP addressMAC addressMetadataEmail tracing

Multiple HandsetsTelephone

Historical data analysis would train expert engines to

© 2010 - Orkash Services Pvt. Ltd.12…ensuring Assurance in complexity and uncertainty

Historical data analysis would train expert engines to raise triggers and red flags. Real time data mining will generate trends and patterns in the usage

Page 13: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

A Key Intelligence Challenge

• A key challenge for intelligence purposes is A key challenge for intelligence purposes is

Deciphering the Intent of a target individual • Behavior profiling

• Internet footprint - creation of filters over the internet traffic so that the keywords can be picked up and can be used as triggers

• Search patterns and kind of websites visited

• RSS feeds subscribed, social networking sites, search engine, alerts set discussion board and blog postingsalerts set, discussion board and blog postings

• Travel and movement of a person

• Email and communication patterns and linkagesEmail and communication patterns and linkages

© 2010 - Orkash Services Pvt. Ltd.13…ensuring Assurance in complexity and uncertainty

Page 14: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

The Challenges

• The present platforms either do not tackle the p pissue of semantics fully or take into account a simple semantic structure. The burden of 'meaning construction' is left entirely to the useruser

• Interpreting the geographical or the spatial context of text and technical metadatacontext of text and technical metadata

• Conversion of unstructured to structured data

• Integration with geographic data.

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty 14

Page 15: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

What is required??

A t tedAutomated collection of

real time events

Contextualize info & events

Organize and identify

relationships

Identify abstract qualifiers - e.g

opinions'Output' is the input for the ne t le el of info opinionsthe next level of info

gathering

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty 15

About 80% of all data and human decisions has a geographical component making the geo-spatial context more relevant in intelligence creation

Page 16: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Semantic Enrichment

Semantic enrichment is the process of creating or associating semantic tags in Semantic enrichment is the process of creating or associating semantic tags in unstructured data or text, usually involving concepts, entities, relationships, events and properties described in an ontology or rule basedBenefits :Benefits :

Adding semantic metadata tags to the original unstructured data enables advanced correlation and data fusion capabilities.

Commonsense Engine

Concepts Engine

Enables concept , event and relationship extraction and automated metadata tagging.

D1D2D3 Document

Cl ifi ti

Classification Results

Engine Engine

Semantically E i h d

(XML)

.

.Dn

Classification Results

Concepts Tagged Documents

+Entities

Enriched Documents

Documents

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty 16

(XML) Entities

Semantic Enrichment

Documents

Page 17: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Semantic graph to perform semantic search

Query : Where does the woman who lives at 75-C, Sector 18 work??Q y ,

dog is a

mammalis a

d b iis a

Palipet Krishna

personis a

owned by is a

is a

house Ruhi

lives in owned byis a

k t

married to

75-C, Sector-18 Orkash

located at

works at

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty 17

Page 18: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Expert Engine based Contextualization

Copyright Orkash Services 2009

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty 18

Page 19: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

AI Expert Engine Analyzer

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty 19

Copyright Orkash Services 2009

Page 20: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Query: What is the key reason behind the frequency of attacks and the transition of terrorism India?

Semantic query generation through NLP

Contextualize throughNLPNLPA

+B+C

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty 20

Query Result: Operational sophistication and adoption of modern technologies has led to the transition of terrorism in India

Page 21: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Visualizing Semantic Data in GIS Context & Scan• Visualization of the meta-data and the

t t C ti f 'Vi l' lcontent – Creation of 'Visual' layers

• Role of contextual scan -semanticand geospatial

• User created content and collective intelligence

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty 21

Page 22: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Target Centric Intelligence Model

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty 22

Page 23: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Role of Cyber Forensics in Preemptive Intelligence• E-mail system of the Indian Prime Minister's Office (PMO) remained affected by a computer virus for 3

th d i 2008 S i f ti ff t d d 600 t f th Mi i t f E t l Aff i months during 2008. Spyware infection affected around 600 computers of the Ministry of External Affairs in February in 2009.

• There have been several serious attacks by international hacking communities on Indian government IT networks Many Indian ministries embassies the NIC and other organizations have faced disruptions networks. Many Indian ministries, embassies, the NIC and other organizations have faced disruptions arising from such threats

• In 2006, investigations into the Mumbai train bombings highlighted that tech professionals in a leading technology firm in Bangalore were operating sleeper cells, using advanced techniques to mask identities of IP addresses and resorted to steganography for disguised communications and fund transfers

Cyber Forensics

Pre-event Post-event1. Driven by intelligence gathering2. Is predictive in nature3 Data Mining can produce trends

1. Driven by Evidence Chain Management System 2 Is investigative in nature

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty 23

3. Data Mining can produce trends and patterns4. Track back analysis

2. Is investigative in nature3. Evidence that can be produced in the court

Page 24: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Packet Level Forensics

DimPortDimBackTrak (S DimPort

(VictimPortKey)

DimPort(I t d P

Audit (UpdateAu

dit)

ck (System Detail

i.e.IP,MAC,DNS,Node)

(IntruderPortKey)

Audit (Internal

DimTime (Time series

Analysis)

Audit (Internal Package and

table execution auditing)

TRACK-BACK FORENSICS - Internet

Packets

DimEvidenceSu

DimIssues (Threats)

mmary (EvidenceLog e.g

Protocol Info, Total time,

Packet)DimResponseLevel (Analytics

DimSecurityLevel (4

categories

© 2010 - Orkash Services Pvt. Ltd.…ensuring Assurance in complexity and uncertainty

(Analytics of severity Calculation

categories of

Security)

Page 25: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Illustration : Track back analysis

© 2010 - Orkash Services Pvt. Ltd.25…ensuring Assurance in complexity and uncertainty

Page 26: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Internet-based pharmacy

Internet Pharmacy

I N V E S T I G A T I O N C H A N N E L S

Online selling Payment gateway

Physical Supply

• Emails headers and firewall logsare critical in investigations through

k t l l f i d t k

gateway Supply

• Results from data mining canhelp in tracking paymenttransactions

• Every physical packet of goodsleaves a footprint during itstransportationpacket level forensics and track

backs

• Employ data mining tools likeclustering, classification andassociation

transactions

• Derive the correlation betweenpayment gateways and assessthe business architecture of thesales channel

transportation

• It is possible to track theinternational movement of goodsand narrow-down on the source.Especially in case of medicines as

• Mining of data in a relationship database

• Narrow down on suspicious IP addresses through pattern detection

sales channel Especially in case of medicines asthese are classified as ‘poisons’

• Track the network of courierservices and/or other individualsinvolved in the transit

© 2010 - Orkash Services Pvt. Ltd.26…ensuring Assurance in complexity and uncertainty

Page 27: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Deviators – the critical juncture

© 2010 - Orkash Services Pvt. Ltd.27…ensuring Assurance in complexity and uncertainty

Page 28: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Combining Data Mining and Link Analysis

© 2010 - Orkash Services Pvt. Ltd.28…ensuring Assurance in complexity and uncertainty

Page 29: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Network Analysis and Data visualization Structured analysis approach4Graphic based approachManual approach

(1) Manually creation of association matrix by identifying the relations through raw data

(1) Graphically representation of terrorists networks using tools such as Analyst Notebook (i2)

(1)Advanced analytical capabilities to assist investigation

(2) It can help in identifyingthrough raw data

(2) Helpful in crime investigation, becomes ineffective where datasets

Notebook (i2)

(2) Helpful in visualization of large amount of relationship data but without analytical

(2) It can help in identifying networks to mining of large volumes of data to discover useful knowledge and create intelligence about the structure and organization

are very large functionality. of criminal networks- Data Mining- Social Network Analysis- Pattern Tracing and interactions

© 2010 - Orkash Services Pvt. Ltd.29…ensuring Assurance in complexity and uncertainty

Page 30: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

Convergence

Convergence of Humint and TechintConvergence of Humint and TechintGovernment Agencies -Databases

PrivateEnterprises -Databases

Processing Raw Data i tDatabases Databases

Tech Intelligence

into Usable form

Human Intelligence

DataminingBetter Information

Internet based information collection

and investigation(Track Back)

Collection( )

IdentifyingTriggers & Patterns

Automated analysis ofstructured & unstructured

d t

Enhanced

Patternsdata

© 2010 - Orkash Services Pvt. Ltd.30…ensuring Assurance in complexity and uncertainty

Prevent-Prepare-Respond-RecoverStructure

Page 31: Automating Intelligence Creation & Analysis& Analysis Sonal.pdf · 2018-09-02 · mobile phone, Emails and couriers Use of semantics to decipher the Analysis in geospatial context

THANK YOUTHANK YOU

Orkash Services Pvt Ltd75 C, Sector 18,

Gurgaon 122 015Haryana, India

Phone: +91 124 4033773

[email protected]

www.orkash.com

+91 98102 36020

© 2010 - Orkash Services Pvt. Ltd.31…ensuring Assurance in complexity and uncertainty