ai systems in iot-scale deployments - gtc on-demand...

30
Leading EDGE COMPUTING 1 AI Systems in IoT-Scale Deployments Simon Collins Senior Product Manager GTC Europe Thursday, 11 October 2018

Upload: others

Post on 19-Jun-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 11

AI Systems in IoT-Scale Deployments

Simon Collins

Senior Product Manager

GTC Europe

Thursday, 11 October 2018

Page 2: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 22

Enabled by convergence of key technologies

Massive global growth driven by data

Artificial Intelligence to create $3.5 trillion - $5.8 trillion of

value globally

How do we manage data across this vast ecosystem?

Internet of Things to have24 billion connected devices,

163 zetabytes of data by 2025

Edge Computing set to reach $3.24 billon by 2025

Page 3: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 33

As IoT grows, the “Things” become more widely distributed

Problem space

AI Systems in IoT-Scale Deployments

☺ Generates more value

AI at the Edge allows “intelligent” machines to make sense of the data and transmit only valuable information

Increases total system complexity Increases management costs Increases data connectivity costs

We need to be able to easily and securely manage the distribution of intelligence, while enabling continuous upgrades in capability

Generates more data

Increasingly capable hardware allow us to deploy more sophisticated tools at the Edge

Lowers latency Keeps data on-premise = privacy Increases reliability

Page 4: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 44

Easy to Integrate - Data-centric architecture that hides topology details, enabling true plug-and-play

Scalable - Scaling across thousands or even millions of participants

Open – Mature, proven and open standard with future-proof APIs plus wire-protocol

Secure – End-to-end secure data connectivity with authentication, encryption & access control capabilities

Fault-tolerant – Peer-to-peer communication without message brokers or servers

Widely applicable - Available for embedded, mobile, web, enterprise and cloud applications

QoS-enabled – Full control over data distribution: timeliness, prioritization, reliability and resource usage

High-performing - Latencies as low as 30 µsec, throughput of millions of messages/sec

Focus on the data, not on the connectivity

Solution – Data-centric architecture

Raw

data

AI

Process

Publish

Decision

Process

Subscribe Publish

CommandCommand

Process

Subscribe

=

Decouple

=

Decouple

Page 5: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 55

Allows a microservice architecture, each component focussed on its task

Move towards Software Defined Machines

Rapid scalability and re-configuration

Solution – Data-centric architecture

Raw

data

AI

Process

Publish

Decision

Process

Subscribe Publish

CommandCommand

Process

Subscribe

ReportReporting

Process

Subscribe

Augment

Process

Subscribe Publish

CommandCommand

Process

SubscribeAdd functionality

Page 6: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 66

AI developers focus on their core task, turning the input data into valuable information

Decouple from other aspects of the system; let others deal with:

• Cloud connectivity – northbound

• Peer-peer communications – east-west

• Operations & control – southbound

• Modbus, PROFINET, CANbus, …

• Enterprise connectivity

AI

Cloud

Enterprise

Operations

Peer-peer

Decouple to achieve focus

IoT needs a lot of other connectivity

Page 7: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 77

Addresses the needs of large-scale mission and business-critical applications, scaling to ultra-large-scale deployments

Open standard for efficient, secure, interoperable, platform- and programming-language independent data sharing between networked devices

Data Distribution Service

Underlying technology

Applications can autonomously and

asynchronously read and write data

enjoying spatial and temporal decoupling

Built-in dynamic discovery isolates

applications from network topology and

connectivity details

Page 8: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 88

Widely deployed by early adopter market

Proven in Defense & Aerospace

Page 9: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 99

Broad Commercial Applications

Page 10: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 1010

Incorporated into further standards

Generic Vehicle Architecture

Page 11: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 1111

Livedata

Livedata

Livedata

Livedata

Standard deployment

AI model training & deployment use case

Training

dataset

Test dataset

Model_ID,Model

Livedata

Livedata

Livedata

Camera_ID,Class,

Confidence

EDGECLOUD / DATA CENTER

Model topic

Operational topic

Page 12: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 1212

Many identical nodes collaborating

Distributed AI – homogeneous system

Training Model

ModelModelInference

result

ModelModelInference

result

ModelModelInference

result

ModelModelInference

result

ModelModelInference

result

Control center

EDGEDATA CENTER CONTROL CENTER

Operational topic

Page 13: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 1313

Central command center analysing traffic flow across a city

Traffic flow example

Camera_ID: CAM101Class: Car | Bus | TruckConfidence: %Direction: N | E | S | W

Model_ID: flow_v10.1Model: blob

DATA CENTER CONTROL CENTER

Model topic

Operational topic

Page 14: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 1414

Data produced by the app is the same; deploy new apps to consume locally

Mesh of intelligent co-operating nodes, “east-west” communication

Traffic flow example

Model_ID: flow_v10.1Model: blob

DATA CENTER CONTROL CENTER

Model topic

Camera_ID: CAM101Class: Car | Bus | TruckConfidence: %Direction: N | E | S | W

Operational topic

Page 15: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 1515

The real world is messy!

AI in the real world

Class: PedestrianConfidence: 71%

Class: PedestrianConfidence: 98%

0%

30%

95%

100%

?

I’m confident of a positive match in the data

I’m confident of no match in the data

Confidence

I’m not really sure whether there is a match in the data

Note: high and low threshold points will be specific to each application, and a relevant risk assessment (probability of occurrence / severity of occurrence)

Page 16: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 1616

For each frame, are we confident to act?

Confidence matters!

Time

Livedata

Livedata

Livedata

Livedata

Livedata

Livedata

Livedata

Publish class& confidence to

enable business logic

?Retrieve

problematic data

Publish data,

class & confidence to

learning cycle

HIGH THRESHOLD

LOW THRESHOLD

Publish class& confidence to

enable business logic

Publish class& confidence to

enable business logic

Publish class& confidence to

enable business logic

Publish class& confidence to

enable business logic

Publish class& confidence to

enable business logic

Co

nfi

de

nce

Page 17: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 1717

Livedata

Livedata

Livedata

Livedata

Example: supervised learning of classification

Iterative AI model training use case

Training

dataset

Test dataset

Model_ID,Model

Model_IDClass,

Confidence

Corner-casedata

Newly tagged training dataset

Problematicdataset

Livedata

Livedata

Livedata

Camera_ID,Class,

Confidence

EDGECLOUD / DATA CENTER

Model topic

Operational topic

Bad data topic

Page 18: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 1818

Multi-layered network implementation

Operations & training systems in parallel

CLOUD / DATA CENTERSubscribes to bad data topic(s)

CONTROL CENTERSubscribes to operational topics

EDGE

Page 19: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 1919

Processing cycle time longer than frame period

Distributed AI – load balancing

Process data

Publish result

Data available

Frame 1

1 2 3 4

Frame 4

5 6

Frame 7

7 8 9 10

Frame 10

Lostdata

Lostdata

Lostdata

Lostdata

Lostdata

Lostdata

Node A:

Data is dropped & lost if processing node is not available

Page 20: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 2020

Livedata

Livedata

Livedata

Livedata

Example: Introduce “worker pattern” to share load

Distributed AI – load-balancing

Livedata

Livedata

Livedata

Operationaltopic

Operationaltopic

Operationaltopic

Livedata

Livedata

Livedata

Raw data is published to be consumed by other “worker” nodes

Node ANode BNode C

Page 21: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 2121

Data processing is interleaved across worker nodes

Distributed AI – load balancing

Data available 1 2 3 4 5 6 7 8 9 10

Process data

Publish result

Frame 1 Frame 4 Frame 7 Frame 10

Node A:

Process data

Publish result

Frame 2 Frame 5 Frame 8

Node B:

Process data

Publish result

Frame 3 Frame 6 Frame 9

Node C:

1 4 7

2 5

3 6 9

8

Page 22: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 2222

Enormous growth fuelled by convergence of technologies

A data-centric solution allows easy and robust scaling to IoT volumes

Developers focus on how to best exploit the data, rather than how to connect everything together

Leads to Software Defined Machines, which can be flexibly reconfigured as requirements change over time, supporting an iterative system evolution

Summary

AI Systems in IoT-Scale Deployments

Page 23: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 2323

Innovative E2E Solutions powered by Deep-Learning Technology

AI at ADLINK

Digital Surveillance & Security

Intelligent Network Video Recorder

Robotics & AGV

Smart Manufacturing

Smart Camera for AOI

Page 24: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 2424

Page 25: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 2525

Additional Use Cases

Page 26: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 2626

European Air Traffic Control

“Vortex is at the heart of the EU next-generation Air Traffic Control System” – CoFlight

DDS is the standard recommended by Eurocontrol/Eurocare

6GB of data in movement~400 Mbps average

ThroughputPan-European Deployment

Page 27: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 2727

World leader in automated agricultural systems have implemented DDS architecture for real-time control and monitoring of their milking robots

Page 28: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

COMBAT MANAGEMENT

Vortex guarantees mission critical reliability and real-time behaviour in large-scale naval combat management systems

Co

nfid

en

tial in

form

atio

n su

bje

ct to n

on

disclo

sure

Page 29: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

Leading EDGE COMPUTING 2929

SMART GREEN HOUSESVortex is used to virtualize sensor data and to distribute actions and

insights

Co

nfid

en

tial in

form

atio

n su

bje

ct to n

on

disclo

sure

C

op

yrig

ht ©

Prism

Te

ch 2

01

7

Page 30: AI Systems in IoT-Scale Deployments - GTC On-Demand ...on-demand.gputechconf.com/gtc-eu/2018/pdf/e8307-ai... · Widely applicable - Available for embedded, mobile, web, enterprise

LAUNCH SYSTEM

80K+ data points with aggregate

updates rate of ~400K msgs/sec

Confidential information subject to non disclosure

Co

py

righ

t ©P

rismT

ech

20

17