where do we go from here? lessons and landmarks from … · gateway with kafka spark streaming...
TRANSCRIPT
Where do we go from here?Lessons and Landmarks from Real-World Cisco UCS Big Data Deployments
Han Yang, Senior Product Manager, Cisco
Raghunath Nambiar, CTO, Cisco
BRKINI-2021
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Cisco Spark
Questions? Use Cisco Spark to communicate with the speaker after the session
1. Find this session in the Cisco Live Mobile App
2. Click “Join the Discussion”
3. Install Spark or go directly to the space
4. Enter messages/questions in the space
How
cs.co/ciscolivebot#BRKINI-2021
“Extracting value from information is not primarily a matter of how much data you have . . ., it’s how much you use them to create new or better approaches to doing business.”
Less than half of structured data is actively used in decision making
Less than 1% of unstructured data is analyzed or used at all
May, 2017
How are you managing
your data as part of
your business strategy?
https://hbr.org/2017/05/whats-your-data-strategy
• What is your data
acquisition strategy
from new product and
services?
• Are you consolidating
your existing data sets
into a data hub?
• Do you have AI center
of excellence to pool
talent?
What is Your Corporate Strategy in the World of AI?
Andrew Ng:
• Stanford Professor
• Led Google Brain
• Drove AI at Baidu
EmTech, Dec, 2017: https://www.youtube.com/watch?v=NKpuX_yzdYs
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
15,000 206
31 8,000
connected athletes countries
buildings rooms
9 milliontickets sold & validated
85,000soldiers &
policemen
70,000volunteers
Olympic Village with
fro
m
25,000media personnel
42sports
in
Every person, venue,
device was
connected.
It was the most
connected event in
the history
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
The first truly connected Olympics
5 billion
1 trillion visitors to official Olympic web sites
TV spectators from more than 200
countries. That’s about 66% of world population…
hours of online video contentthat’s about equal to one person watching 20 years of video without a break
hours of video watched via mobile
170,000
130 million
54 million data feeds transported and processed
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Education: Connected
classrooms over a stable,
secure, reliable network.
Crisis Management:
Protect your business by
seeing what’s happening in
real time.
Health Care Management:
Electronic Medical Records
to spot trends in chronic
diseases.
the possibilities are endless. …
Be more effective. Make better decisions. Protect and secure.
Operations: Massive scale of technology and products to
plan, design, optimize your data center.
BRKINI-2021 10
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Lessons being learned from smart cities
finance, construction, transportations, media, utilities, health
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Cities
1990: 10
2015: 29
2030: 41
1990: 270
2015: 488
2030: 662
Mega-Cities >1M Pop.
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Today, cities consume 75% of worlds resources and
generates 80% of greenhouse gases
2015
3.9 Billion
2030
5.1 Billion
Urban Population Growth
4.3 Billion
2020
There are several areas of improvement
Make cites cleaner, safer, efficient and sustainableSmart City
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Smart Cities: Dream vs. Reality
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Areas of
Improvement
Energy
Transportation
Environmental Monitoring
Safety
Emergency Preparedness
Make the City Smart
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Are You the Cause of Analytics Fumble?
Traditional IT Project Big Data Analytics Project
Typical Project
• Install ERP System
• Optimize supply chain performance
Typical Project
• Understand customers’ needs and
behavior
• Predict future growth markets
Overarching Goals
• Improve efficiency
• Lower Cost
Overarching Goals
• Challenge assumptions & bias for
decision
• New insight into customer
Project Structure
• Develop detailed plans to deploy,
manage, and train
Project Structure
• Discovery driven: Hypothesis -> Get
Data -> Experiment
BRKINI-2021 17
Types of Analytics
1. Business Analytics
2. Big Data Analytics
3. IT Operation Analytics
4. IoT Analytics
5. Machine Learning
BRKINI-2021 19© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Areas of FocusCisco Validated Designs, Case Studies, Performance Benchmarks
Business
Analytics
Big Data
Analytics
IT Operations
Analytics
IoT
Analytics
AI, ML/DL
Analytics
VSOM
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
IOT Data
Sources
Web Data Sources
Streaming Data Sources
Gateway with Kafka Spark Streaming
Compute Nodes
HDFS
No SQL
Hbase,
Cassandra
….
Apps
Analytics DB
Se
rvin
g L
aye
r
Speed Layer Batch Layer
Lambda Architecture on UCS
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Cisco UCS at all layers, fully validated architectures
with all major players
Kafka
Cisco UCS C240
Real-Time Data Store
UCS C220/C240Data Inject (CoAP/MQTT.XMPP Data Processing
Batch
Real-Time
D
a
t
a
A
n
a
l
y
t
i
c
s
F
o
g
F
o
g
F
o
g
F
o
g
F
o
g
F
o
g
Big Data Store
UCS C240/C3160Serving Layer
Speed Layer Batch Layer
DATA
Aggregator
Cisco UCS C240C800/UCS Mini/UCS C240
ISR 8x9 with 4G LTE and Dual
802.11n a/g/n (WiFi) Radios
Managed by Cisco FogDirector
Cisco UCS Infrastructure for Big Data & Analytics
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
IT is accountable for:
Becoming proactive and predictive
Meeting/exceeding SLAs
Reducing expenses
Providing end-to-end operational visibility
MORE
DEVICES
MORE
DATA
MORE
APPSDelivered
Continuously
Multi-vendor, siloed IT
infrastructures create management
complexity
MORE
MOBILITY
Network Compute OS ApplicationsStorage
Need for ITOA
IT silos create obstacles
to consistent application,
infrastructure performance
Lack of visibility puts SLAs in jeopardy
BRKINI-2021 24
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Required: Real-time Insights, Comprehensive VisibilityIT operations analytics demands innovation in analytics software and infrastructure
Analytics Software Analytics Infrastructure
• Single software platform integrates across
infrastructure silos, enabling visibility to data anywhere
• Flexibility to identify, analyze new data sources
• Fast time to value
• Comprehensive IT management functionality
to improve IT productivity
+Monitoring Incident
Mgmt
Problem
Mgmt
Capacity
Mgmt
+ +Low TCOConsistent, split-
second
response times
Highly
scalable
+ +
IT needs an analytics platform that can meet their needs today and tomorrow
BRKINI-2021 25
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
IT Operations Analytics on Cisco UCS with SplunkSplunk provides powerful analytics software platform for IT operations
End-to-end operations visibility
Enhances productivity
Fast time-to-value
Full continuum of IT management
Smartphones and Devices
Online Services
PublicCloud
Security
Telecoms
NetworksWeb
Services
Web Clickstreams
RFID
Call Detail Records
On-Premises
GPS Location
Databases
Packaged Applications
Energy Meters
Storage
PrivateCloud
Servers
Messaging
Desktops
OnlineShopping Cart
MonitoringProblem
Management
Capacity
Management
Incident
Management
Index Untapped Data:
Any Source, Type, Volume
+ + +
BRKINI-2021 26
• Internet Of Things Market To Reach $267B
• 50% of IoT spending will be driven by manufacturing,
transportation, logistics, and transportation
• IoT Analytics is predicted to generate $21.4B
https://www.forbes.com/sites/louiscolumbus/2017/01/29/internet-of-things-market-to-reach-267b-by-2020/
By 2020
Yet in the IoT world, smooth sailing is rare
26%
of all surveyed companies
are successful with their
IoT Initiatives
Only…
BRKINI-2021 31© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Remote Data
• IoT data can be very
remote
• Costly to send all data
back to data center
• Does all data need to be
sent back to data
center?
BRKINI-2021 32© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Discovery
• Data discovery is an
essential part of data
scientist job
• How do you know what
type of analytics and
models will result?
BRKINI-2021 33© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Investment
• IoT project requires
investment in sensors,
network, analytics . . .
• How to make the
investment when
discovery can lead to
different type of data
collection and analytics
requirements?
BRKINI-2021 34© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Challenges and Opportunities for IoT Data Lifecycle
Remote Data
Raise
Abstraction
Discovery
Edge
Analytics
Investment
Multiple
Options
BRKINI-2021 35© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Data Mgmt. & Pattern Detection
using ESP
Data: power
frequency,
voltage, current,
phasor angle, …
Cisco UCS
Pattern Detection
using ESP
Visualization of
streaming data using
SAS ESP
Streamviewer
Application Life
Cycle Management
(Cisco FogDirector)
Update
Streaming
models
Connect to
external apps
with ESP
connectorsCisco UCS
Cisco UCS
Pattern discovery using SAS
Visual Analytics and SAS Visual
Statistics
Cisco
ISA
KAFKA
Edge Enterprise
Visualization of streaming data using
SAS ESP Streamviewer
Edge to Enterprise IoT Analytics
BRKINI-2021 36© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Data Mgmt. & Pattern Detection
using ESP
Data: power
frequency,
voltage, current,
phasor angle, …
Cisco UCS
Pattern Detection
using ESP
Visualization of
streaming data using
SAS ESP
Streamviewer
Application Life
Cycle Management
(Cisco FogDirector)
Update
Streaming
models
Connect to
external apps
with ESP
connectorsCisco UCS
Cisco UCS
Pattern discovery using SAS
Visual Analytics and SAS Visual
Statistics
Cisco
ISA
KAFKA
Substation Operations Center
Visualization of streaming data using
SAS ESP Streamviewer
Edge to Enterprise IoT Analytics: Electric Utility
BRKINI-2021 37© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Local and Global Analytics
Global AnalyticsLocal Analytics
Frequency
BRKINI-2021 38© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
IR 8x9
• Ruggedized Integrated Services Router
• Runs SAS analytics as data is being gathered
• Securely send relevant summary back to data center
Integrated Infrastructure for Big Data
• Industry leading Cisco UCS
• Tested and validated configuration
• High performance & availability
Cisco Infrastructure
IR 829IR 809
BRKINI-2021 39© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Data Mgmt. & Pattern Detection
using ESP
Data: power frequency,
voltage, current, phasor
angle, …
Cisco UCS
Pattern Detection
using ESP
Visualization of
streaming data
using SAS ESP
Streamviewer
Application Life Cycle Management (Cisco FogDirector)
Update Streaming
models
Connect to external apps
with ESP connectorsCisco UCS
Cisco UCS
Pattern discovery using SAS
Visual Analytics and SAS Visual
Statistics
Cisco ISA
KAFKA
Visualization of streaming data using SAS ESP Streamviewer
Remote Data and Raised Abstraction
Substation Operations Center
BRKINI-2021 40© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Data Mgmt. & Pattern Detection
using ESP
Data: power frequency,
voltage, current, phasor
angle, …
Cisco UCS
Pattern Detection
using ESP
Visualization of
streaming data
using SAS ESP
Streamviewer
Application Life Cycle Management (Cisco FogDirector)
Update Streaming
models
Connect to external apps
with ESP connectorsCisco UCS
Cisco UCS
Pattern discovery using SAS
Visual Analytics and SAS Visual
Statistics
Cisco ISA
KAFKA
Visualization of streaming data using SAS ESP Streamviewer
Remote Data and Raised Abstraction
Part of Data Pipeline
Reduce Bandwidth
Raise Abstraction
Substation Operations Center
BRKINI-2021 41© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Data Mgmt. & Pattern Detection
using ESP
Data: power frequency,
voltage, current, phasor
angle, …
Cisco UCS
Pattern Detection
using ESP
Visualization of
streaming data
using SAS ESP
Streamviewer
Application Life Cycle Management (Cisco FogDirector)
Connect to external apps
with ESP connectorsCisco UCS
Cisco UCS
Pattern discovery using SAS
Visual Analytics and SAS Visual
Statistics
Cisco ISA
KAFKA
Substation Operations Center
Visualization of streaming data using SAS ESP Streamviewer
Discovery and Edge Analytics
Update Streaming
models
BRKINI-2021 42© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Data Mgmt. & Pattern Detection
using ESP
Data: power frequency,
voltage, current, phasor
angle, …
Cisco UCS
Pattern Detection
using ESP
Visualization of
streaming data
using SAS ESP
Streamviewer
Application Life Cycle Management (Cisco FogDirector)
Connect to external apps
with ESP connectorsCisco UCS
Cisco UCS
Pattern discovery using SAS
Visual Analytics and SAS Visual
Statistics
Cisco ISA
KAFKA
Substation Operations Center
Visualization of streaming data using SAS ESP Streamviewer
Discovery and Edge Analytics
Update Streaming
models
Discovery -> Updated Edge Analytics Model
LoRa ™ Network Features
Low Cost Minimal infrastructure
Low cost end-node
Open SW
Long Range Greater than cellular
Deep indoor coverage
Star topology
Max Lifetime Low power optimized
10-20yr lifetime
>10x vs cellular M2M
Multi-Usage High capacity
Multi-tenant
Public network
Semtech Proprietary and Confidential 12
Field Area Network (Wi-SUN)
AMI smart meteringDistribution automationStreet lightingO&G wellhead monitoringWater/wastewater
Low Power Long Range Wireless (LPWA – LoRA)
SP IoT InfrastructuresBattery powered sensorsEnvironmental monitoringSmart Cities, parking, and AgricultureSP cell tower monitoring
Remote Asset Monitoring
Pipeline monitoringRoadside infrastructureDistribution automationATMsDigital Signage
Fleet VehiclesMass Transit
Automated Vehicle Location tracking, Data Uploaded in Seconds with 4G/LTEHandles Multiple Wireless Laptops, Smartphones, Tablets Simultaneously
Premium Mobile Broadband (PMB)
Public safety and security CPE
CGR1000
SDKIR500
IR910
IR8x9 +
LoRA
Modem
(future)
819H IR829
IR809
IOT Field Network Director/Industrial Operations Kit – Zero Touch Provisioning, Firmware upgrade,…
IOT Software Platform – Fog Computing, BYOI,…
819H
IR829
IR809
819H
Investment with Multiple Options
CGR
M
M
M
M
M
M6
M
M
M
PLC
RF
RF
RF
M
M
M M
M
M
M
M
M M
M
M M M M
M7 M3
M2M8
M5
M4
RF
RFMM
M
MM
M
M
M
M
M
M
M
PLC
PLC
RF
RF
M
M
M
M
M
M
M
M
M
M1
BRKINI-2021 45© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
• Expands computational capability of Connected Grid Router
• Specifications:
• 4-core AMD GX410 x86 800 MHz
• 4 GB DDR Memory
• Storage:
• 64GB (50GB available for apps)
• 128GB (100GB available for apps)
• Gigabit Ethernet and two USB interfaces
• -40C to +65C\
CGR 1000 Compute Module
CGR 1000
CGR Compute Module
BRKINI-2021 46© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Cisco UCS Integrated Infrastructure for Big Data and AnalyticsFifth Generation of Reference Architectures and Bundles
UCS B200 M5 UCS-SP-C220M5-A2 UCS-SP-C240M5-A2 UCS-SPC240M5L-S1 UCSS-SP-S3260-BV
• Network: 2x 6332• Servers: 8 X UCS B200 M5 Blades• Server Type: B200 M5 Blade servers• CPU: 2x Intel Xeon Processor Scalable
Family 6132 (2x14 cores , 2.6 GHz)• Memory: 192GB 2666 MHz• Boot: M.2 with 2x480GB SSDs• Storage: 2 x Cisco 2.5-inch 7.7-TB
HGST SN200 NVMe• VIC: VIC 1340• Storage Controller: Cisco FlexStorage
PCIe SSD pass-through module with HDD cage
• Network: 2x 6332• Servers: 8 X UCS-SP-C220M5-A2• Server Type: C220 M5 SFF• CPU: 2x Intel Xeon Processor Scalable
Family 6132 (2x14 cores , 2.6 GHz)• Memory: 192 GB 2666 MHz• Boot: M.2 with 2x480GB SSDs• Storage: 10 x 1.6TB EV SATA SSDs SFF• VIC: VIC 1387• Storage Controller: Cisco 12 Gbps SAS
Modular RAID Controller with 2GB FBWC or Cisco 12 Gbps Modular SAS HBA
• Network: 2x 6332• Servers: 16 X UCS-SP-C240M5-A2• Server Type: C240 M5 SFF• CPU: 2x Intel Xeon Processor Scalable
Family 6132 (2x14 cores , 2.6 GHz)• Memory: 192 GB 2666 MHz• Boot: M.2 with 2x480GB SSDs• Storage: 26 x 1.8TB 10K rpm SAS HDD
SFF or 12 x 1.6TB EV SATA SSDs SFF• VIC: VIC 1387• Storage Controller: Cisco 12 Gbps SAS
Modular RAID Controller with 4GB FBWC or Cisco 12 Gbps Modular SAS HBA
• Network: 2x 6332• Servers: 16 X UCS-SPCC240M5L-S1• Server Type: C240 M5 LFF• CPU: 2x Intel Xeon Processor Scalable
Family 4110 (2x8 cores , 2.1 GHz)• Memory: 192 GB 2666 MHz• Boot: M.2 with 2x480GB SSDs• Storage: 12 x 8TB 7.2K rpm SAS HDD
LFF plus 2 x 1.6TB EV SATA SSDs SFF• VIC: VIC 1387• Storage Controller: Cisco 12 Gbps SAS
Modular RAID Controller with 2GB FBWC or Cisco 12 Gbps Modular SAS HBA
• Network: 2x 6332• Servers: 8 X Cisco UCS S3260 Storage server • Server Type: S3260 (2 x servers)Each server node with:o CPU: 2 Intel Xeon processor E5-2680 v4
CPUs (2 x 14 cores, 2.4 GHz) o Memory: 256GB 2400 MHzo Boot: 2 x 480GB SSDso Storage: 24 x 6TB 7.2K rpm SAS HDD LFFo VIC: VIC 1387o Storage Controller: Cisco 12 Gbps SAS
Modular RAID Controller with 4GB FBWC
BRKINI-2021 47© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
IoT Analytics
• IoT has many challenges
& opportunities
• Remote Data: Raise
Abstraction
• Discovery: Edge Analytics
• Investment: Multiple
Options
BRKINI-2021 49© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
What is Artificial Intelligence, Machine Learning, Deep Learning?
BRKINI-2021 50© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Artificial Intelligence, Machine Learning, and Deep Learning
F(x)
Deep
Learning
Artificial
Intelligence
Machine
Learning
Artificial IntelligenceTechnique where computer can
mimic human behavior
Machine LearningSubset of AI techniques which use
algorithms to enable machines to learn from
data
Deep LearningSubset of ML techniques which uses multi-
layer neural network to learn
BRKINI-2021 51© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Examples of Artificial Intelligence, Machine Learning, and Deep Learning
F(x)
Deep
Learning
Artificial
Intelligence
Machine
Learning
Artificial IntelligencePlay Chess: Find all moves and
prune off bad moves
Machine LearningBased on size and price of neighboring
houses, predict price of new house on
market
Deep LearningGiven lots of images of cats and dogs, is
there a cat or dog in a new photo?
BRKINI-2021 52© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
5 Levels of Artificial Intelligence
Level Status
Generality Solved
Learning without being Taught
Solved
Transfer Learning Not Yet
Common Sense Not Yet
Self Awareness Still Mystery
https://www.huffingtonpost.com/entry/human-level-ai-how-far-are-
we_us_59ecc013e4b092f9f241931e
BRKINI-2021 53© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
ComputerData
ProgramOutput
ComputerData
ProgramOutput
Traditional Programming
Supervised Machine Learning
BRKINI-2021 54© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Magic?
No, more like gardening
• Seeds = ML Algorithms
• Nutrients = Data
• Gardener = Data Scientist
• Plants = Programs
BRKINI-2021 55© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
ComputerTraining Data
ProgramOutput
Supervised Machine Learning
ComputerNew Data
New Output
Machine Learning Inferencing
Program
Training
BRKINI-2021 56© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
• Linear regression is the “go-to” analytic techniques
• Very easy to explain the result, especially in regulated industry like finance
• May not model nonlinear data very well
Machine Learning Algorithm: Linear Regression
Is there a relation between
these 2 variables?
https://hbr.org/2015/11/a-refresher-on-regression-analysis
BRKINI-2021 57© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Puppy or Muffin?
Deep Learning: Breakthrough in Image Recognition
P
BRKINI-2021 58© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Deep Learning = Neural Networks
Each Edge has
Unique Weight
Training Algorithm Computes the Weight of Each Edge
BRKINI-2021 59© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Deep Learning is Matrix Math Intensive
• Gradient Descent Error Function and its Partial Derivatives
Gradient Descent Error Equation
and Partial Derivatives
BRKINI-2021 60© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
How Are Customers Deploying ML?
BRKINI-2021 61© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Apache SparkML
• Apache Spark: Leading
open source analytic
software
• SparkML: Machine
learning packages
integrated with Apache
Spark
• Part of big data Hadoop
software stack
• Uses x86 CPUs
Machine
Learning
BRKINI-2021 62© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Deep Learning Frameworks
• Leading Frameworks
• TensorFlow, PyTorch,
Caffe, Theano
• Not part of Hadoop
distributions
• Run on CPUs or GPUsF(x)
Deep
Learning
BRKINI-2021 63© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Big Data is a Prerequisite for DL
“Data is to AI
what food is to
humans”Berry Smyth, Professor of
Computer Science at University
College Dublin
F(x)
Deep
Learning
BRKINI-2021 64© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
New Data Continues to Come In: Retrain
BRKINI-2021 65© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Enterprise Software with ML Features
• SAP Leonardo Machne
Learning
• Part of SAP Cloud Platform
(PaaS that can run on-
premise too)
• Create applications with ML
capabilities
• SAS Viya
• Companion to SAS
Foundation
• Includes ML algorithms to
complement statistic
techniques
• Accelerate with GPUs
Machine
Learning
F(x)
Deep
Learning
BRKINI-2021 66© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Infrastructure Impact on ML Performance
BRKINI-2021 67© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public22
WHY ARE GPUs GOOD FOR DEEP LEARNING?
GPUs deliver --- same or better prediction accuracy- faster results- smaller footprint- lower power- lower cost
Neural
NetworksGPUs
Inherently
Parallel
Matrix
Operations
FLOPS
Bandwidth
[Lee, Ranganath & Ng, 2007]
BRKINI-2021 68© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Many Machine Learning Algorithms
BRKINI-2021 69© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Many Machine Learning Algorithms
Each Algorithm has Many Parameters that Can Affect Performance:
Want to Try Combinations of Hyper Parameters
BRKINI-2021 70© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
• Bigger Training Data Set -> Better Training
• Bigger Test Data Set -> Better Testing
• Limited Amount of Data
• Cross Validate to use all the data for both training and testing
• Pick the best performing model
Cross Validation: More Training Data Sets
Total Data Set
Train Test
TrainTest
TrainTest
Train Test
BRKINI-2021 71© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Number of Models to Train: Performance Matters
Number of ML
Algorithms: X5
Hyper Parameters: X10
TrainTest
TrainTest
Train Test
Cross Validation: X10
New Data: Start Over
Empirical Process:
Lots of Experiments
Required
Not Accurate: Troubleshoot
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Cisco Unified Computing Systems Momentum
Source: 1 IDC, 2016 Q3, Nov 2016, Vendor Revenue Share
Source: As of Cisco Q2FY16 earnings results. Data Center Revenue is defined as Cisco UCS and Nexus 1000V
Integrated Infrastructure
(Cisco UCS, Nexus)
#1
Americas Revenue
Market Share in x86 Blades
#1
Global Revenue
Market Share in x86 Blades
#2
World Record Performance
Benchmark
130+
Total Storage Deployed in last
12 months
1 EXABYTE
Big Data Revenue
Growth in 4 Years
18x
Customers
60,000+
of Fortune 500 Have
Invested in UCS
>85%
HyperFlex Customers
2000+
BRKINI-2021 73
BRKINI-2021 74© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Architectural Challenges
Multiple
Management
Tools
Scale Up Converged Infrastructure
Hyperconverged
Infrastructure
Silos
Fabric
Sprawl
Complex Integration APIs APIs APIs APIs
Scale Out
X X X
Scale Out
Single Point of
Management
Scale Up Converged Infrastructure
Hyperconverged
No Silos
Unified
Fabric
Seamless
IntegrationOpen API
• Continual Optimization
• Automation
• End-to-End Security
Actual State Desired State
BRKINI-2021 75© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Why Cisco UCS for Big Data and Analytics
World Record Performance
#1 - TPCx-HS Benchmark
Easy to Scale
Unified Fabric for Network,
Storage, and Management Traffic
Simplified Management
Single Pane of Glass for
Thousands of Nodes
UCS Manager(Service Profiles)
Management
Ethernet
Storage
Proven Solutions
Validated Designs with Industry
Leaders
UCS 6200
Series Fabric
Interconnect
UCS Manager
16 Servers
Per Rack
• UCS Domain (160 Servers
(with FEX)
• 80 Servers direct connect)
• Manage by UCS Manager
• Up to 11.2 PB storage
• Multiple UCS Domains
• Interconnect using Nexus
7000/9000
• Scalable to 1000s of servers
• Centrally manage by UCS
Central
Scale from Small to Very Large Clusters
BRKINI-2021 77© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Management Simplicity
Simplified ScalabilityEasily Scale your infrastructure from few servers to thousands of servers with a fully Integrated Infrastructure
Centralized ManagementService Profiles for Servers• Manage all servers centrallyApplication Profiles for Network• Manage all network centrally
UCS Service Profile Cisco ACI Application Profile
Big Data: Management nightmare
Hundreds of Servers,
Thousands of management points
BRKINI-2021 78© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Cisco Intersight
Infrastructure-as-a-Service and Orchestration
Third Party Integrations: Infrastructure and Toolchains
Global Resource Pooling and Policy Management
Policy-Based Automation
Unified Element Management
UCS Capabilities Today
Transition to SaaS
SaaS Consumption Model
Free customers from care and feeding of management tools and eliminate upgrade dependencies
Seamless Extensibility
Simplify management across technologies and geography
Continuous Feature Integration
Rapid development, delivery and customer feedback
SaaS ModelCisco-Hosted CloudCustomer-Hosted CloudPartner-Hosted Cloud
Traditional Delivery ModelOn-prem software and HW-embedded tools
BRKINI-2021 79© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
SaaS Simplicity
ActionableIntelligence
SaaS Delivered
Intuitive Experience
Enhanced Support
Proactive Guidance
Secure and Extensible
Analytics PoweredCisco Intersight
Centralized Management
Global Policies
Comprehensive Automation
Single Pane of Glass
Cisco Intersight
HSph-Hadoop Sort per Hour
This provides a normalized value of how much data is generated, sorted, and validated in one hour for the scale factor
(divide by 30 for a 30TB run). This is the performance of the system under test. Higher HSph is better
Price/HSph: Price per Performance
This divides the total cost of the System under test (inclusive of hardware, software, license cost, and 3 year 24x7x4
support) along with discount and divide by Performance (above HSpH). Lower price/performance is better
*As of 30-May-2016. Visit www.tpc.org for latest results
TPC Express Benchmark HS: Industry’s first standard for benchmarking big data systems to provide the industry with
verifiable performance, price-performance and availability metrics of hardware and software systems dealing with Big Data
TPC Members
Why Cisco UCS for Big Data? – Proven Performance
BRKINI-2021 81© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
What is a CVD?http://www.cisco.com/go/bigdata_design
It is a comprehensive co-validated design guide documented to facilitate faster, more reliable and more predictable customer deployments.
The document includes the solution overview, architecture design, configuration of different components involved, along with sizing and scaling guide. It includes every step involved from elevation diagrams to configuring storage, network, compute, operating system and application and Bill of materials.
BRKINI-2021 82© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Mainstream Computing
Cisco UCS is a Revolutionary System Any Workload. Any Scale. One Platform.
HyperconvergedInfrastructure
Converged InfrastructureROBO
Fifth Generation UCS
HyperFlex Systems
UCS MiniE-Series
Scale Out
UCS S3000 Series
C-Series Rack Servers
UCS Integrated Infrastructure Solutions
Cisco UCS: It’s not a Server. It’s a System.
Unified Management, Single Control Plane, Single API
Edge Cloud
BRKINI-2021 83© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Enterprise Performance
Next Generation UCS M5 Portfolio Performance Optimized for Bare Metal, Virtualized, and Cloud Applications
Rac
kC240 M556 Cores – 20% more cores per
socket
100% more memory
80% more NVMe drives
Up to 2 GPUs
C220 M5
56 Cores – 20% more cores
per socket
100% more memory
80% more NVMe drives
B200 M5
100% more memory
55% more storage – 20.5TB
Up to 2 GPUs
Cloud Scale
S3260Ideal Capacity-Optimized
Platform for Large Object
Storage at Scale
(Purley Skylake refresh
targeting Q4’CY17)
Intensive/Mission Critical
C480 M5
Up to 6 GPUs
63% more drive bays
94% more NVMe support
B480 M5
Up to 4 GPUs
80% more storage – 39TB
20% better memory bandwidth
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Summary
• Analytics is critical to business differentiation
• Working with lines of business in a collaborative, exploratory fashion is the key to successful big data and analytics project
• Multiple types of analytics
• Business
• Big Data
• IT Operations Analytics
• Internet of Things (IoT)
• Machine Learning / Deep Learning
Cisco has Analytics Solutions Based on UCS with
Management, Scale, and Performance
BRKINI-2021 84
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Cisco Spark
Questions? Use Cisco Spark to communicate with the speaker after the session
1. Find this session in the Cisco Live Mobile App
2. Click “Join the Discussion”
3. Install Spark or go directly to the space
4. Enter messages/questions in the space
How
cs.co/ciscolivebot#BRKINI-2021
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
• Please complete your Online Session Evaluations after each session
• Complete 4 Session Evaluations & the Overall Conference Evaluation (available from Thursday) to receive your Cisco Live T-shirt
• All surveys can be completed via the Cisco Live Mobile App or the Communication Stations
Don’t forget: Cisco Live sessions will be available for viewing on-demand after the event at www.ciscolive.com/global/on-demand-library/.
Complete Your Online Session Evaluation
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Continue Your Education
• Demos in the Cisco campus
• Walk-in Self-Paced Labs
• Tech Circle
• Meet the Engineer 1:1 meetings
• Related sessions
87BRKINI-2021