Data Analytics
Nagesh Madhwal
Client Solutions Director, Consulting,
Southeast Asia, Dell EMC
3 Dell - Internal Use - Confidential
Next 15 yearsBusiness-centric
Cloud-Native Applications
Prescriptive Analytics
Agile Infrastructure
Internet of Everything
Last 15 yearsIT-centric
Traditional Applications
Traditional Analytics
Rigid Infrastructure
Internet
4 Dell - Internal Use - Confidential
Evolution of Workloads
Your business must evolve.
Future success is achieved
by unlocking ALL DATA.
6 Dell - Internal Use - Confidential
When it comes to Applications
and Data Analytics
TIME is everything.
Dell - Internal Use - Confidential7 of Y
The Power of “Now”
Business Event “Moment of Impact”
Data Captured
Intelligence Delivered
Decision Taken
Valu
e
Time
Real time Batch
Big Data
Opportunity Missed Opportunity “Too Late To Take Action”
Dell - Internal Use - Confidential8
Traditional Analytics
Limited to static data
Reactive and slow
Restricted sources and access
Dell - Internal Use - Confidential9
Traditional ..
Data Warehouse
Reporting
Source Systems
Files
Sources 1 to x
Batch Data SourcesExisting DBs, ERP
Staging Transform
ation
ETL Data Marts
Analytics
Dell - Internal Use - Confidential10
Traditional…
Data Warehouse
Reporting
Source Systems
Files
Sources 1 to x
Batch Data SourcesExisting DBs, ERP
Staging Transform
ation
ETL
Data Marts
Analytics
ETL
ETL
ODS
Dell - Internal Use - Confidential11
Modern Analytics
Analyze ALL data
Deliver anywhere, anytime analytics
Empower your end users
Dell - Internal Use - Confidential12
The Data Lake
12
ETL Offload | Analytics as a Service | Data Science | Decision Support |
Data Visualization | Executive Reporting | Predictive Modeling | Threat Analysis
Structured Unstructured
Managed using NoSQL
Static Schema
RDBMS/EDW
Dynamic Schema
Hadoop Eco Sys
file types: videos, pdf, ppt,
mp3, doc, email, pics
Managed using SQLdata types: numeric, currency,
alphabetic, name, date, address
Sources:
• ERP
• CRM
• SCM
• POS
Sources:
• Social
• IOT
• Text
• Geo
• Media
Use Cases /APPS
Kafka
BigSQLSqoop
Spark Streaming
Hive
FlumeImpala
HAWQ…
Application-Integrated Protocols
Dell - Internal Use - Confidential13
Modern Data Lake Environment
All Data
Fed Into
The
Data
Lake
HADOOP
DATA LAKE
ETL
- Exploratory, Ad Hoc
- Unpredictable Load
- Experimentation
- Loosely Governed
- Best Tools
- Production
- Predictable Load
- SLA Drive
- Heavily Governed
- Standard Tools
DWH
MPP
Analytics
Sandbox
Analytics / Sandbox Environment
Data Prep & Enrichment
ISILON SCALE OUT
NAS STORAGE
Foundation of your Data Management and Analytics
Architecture
Active Archive
BI / DWH Environment
Dell - Internal Use - Confidential14
Reference Architectures
DATA LAKE (HADOOP)RDBMS
MACHINE IOT
STATISTICAL MODELING/NLP EXPLORATION
TRANSFORM
BI
ORGANIZE MANAGE/CATALOG
DATA WAREHOUSESTREAMCEP
NEARREAL-TIME
MODELS MAY TAKE HOUR OR DAYSQUERIES MAY RETURN IN SECONDS OR MINUTES
SECONDS
SEARCH/INDEX
ENTERPRISE LOG ANALYSIS
APPLICATIONS
3rd PARTY
SOCIAL MEDIA SQL ON HADOOP
Dell - Internal Use - Confidential15
Big Data Journey To BDaaS
BDaaSAgility
Valu
e
Control
Prototyping
Dev/Test and Pre-Production Lifecycle Management
< 20 Nodes
Template Libraries, Shared Data Across Clusters, Rapid Prototyping and Evaluation
Dev / Test Lab
Multiple Hadoop Distros, Spark, Transient Workloads
Departmental
Hadoop and Spark in a Secure Production Environment
20+ Nodes
Performance, Security, Capacity Prioritization, Compute/ETL Offload,
Separate Compute/Storage
Dev/Test/Stage/QA/UAT
Improve Utilization, Scaling, Consistent Data
Big-Data-as-a-Service
Multi-Tenant Hadoop and Spark Deployment On/Off Premise
50+ Nodes
Multi-Tenancy, Self-Service, Logs, APIs, Tenant/Admin
Controls, Shared Data Lake with Access Controls
Heterogeneous Production Environments, Diverse Tenants
and User Groups
Support Multiple LOBs, Dynamic Resource
Management/QoS, Automation
Dell - Internal Use - Confidential16
Multi-Tenant Big-Data-as-a-Service
Multiple lines of business, multiple user
groups
Multiple use cases
Multiple ecosystem products
(including non-Hadoop, BI/ETL
tools)
Compute isolation between tenants
Multiple environments
per tenant
Multiple versions and/or
distributions
Data isolation by tenant (incl.
ability to physically isolate storage)
Data/Storage
Prod Dev/Test POC Prod Dev/Test
Data Isolation
Data Isolation
MARKETING R&D MANUFACTURING
360 Customer View Log Analysis Predictive Maintenance
MARKETING R&D MANUFACTURING
Shared, Centrally Managed Server Infrastructure
Compute Isolation
Compute Isolation
Dell - Internal Use - Confidential17
Analytics
Infrastructure
Integration
Range of solutions
44% of organizations still struggle with how to approach Big Data
S e r v i c e s
18 Dell - Internal Use - Confidential
Data Analytics Journey
19 Dell - Internal Use - Confidential
Big Data Systems
• VxRail
• VxRack
• Vblock
• XC Series
20 Dell - Internal Use - Confidential
Big Data Foundations
• Servers
• Storage
• Networking
• Software
21 Dell - Internal Use - Confidential
Big Data Solutions
• Reference Architectures
• Engineered Solutions
• Customized Designs
22 Dell - Internal Use - Confidential
BUSINESS
TECHNOLOGY
DEPLOYASSESS PROVE
Big Data
Proof of Value
Big Data
Proof of
Technology
Big Data
Applied Analytics
Implementation
Big Data
Technology
Implementation
Big Data Vision
Workshop
Big Data Technology
Advisory
DELL EMC Big Data
Portfolio Implementation
Global Services
Dell - Internal Use - Confidential23 of Y
Subject Details
Workshop Objective To understand the key business initiatives and requirements for big data in order to understand where
and how to start the big data journey
Workshop Vision How to become data driven through the use of big data and analytics
Workshop Duration Half Day or 1 Day on customer site
Workshop Agenda 1) Business & IT Goals
2) Business Initiatives
3) Current Environment Review
4) Use Cases
5) Data Sources Review
6) Data Science / Analytics / BI Requirements
Recommended types of participants: IT and Business Users
Expected Outcomes 1) List of business opportunities and use cases
2) Business Value and Feasibility
3) Identify and prioritize data sources mapped with use cases
4) Prioritized used cases with potential business value and impediments
5) Document workshop results
6) Big Data Technology Roadmap with clear next steps
Next Step After Workshop Proof of Value (POV) or Proof of Technology (POT)
Dell EMC Workshop Team Head of Big Data Practice SEA, Head of Consulting SEA, Dell EMC Account / Sales Manager
Example of a Big Data Workshop
24 Dell - Internal Use - Confidential
Customer Success Stories