1530-1600 dany el khoury sap hana in your big data strategy
DESCRIPTION
TRANSCRIPT
SAP HANA IN YOUR BIG DATA STRATEGY JUPITER PLATFORM
VCE Confidential © 2014 VCE Company, LLC. All rights reserved.
SAP AND BIG DATA
Combination of ERP data and big data enables powerful new use cases
Federation of structured and unstructured data is key
Myriad of platforms, tools, and skills is typically the challenge
BW ECC CRM
Hadoop Social
SMS
Mobile records
sensor data
trade data
Media
Big Data
HANA
Etc…
© 2014 VCE Company, LLC. All rights reserved. VCE Confidential
Analytics
Data Governance
Data Sources
Big Data Platform
Tools & Consulting
MASTER DATA MANAGEMENT
D A T A QUALITY
PLANNING / WHAT-IF PREDICTIVE T E X T
ANALYTICS
UNSTRUCTURED STRUCTURED
D A T A INTEGRATION
DATA SCIENTIST
VISUALIZATION REPORTING DASHBOARDS
COLLABORATION
TRAINING SOLUTION DESIGN/
DEVELOPMENT
O T H E R S TRANSACTIONAL
D A T A MACHINE
D ATA SOCIAL EXTERNAL
BIG DATA ANALYTICS– WHERE DOES JUPITER
FIT IN?
© 2014 VCE Company, LLC. All rights reserved. VCE Confidential
Data Lake for SAP
JUPITER INTEGRATES
Flexible and fast ETL Federated Query
VMware software
BMMsoft software
VCE hardware
© 2014 VCE Company, LLC. All rights reserved. VCE Confidential
Data
JUPITER IN ESSENCE
Big Data Solution for SAP HOT/WARM/COLD data
Utilizing existing components and partners
Big Data reference architecture including infrastructure
Simplifies access to multiple data sources and types
Excellent platform for very high data volume and velocity use cases
Hot Warm Cold
HANA SybaseIQ HADOOP
Storage
Data Protection
Disaster Tolerance
Storage Storage
Compute Compute Compute
Network
© 2014 VCE Company, LLC. All rights reserved. VCE Confidential
EMC ETL Storage
SAP IQ, SAP HANA, Netezza,
Oracle, Exadata, Hadoop
Data Management, Access Control, Alerts, Auto-Classification,
Collaboration, Taxonomy, Data Retention, Connectivity, Search API
ED
MT
AP
I &
Con
ne
cto
rs
E
D
M
T
ETL
(INGEST) Real-time ETL Parser, Metadata
Manager, Parallel Loader
EMC DB Storage
Cisco blades Cisco blades
DB Servers ETL & App Servers
EDMT Data Access & Analysis Layer EDMT GUI
Web Services
Data Export
Proxy Mobile
GUI
eDiscovery, Audit, Fraud
Modules
Social Net Analysis
JUPITER ARCHITECTURE
Da
ta
Acce
ss
So
ftw
are
&
Da
tab
ase
In
fra
str
uctu
re
EMC HANA Storage
Cisco blades
HANA scale out
EMC unified Data Protection
EMC unified Disaster Tolerance
VMware, VMware & EMC Operational reporting
© 2014 VCE Company, LLC. All rights reserved. VCE Confidential
JUPITER 3-MILLION CHANNEL ETL “Reaction”
starts here
DB Server ( SW+DB_HW )
1,000,000–channel
Ingest engine for SQL data
+100B_rows/day/channel
1,000,000-channel
Ingest engine for Emails,
SMS, IM etc.
+1TB/day/channel Node 1,000
“Event”
occurs here Event-to-Reaction Tim =0.2-2 sec
Node 1
Node 1,000
Node 1
Node 1
Node 1,000
1,000,000-channel
Ingest engine for Docs and
Multimedia(aud/vid/img)
+1TB/day/channel
© 2014 VCE Company, LLC. All rights reserved. VCE Confidential
FEDERATED JUPITER(*): HANA, IQ, OTHER DBS
HANA
(MPP Shared Nothing)
IQ Multiplex
(Multi-node MPP Shared disk)
Disk storage
SAN
EDMT®
Federated
ETL
using
federated
data model
EDMT®
Federated Query:
(federated query,
final merge,
metadata
management,
retention, access
control, partitioning,
HA/DR,
replication…)
Other DBs : Oracle, Exadata, Netezza, Sybase ASE, MySQL, Hadoop
© 2014 VCE Company, LLC. All rights reserved. VCE Confidential
FEDERATED JUPITER– THE BENEFITS
1. Higher Data Scalability, Ingest speed – When single-DB CAN’T handle data size (i.e. +2,000 PB) or ingest speed (+PB/hour)
2. Geo distances – Not limited by LAN/SAN/Ethernet distance – WAN is OK
3. Selective Replication for HA, DR – Controlled replication for HA, DR and B/R purposes
4. Hetero-DB – Include hetero-DB in the ETL and Fed_query
5. Operational requirements/benefits – Upgrades, life-cycle management etc.
6. Fed Jupiter ETL/Query knows the purpose/config. of ALL Fed “members”
VCE Confidential © 2014 VCE Company, LLC. All rights reserved.
WHAT PROBLEMS IS JUPITER SOLVING ?
1. Data Volume: Jupiter has Multi-PB scalability - Certified to 10 PB – new benchmark ongoing
2. Data Velocity: Jupiter has Multi-PB/day loading speed
3. Data Variety: Jupiter stores “std” SQL data and unstructured data (sensor, files, email/sms, social net, multimedia..) in single RDBMS(**)
4. Value (Low Cost): Jupiter is price competitive with Big Data solutions based on Hadoop
5. Jupiter is different from “new” solutions
1. Uses proven Enterprise SW+HW: SAP DBs (HANA, IQ, ASE etc.) and VCE Vblock
2. Jupiter is fully compatible with enterprise DBs, Apps and Reporting/Analytic tools
3. Jupiter is more reliable because it uses proven enterprise SW+HW
4. Unifies struct.+unstruct. while “new solutions” process unstructured data only
5. Much lower cost-per-TB than “std” enterprise apps and lower price than “new” solutions
Super Scale
Extreme Speed
Lower Cost
Simplified
✔
✔
✔
✔
Jupiter Platform - Summary
QUESTIONS