EXASOL Dual Data Center
Using a dual data center approach for Disaster Recovery
ARZ Allgemeines Rechenzentrum GmbH 2
Agenda
ARZ – Who are we?
Why Exasol with Dual Data Center?
How does it work?
Prerequisites
Configuration
Switchover
Tooling
Advantages
Financial solution provider in Tyrol
ARZ – Who are we?
• Computing centre for banks in Austria
• Owned by customers
• Mainly Volksbanken and Hypo but also private banks and health institutions
• ~ 500 employees
• Complete service
o IT acquisition
o Workstation Management
o Development and enhancement of the core banking system
o SaaS
o ...
Origins of the Dual Data Center Approach
Why Exasol with Dual Data Center?
A History of Data in ARZ
Data Warehousing at ARZ - 2000
• Used for Basel I reporting regulations
• Core banking system does not keep history of data
• Monthly snapshots
• Only database platform we had: DB2 for z/OS on the IBM mainframe
• Development started in COBOL
• High costs for this platform, as it is primarily used for mission-critical OLTP systems
Start of DB2 for Unix as Reporting Database - 2003
• Creation of Data Marts for Reporting
• Setup of Business Objects (SAP BI) as Reporting Tool
Setup of Datastage as ETL Tool (now IBM Information Server) – 2011
Start of a redesign project for our data platform - 2015
Evaluation Project for new Database Platform
• Started Feb 2015 with 7 Vendors
• Ended July 2015
• PoC with Netezza, Exasol and SAP HANA
• Main Focus was
o Ease of development
o Ease of administration
o Performance
o Dual Data Center Solution
Why Exasol with Dual Data Center?
• ARZ owns 2 data centers
o 16 km line length
o 2x 20 Gb dedicated direct connections between the data centers
o Network latency 120 µs
• Every system connected to our regulatory reporting process has to be clustered across both data centers
o Reports to the national bank may be delayed by a few hours at most
o High penalties for failure to report
• Blackout of one data center once a year as a disaster test
o Varying scenarios every year
o This year: simulation of failure of cooling system
Planning vs. reality
What do we do with Exasol?
Processing Chain
Unload Sources
• Unload from systems that we don’t have direct access to
Load Files / Direct Access
• Load unloaded files
• Load files from external systems (non ARZ systems)
• Load data from systems from within Exasol (using Exasol Connections)
RAW Layer
• Only Data Type checks (through Exasol import)
CDWH Layer
• Data Cleansing
• Reshape into new data model
Compute Cores
• Enrichment
• Analytics
• Master Data Management
Planned Data Processing vs Reality
• After the evaluation project we planned for
o Complete redesign of our data model: 3NF data model enhanced by surrogate keys (SHA-1 over business key); we had a look at Data Vault modeling but didn’t go that way
o 90 GB of RAW Data per day
o 90 days of daily time slices
o Infinite amount of ultimo time slices
• Currently
o Data Model completely redesigned
o 600 GB of RAW Data per day (100 GB compressed)
o Load, data cleansing and reshaping into the new data model take 3 hours
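The surrogate keys mentioned above (SHA-1 over the business key) could look roughly like the following. This is a minimal sketch, not the ARZ implementation: the function name `surrogate_key`, the `|` delimiter and the trim/upper-case normalisation are assumptions made for illustration.

```python
import hashlib


def surrogate_key(*business_key_parts: str, delimiter: str = "|") -> str:
    """Return a SHA-1 hex digest computed over the parts of a business key.

    The normalisation (strip + upper-case) and the delimiter are assumptions;
    the slides only state "SHA-1 over business key".
    """
    # Join with a delimiter so that ("a", "bc") and ("ab", "c") hash differently.
    normalised = delimiter.join(part.strip().upper() for part in business_key_parts)
    return hashlib.sha1(normalised.encode("utf-8")).hexdigest()


# Hypothetical business key consisting of a bank code and an account number.
key = surrogate_key("AT01", "0012345678")
```

Because the key is derived only from the business key, every load process can compute it independently, without a lookup against a central key table.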
Prerequisites
How does it work?
“Common” Exasol installation
9 active nodes per side
1 standby node per side
License server is a VMware image
Can be switched to secondary DC
Uses mirrored storage
Stretch Cluster Setup
Switchover
How does it work?
Switchover
• Check if storage is in sync
o Monitor syslog for segment recovery
o Network problems prior to the switchover might have left the storage out of sync
• Move license server VMware image to secondary DC
• Stop database on primary site
• Start database on secondary site
• Stop nodes on primary site
o Otherwise the master segments will be accessed remotely
Switchback
• Bring up primary nodes
• Wait for segment recovery
• Move License-Server VMWare to primary DC
• Stop database on secondary site
• Start database on primary site
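The two procedures above can be written down as ordered runbooks that stop at the first failing step. This is a minimal sketch under assumptions: the step texts are copied from the slide, while `run_procedure` and the `execute` callback are hypothetical names. How each step is actually carried out against Exasol is not covered by the slides and is left to the callback.

```python
from typing import Callable, List

# Step order as given on the slide.
SWITCHOVER: List[str] = [
    "check storage is in sync",
    "move license server VM to secondary DC",
    "stop database on primary site",
    "start database on secondary site",
    "stop nodes on primary site",
]

SWITCHBACK: List[str] = [
    "bring up primary nodes",
    "wait for segment recovery",
    "move license server VM to primary DC",
    "stop database on secondary site",
    "start database on primary site",
]


def run_procedure(steps: List[str], execute: Callable[[str], bool]) -> List[str]:
    """Run the steps in order and abort on the first step that reports failure.

    `execute` is a hypothetical callback that performs one step and returns
    True on success. Returns the list of completed steps.
    """
    completed: List[str] = []
    for step in steps:
        if not execute(step):
            raise RuntimeError(f"step failed: {step!r}; completed so far: {completed}")
        completed.append(step)
    return completed
```

Aborting on the first failure keeps a later step (for example starting the database on the other site) from running while an earlier precondition, such as storage being in sync, is not met.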
Switchover Problems
• Hard power down (explosion)
o Storage volumes get locked on abnormal power failure
o Exasol Support has to unlock volumes
• When switching to the secondary DC, powering down the primary DC nodes is recommended
o Severe performance degradation when the secondary database accesses master data in the primary DC
• Connect string for apps contains both the active and the passive site
o Longer connect time compared to regular installations
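Why a connect string that lists both sites can lead to longer connect times can be illustrated with a simple probe loop: every host of the passive site that is tried first costs up to one timeout before an active host answers. This is only a sketch of the effect. The host names are hypothetical, port 8563 is assumed, and how the real Exasol drivers order and retry the hosts is not described in the slides.

```python
import socket
from typing import Callable, Iterable, Optional, Tuple

Host = Tuple[str, int]


def tcp_probe(host: Host, timeout: float) -> bool:
    """Return True if a TCP connection to (name, port) succeeds within the timeout."""
    try:
        with socket.create_connection(host, timeout=timeout):
            return True
    except OSError:
        return False


def first_reachable(
    hosts: Iterable[Host],
    timeout: float = 2.0,
    probe: Callable[[Host, float], bool] = tcp_probe,
) -> Optional[Host]:
    """Try the hosts in order and return the first one that accepts a connection.

    Each unreachable host in front of the first reachable one adds up to
    `timeout` seconds to the total connect time.
    """
    for host in hosts:
        if probe(host, timeout):
            return host
    return None


# Hypothetical host list covering both data centers.
HOSTS = [("exa-dc1-n1.example", 8563), ("exa-dc2-n1.example", 8563)]
```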
Tooling
How does it work?
• Custom-built Python script to manage the whole process and more
o Used by Operations Department in ARZ
• Make it foolproof
o Operations has to manage > 1000 systems, so there is no time for complex checks
o Errors may lead to corrupted data, ongoing segment recovery, ...
• Automatic shutdown of nodes on takeover to secondary site
• Automatic startup and wait for segment recovery on takeback from secondary site
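The "wait for segment recovery" part of such a script can be sketched as a polling loop with a timeout. This is an assumption about how one could build it, not the ARZ script itself: `wait_until` and its parameters are hypothetical, and the actual check for finished segment recovery (for example by watching syslog, as mentioned for the switchover) has to be supplied as the `condition` callback.

```python
import time
from typing import Callable


def wait_until(
    condition: Callable[[], bool],
    timeout_s: float,
    interval_s: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> bool:
    """Poll `condition` until it returns True or `timeout_s` seconds have passed.

    Returns True if the condition was met, False on timeout. `sleep` and
    `clock` are injectable so the loop can be tested without real waiting.
    """
    deadline = clock() + timeout_s
    while True:
        if condition():
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)
```

Returning False on timeout instead of continuing lets the calling script refuse to proceed with the takeback, which matches the goal of keeping the procedure foolproof for operations staff.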
Advantages compared to other vendors
Advantages
• Cheap license
o No full license for secondary site required
o Only additional hardware in secondary DC
• Functionality built in
o No extra components required
• Safe even during network outages
o Quorum built in through license server
• No Impact on Query Performance
• Low Impact on Load Performance
o Is dependent on your network connection (latency)
o In our installation < 5 %
Questions
DBeaver Open Source
• Open Source DB-Query Tool
o I built an Exasol plugin for DBeaver (shipped with DBeaver)
• Some enhancements compared to EXAPlus
o Column Auto-Complete
o Direct Table Data Editor
o Multiple Table Import/Export
o Manage Database Sessions
o Support for Virtual Schemas (Exasol V6)
o Display Table Statistics in Details (Size, Compressed Size …)
o Security Browser (Roles, Users)
o “Explain” SQL Feature
• Constantly enhanced
• I’m open to feature requests, bug reports …
• Details https://goo.gl/w2F3Wk
CONTACT
Karl Grießer
Data and Information Management
DB2 LUW, Oracle, Exasol, Imperva
+43 50400 91166