© 2015 IBM Corporation
IBM® PureData™ System for Analytics
N3001 Overview
INTRODUCTION TO NETEZZA
TECHNOLOGY
IBM PureData System for Analytics
The Simple Data Warehouse Appliance for Serious Analytics
What makes it different?
Speed - 10-100x faster than traditional custom systems¹
Simplicity - minimal administration and tuning
Scalability - petabyte+ scale user data capacity
Smart - high performance, advanced analytics
¹ Based on IBM customers' reported results. "Traditional custom systems" refers to systems that are not professionally pre-built, pre-tested and optimized. Individual results may vary.
Purpose-built analytics appliance
Integrated database, server and storage
Standard interfaces
Low total cost of ownership
Traditional Data Warehouses are just too complex
They do NOT meet the demands of advanced analytics on big data.
▪ Too complex an infrastructure
▪ Too complicated to deploy
▪ Too much tuning required
▪ Too inefficient at analytics
▪ Too many people needed to maintain
▪ Too costly to operate
▪ Too long to get answers
Appliances Make It Simple, transforming the user experience.
▪ Dedicated device
▪ Optimized for purpose
▪ Complete solution
▪ Fast installation
▪ Very easy operation
▪ Standard interfaces
▪ Low cost
Evolution of Netezza & PureData System for Analytics
▪ 2003: NPS® 8000 Series. World’s First Data Warehouse Appliance
▪ 2006: NPS® 10000 Series. World’s First 100 TB Data Warehouse Appliance
▪ 2009: TwinFin™. World’s First Petabyte Data Warehouse Appliance
▪ 2010: TwinFin™ with i-Class™ Advanced Analytics. World’s First Analytic Data Warehouse Appliance
▪ 2012: PureData System for Analytics N200x. World’s Fastest and Greenest Analytical Appliance
▪ 2014: PureData System for Analytics N300x. World’s First appliance with no cost encryption
Bon-Ton Optimizes Their Customer’s Experience Using IBM PureData System for Analytics
▪ Targeted advertising to promote products that customers want at the price they want them
▪ Understand what customers want, when they walk into a Bon-Ton store
▪ Freeing the time of Bon-Ton buyers and planners from the mundane task of gathering & compiling customer data so they can spend their time making informed decisions to drive the business
“I need some way to understand what they're thinking, what they're feeling, without having to have contact with them. PureData for Analytics is what's going to help us understand what the customers want when they walk into my stores”
- Paula Post, Vice President Merchandising Optimization.
Video: https://www.youtube.com/watch?v=0gsWOL6gciw
Carphone Warehouse Increases Profitability Through New Revenue
Streams & Reduced Costs
Case Study: http://www-03.ibm.com/software/businesscasestudies?synkey=M183113U13038J58
“The PureData System, powered by
Netezza technology, provided huge
technical advantages & big business
advantages. We can now insure devices on
behalf of a bank in the UK, which we
couldn’t have done before.”
- Paul Scullion, Head of Business Intelligence
Up to 1200X faster performance; reports that once took an hour to run now take seconds
50% reduction in time to market for new business intelligence services
eHarmony Attracts New Members by Understanding Behavior and Fine-tuning Matching Algorithm
96% decrease in query run times (from 1 hour to 2 minutes)
100% increase in subscriber base
Reduced spending on low-return promotional activities
“Through the entire subscription lifecycle, the company tracks everything members do on the website. This process generates an enormous amount of data, which would be completely wasted without the ability to extract hidden insights about how members behave.”
- eHarmony C-Level executive
Video: https://www.youtube.com/watch?v=_0wffNyHn8s
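As a quick sanity check on the run-time figure quoted for eHarmony (1 hour down to 2 minutes), the following minimal sketch computes the percentage decrease and the equivalent speed-up factor. The function names are illustrative only and do not come from the deck.

```python
def percent_decrease(before_seconds: float, after_seconds: float) -> float:
    """Percentage reduction in run time going from before to after."""
    return (before_seconds - after_seconds) / before_seconds * 100


def speedup_factor(before_seconds: float, after_seconds: float) -> float:
    """How many times faster the new run time is."""
    return before_seconds / after_seconds


before = 60 * 60  # 1 hour, in seconds
after = 2 * 60    # 2 minutes, in seconds

decrease = percent_decrease(before, after)
factor = speedup_factor(before, after)
print(f"{decrease:.1f}% decrease, {factor:.0f}x faster")
```

A drop from 1 hour to 2 minutes works out to a little under 97%, which is in line with the roughly 96% stated on the slide.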
Canadian National Railway Company leverages the power of
predictive analytics to run trains on time
Reduction in time spent on running reports: some reports that took 10-20 minutes earlier now run in 5 seconds
Enhanced confidence in data driven decision-making
Accelerated analytics for faster insight: the company is moving to near real time report generation compared to monthly reports earlier
“The performance of PureData is very good,
most reports we have are running in less than
5 seconds where as with other databases we
had reports running for 10-20 minutes”
- Philippe Chartier, BI Team Lead, Information Delivery,
Canadian National Railway Company
Video: https://www.youtube.com/watch?v=yyZu5seKbLI
Seattle Children’s Optimizes Business Intelligence & Insight into New Treatment Protocols to Enrich Patient Care
Promotes self-service business intelligence & insights throughout the hospital
98% reduction in time spent on some queries
More effective diagnosis & treatment by enabling faster, more accurate insights, on-demand
“We’re getting deeper into the data in multiple ways . . . When we see new commonalities in treatments for children, we can design new protocols to provide the best possible care”
- Wendy Soethe, Enterprise Data Warehouse Manager
Video: https://www.youtube.com/watch?v=bjGWIectvkI
THE NEW PUREDATA SYSTEM
FOR ANALYTICS N3001
Announcing the PureData System for Analytics N3001
Big Data and Business Intelligence ready
with capabilities to unlock data’s true potential
Advanced security in an insecure world
at no extra cost
An even broader family of appliance models
to fit a broad range of data capacity needs
Changing the game for data warehouse appliances (again)
and yes, simple is STILL better!
Big Data and Business Intelligence Ready
Unlocking Data’s True Potential
Included with the PureData System for Analytics N3001 (exceptional value provided)
▪ Data Warehouse Appliance: built-in, in-database analytic capability and integration with a variety of 3rd party tools
• Advanced security
• New rack-mountable appliance for midsize organizations
• New 8-rack system for Petabyte+ capacity
▪ Real-time Analytics: InfoSphere Streams Developer Edition, 2 users, non-production licenses
▪ Business Intelligence: Cognos software, 5 Analytics User licenses, plus 1 Analytics Administrator license
▪ Hadoop Data Services: InfoSphere BigInsights software licenses to manage ~100 TB of Hadoop data
▪ Data Integration & Transformation: InfoSphere DataStage 280 PVUs, 2 concurrent Designer Client licenses and InfoSphere Data Click
For additional value
▪ Industry Process & Data Models: models for Banking, Financial Markets, Healthcare, Insurance, Retail, Telco
▪ IBM InfoSphere Data Privacy and Security for Data Warehousing
IBM Netezza Analytics
In-database Analytics For Every Role in Your Enterprise
Bring the analytics to the data, not the data to the analytics
Included
Features
▪ Built-in, in-database analytic functions
- Data mining, prediction, transformations, statistics, geospatial, data preparation
▪ Full integration with tools for BI & visualization
- IBM Cognos, Microstrategy, Business Objects, SAS, MS Excel, SSRS, Kognitio, Qlikview
▪ Full integration with tools for model building & scoring
- IBM SPSS, SAS, Open Source R, Fuzzy Logix
▪ Full integration for custom analytics
- Open Source R, Java, C, C++, Python, LUA
Use cases
▪ Reduce hospital admissions or personalize disease treatments
▪ Achieve an order of magnitude improvement in manufacturing quality
▪ Better understand the risk of catastrophic events
▪ …and many more
Data Preparation | Predictive Analytics | Geospatial Analytics | Advanced Statistics
Business Intelligence
The Power of IBM Cognos with PureData System for Analytics
Rapid deployment of answers to key business questions
Included with PureData for Analytics: IBM Cognos Business Intelligence 10.2.1, 5 Analytics User licenses, 1 Analytics Administrator license¹
Features
▪ Leading Business Intelligence
- Interactive analysis
- Compelling visualizations - web, mobile or email
- Enterprise scalability
▪ Optimized for PureData for Analytics
- Offers high performing OLAP over relational experience
- Cognos Dynamic Query Mode extends benefits of PureData by adding in-memory & caching on top of already fast appliance performance
- Exploits Netezza analytic in-database functions
Use cases
▪ Reporting, analysis, scorecards, dashboards
▪ Data visualization
▪ Mobile business intelligence
▪ … and many others
¹ PureData System for Analytics N3001 must be the data source for Cognos.
Data Integration & Transformation
InfoSphere DataStage, Designer Client and Data Click
Rich capabilities for data integration
Included with PureData for Analytics: IBM InfoSphere DataStage 11.3 (280 PVU Information Server Engine Tier)¹, Designer Client (2 concurrent users), InfoSphere Data Click¹
Features
▪ Ease of Use
- Provides an easy-to-use, top-down, work-as-you-think design interface that enables users to design once and deploy anywhere: batch or real time; extract, transform, load (ETL); or extract, load, transform (ELT)
- Self-service data integration to enhance business agility
▪ Accelerate time to value
- Includes a comprehensive library of transformation components for easily defining common integration processes
Use cases
▪ Integrate, transform and deliver trustworthy information to your data warehouse
▪ Analysts, data scientists or even line-of-business users can easily retrieve data and populate the PureData System for Analytics
▪ Move data from the data warehouse into a subject area data mart
¹ PureData System for Analytics N3001 must be the source or target database.
Hadoop Data Services
Included Capability with IBM InfoSphere BigInsights
Bringing the power of Hadoop to your enterprise
Included with PureData for Analytics: InfoSphere BigInsights 3.0 software licenses for 5 enterprise nodes to manage up to ~100 TB of Hadoop data¹
Features
▪ Big data analytical platform
- Best of open source + IBM technologies
- Big SQL
- High performance SQL access of Hadoop
- Federation across many data sources - combine information from Hadoop and PureData for Analytics
- BigSheets visualization tool
▪ Built-in analytics
- Text analytics, Big R
Use cases
▪ Federated SQL access across Hadoop and your PureData System for Analytics
▪ Pre-processing and landing zone for all data types prior to loading to data warehouse
▪ Queryable backup for cold data
¹ Based on 4 data nodes + 1 master node. 12 TB uncompressed per data node with 4 TB drives. 12 TB x 4 nodes = 48 TB uncompressed. Using 2-2.5x compression yields 96-120 TB compressed data. Capacity will depend on hardware configuration selected.
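The capacity footnote above can be reproduced with a few lines of arithmetic. This is only a sketch of the stated calculation; the constant names are illustrative, and real capacity depends on the hardware configuration selected, as the footnote notes.

```python
# Figures taken from the footnote: 4 data nodes (the master node holds no
# user data), 12 TB uncompressed per data node, 2x to 2.5x compression.
DATA_NODES = 4
TB_PER_DATA_NODE = 12
COMPRESSION_RANGE = (2.0, 2.5)

# Raw (uncompressed) space across the data nodes.
uncompressed_tb = DATA_NODES * TB_PER_DATA_NODE

# Amount of data that fits once compression is applied.
low_tb, high_tb = (uncompressed_tb * ratio for ratio in COMPRESSION_RANGE)

print(f"{uncompressed_tb} TB uncompressed, {low_tb:.0f}-{high_tb:.0f} TB with compression")
```

The resulting 96-120 TB range is where the "~100 TB of Hadoop data" headline figure comes from.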
Real-Time Analytics
Included Capability from IBM InfoSphere Streams
Deploy analytic models on data-in-motion to enable real-time decisions and land data in the warehouse to build the analytic models
Included with PureData for Analytics: InfoSphere Streams Developer Edition 3.2.1, 2 developer users, non-production licenses
Features
▪ Analyze data in motion
- Provides sub-millisecond response times, allowing you to view information and events as they unfold
- Analyze all kinds of data: simple & advanced text, geospatial, acoustics, images, video, sensors
- Eclipse-based development environment
Use cases
▪ Fraud detection
▪ Predict customer churn
▪ Telco real-time mediation and analysis
▪ Real-time monitoring of medical sensors to improve healthcare outcomes
▪ Defect detection in manufacturing
▪ Traffic pattern analysis and management
Accelerating Industry Specific Business Analysis
Accelerate Time to Value with IBM Industry Models
Available
Features
▪ Comprehensive with Built-in Expertise
- Data Warehouse design models, business terminology models and analysis templates
- Experience from >500 client engagements
▪ Solution bundles including data model, data appliance, ETL & Business Intelligence
- Banking, Healthcare, Insurance
Use cases
▪ Risk Management
▪ Wealth and Investment Management
▪ Customer Intelligence
▪ Regulatory Reporting
▪ Health Care Analytics
Industry Vertical Models: Banking, Financial Markets, Healthcare, Insurance, Retail, Telco
Horizontal Model Packs: Customer Insight, Market & Campaign Insight, Supply Chain Insight
IBM InfoSphere Data Privacy and Security for Data Warehousing
Available
Purpose-Built Capabilities
• Achieve and enforce compliance
• Secure and protect sensitive data in appliances
• Reduce costs of attaining enterprise security
Define and Share
• Define a warehousing glossary
• Share sensitive data definitions and policies
• Create project blueprints
Discover and Classify
• Discover / profile data
• Explore lineage and relationships
• Classify sensitive data
Monitor Data Activity
• Monitor data warehouses
• Real-time alerts
• Centralized reporting of audit data
Mask and Redact
• De-identify sensitive data within the warehouse
• Apply obfuscation techniques to both structured and unstructured data
What’s New in PureData System for Analytics N3001
Performance
▪ Faster performance with upgraded CPUs with more
cores
New appliance models
▪ New rack mountable, ultra lite/mini appliance for
midsize businesses
▪ New 8-rack, Petabyte capacity appliance
Security
▪ Improved security with Self Encrypting Drives
▪ Kerberos support
New Netezza Platform Software (NPS) 7.2
▪ Faster load rates
▪ Performance Portal enhancements
▪ and more
Introducing PureData System for Analytics NPS 7.2
New Database Features, Improved Performance and Resiliency
Database
features
Better performance
and reliability
Improved resiliency
and fault tolerance
▪ Faster load rates up to
10 TB/hr
▪ Faster restore rates
▪ WLM throughput and
latency optimization
▪ Enhanced security
enables single sign-on
and centralized
management
▪ New built-in functions
and SQL updates
▪ Portal enhancements
▪ Enhanced Health
Check capabilities
▪ Enhanced storage
topology and
communication fabric
▪ Call Home via https
and SOAP
Introducing PureData System for Analytics N3001-001:
The Mini-Appliance
Bringing speed and simplicity to midsize organizations for big outcomes
• Rack mountable
• Production ready
• Full function appliance
• User data capacity 16 TB*
• High availability - All redundant
hardware, 4 disk spares, hot swap
power supply
• Self encrypting drives, Kerberos
support, LDAP/Active directory
Solution Highlights
*Assumes 4x compression
▪ Simple
Same user experience as all PureData System for
Analytics appliances
• Full function Netezza Platform Software with IBM
Netezza Analytics
• Support tools and Netezza Performance Portal
• ODBC/JDBC/OLE-DB/SQL Driver integration
Load and go with no tuning or administration
▪ Speed
10-100x faster than traditional custom systems¹
▪ Smart
Rich set of in database analytic functions
Protection of all data from unauthorized access
Includes starter kits for Big Data and Business Intelligence
▪ Agile
Easily incorporated into the data center with simplified
installation into an existing rack
▪ Affordable
Purchase or lease
¹ Based on IBM customers’ reported results. “Traditional custom systems” refers to systems that are not professionally pre-built, pre-tested and optimized. Individual results may vary.
Introducing PureData System for Analytics N3001-080
8-rack System
▪ 1.5 PB of user data capacity¹
▪ Hosts: 2x x3750M4 and 600 GB Self Encrypting Drives
▪ Blades: 56x HS23 with 20 core IvyBridge processors
▪ Storage: 96 EXP2524 disk enclosures with 24x 600 GB Self Encrypting Drives
¹ Assumes 4x compression
IBM DB2 Analytics Accelerator
Enhanced with PureData System for Analytics N3001
Benefits
▪ Extreme performance for complex queries
- Up to 2000x performance improvements
▪ Cost Savings
- Offload complex queries and eliminate
costly query tuning
- No need to create or maintain indices
- Improves access to and lowers the cost of
storing, managing and processing historical
data
▪ Integrated with DB2 z/OS and inherits
mission critical features such as security and
recoverability
▪ Access to DB2 Analytics Accelerator is
transparent to applications and users
▪ Fast deployment and time to value
- Installation is non-disruptive
- Plug it in, load data and go in 1-2 days
A high performance
appliance that integrates
Netezza technology with
zEnterprise technology to
deliver dramatically faster
business analysis
Highlights – What’s New?
The PureData System for Analytics N3001
provides additional benefits to DB2 Analytics
Accelerator customers:
▪ Advanced security through encryption of data
at rest with self-encrypting disks
▪ Performance improvements for analytic
workloads
▪ Improved serviceability with the recently
introduced automatic call home capability
▪ Broader range of SQL compatibility
Available
PureData System for Analytics Family
N2002
▪ 10-100x faster than custom systems¹
▪ 3.3x faster I/O scan rate²
▪ Load and go, no tuning
▪ Designed to run complex analytics in minutes, not hours
▪ Rich set of in-database analytics
N3001-xxx
…plus
▪ Entitled software capability for real-time analytics, Hadoop data services, data movement and business intelligence
▪ Advanced security
▪ Partial rack to 8-rack configurations
N3001-001
…plus
▪ Rack mountable appliance
▪ Ideal for small and medium business with up to 16 TB of user data
DB2 Analytics Accelerator for z/OS (now with N3001)
▪ The hybrid computing platform integrating Netezza technology with zEnterprise technology
▪ Supports transaction processing and analytic workloads concurrently, efficiently & cost effectively
▪ Accelerates complex queries, up to 2000x faster
▪ Required security compliance with Data-at-Rest Encryption
¹ Based on IBM customers' reported results. "Traditional custom systems" refers to systems that are not professionally pre-built, pre-tested and optimized. Individual results may vary.
² Comparing N1001 scan rate of 145 TB/hour to N2002 scan rate of 478 TB/hour
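The "3.3x faster I/O scan rate" claim follows directly from the two scan rates quoted in the footnote. A minimal check, with variable names chosen here for illustration:

```python
# Scan rates quoted in the footnote, in TB per hour.
N1001_SCAN_TB_PER_HOUR = 145
N2002_SCAN_TB_PER_HOUR = 478

scan_ratio = N2002_SCAN_TB_PER_HOUR / N1001_SCAN_TB_PER_HOUR
print(f"N2002 scans {scan_ratio:.1f}x faster than N1001")
```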
The PureData System for Analytics N3001 Family
Specification      N3001-001    N3001-002     N3001-005     N3001-010  N3001-020  N3001-040  N3001-080
Racks              n/a, 2 x 2U  1 (1/4 full)  1 (1/2 full)  1          2          4          8
Active S-Blades    n/a          2             4             7          14         28         56
CPU cores          40           40            80            140        280        560        1,120
User data (TB) *   16           32            96            192        384        768        1,536
* Assuming 4x compression
Single rack systems: N3001-002, N3001-005, N3001-010
Multiple rack systems: N3001-020, N3001-040, N3001-080
Linear Scalability!
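The "Linear Scalability" claim can be checked against the family table itself. The sketch below encodes the table rows as data (the `Model` class and the fractional `racks` encoding for quarter and half racks are assumptions made for this example) and confirms that, from one full rack upward, blades and user data grow in direct proportion to the rack count, and that every model with S-Blades has the same number of CPU cores per blade.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Model:
    name: str
    racks: Optional[float]    # None for the rack-mountable N3001-001
    s_blades: Optional[int]   # None where the table says n/a
    cpu_cores: int
    user_data_tb: int         # assuming 4x compression, as in the table


FAMILY = [
    Model("N3001-001", None, None, 40, 16),
    Model("N3001-002", 0.25, 2, 40, 32),
    Model("N3001-005", 0.5, 4, 80, 96),
    Model("N3001-010", 1, 7, 140, 192),
    Model("N3001-020", 2, 14, 280, 384),
    Model("N3001-040", 4, 28, 560, 768),
    Model("N3001-080", 8, 56, 1120, 1536),
]

# Models made of one or more full racks.
full_rack = [m for m in FAMILY if m.racks is not None and m.racks >= 1]

# Collect the distinct per-unit ratios; one value per set means linear scaling.
cores_per_blade = {m.cpu_cores // m.s_blades for m in FAMILY if m.s_blades}
blades_per_rack = {m.s_blades / m.racks for m in full_rack}
tb_per_rack = {m.user_data_tb / m.racks for m in full_rack}

print(cores_per_blade, blades_per_rack, tb_per_rack)
```

Each set collapses to a single value (20 cores per blade, 7 blades and 192 TB per full rack), which is what linear scaling looks like in these figures. The partial-rack models are left out of the per-rack ratios because their capacity is not a simple fraction of a full rack.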
WHAT MAKES THE N3001
BETTER?
What are the Demands of a Modern Data Warehouse?
Faster Insight
▪ Fast response times are
expected
▪ People are used to an
experience as easy as
▪ Users do not want to wait
for query results
Insight
Cost
Agility
Lower cost
▪ Initial acquisition
▪ Ongoing operation and
administration
▪ Total cost of ownership
Added Agility
▪ Ability to respond quickly to
the needs of the business
▪ By simplifying operations,
more time is provided for
innovation
▪ Better business outcomes
by utilizing more data
sources
PureData System for Analytics Delivers on the Demands of a
Modern Data Warehouse
Faster Insight | Lower cost | Added Agility
“BCBSMA has combined IBM Cognos
Business Intelligence with an IBM
Netezza data warehouse appliance to
provide lightning-fast analysis of
medical and financial data. The
solution creates sophisticated reports
on clinical and financial risk and
operational efficiency.”
- Shashikanth Vangala, Manager & Chief
Solutions Architect of Business Intelligence,
Blue Cross Blue Shield of Massachusetts
“That simplicity cannot be
underrated. It is just
amazingly simple to do very,
very large scale things that in
any other environment takes
engineering, just to pull off.”
- David Birmingham, Senior
Consultant, Brightlight Consulting
“we tested PureData Systems late
last year on a set of very complex
use cases and we found,
compared to earlier architectures, it
was performing 2-3 times faster
on batch processes and anywhere
from 3-10 times better on our
concurrent workload”
- John Naduvathusseril, Chief Data Architect,
Nielsen Company
What do the Best-in-class Data Warehouses Deliver?¹
▪ Faster information delivery: 99% of users are satisfied with speed of information delivery (46% industry average)
▪ Analytical tools that are easy to use: 97% are satisfied with ease-of-use of analytical tools (44% industry average)
▪ Easy access to required data: 97% of users are satisfied with access to data needed to support decisions (51% industry average)
¹ Source: Aberdeen Group, The Best-in-Class Data Warehouse: Fast, Simple, Impactful, May 2014.
PureData System for Analytics Delivers
Faster information delivery
Easy access to required data
Analytical tools that are easy to use
“Making decisions based on data instead of intuition or gut feeling is better. There is
already a greater demand from users for data to support day-to-day operations –
solutions such as the InfoSphere Business Glossary empower them with this
information so that they can work more autonomously and efficiently.”
- Philippe Chartier, BI Team Lead, Information Delivery, Canadian National Railway Company
“With the IBM PureData System for Analytics, we can reduce the time to analyze
complex GIS data from days to minutes—a more than 98 percent improvement.”
- Steve Trammell, Strategic Alliances Marketing Manager, Esri
“We knew that our IBM SPSS Modeler software could scale to meet our needs;
the limitation was on the hardware and data warehousing side. Instead of having
separate databases and servers for each client, we wanted to build a single,
multi-tenant platform that could support a cloud-based service for the entire
business. In the IBM PureData System for Analytics, we found the answer.”
- Patrick Ritto, CTO, FleetRisk Advisors
Comparing PureData System for Analytics with Teradata
Teradata has …
▪ 2.6x higher personnel costs¹
▪ 3.4x more DBAs required¹
▪ 33% higher 3-year TCO¹
▪ 3.8x higher deployment costs¹
…than the IBM PureData System for Analytics
¹ ITG: Comparing Costs and Time to Value with Teradata Data Warehouse Appliance, May 2014.
Comparing PureData System for Analytics with Oracle
Oracle has …
▪ 3x more DBAs required¹
▪ 45% higher 3-year TCO¹
▪ 3.5x higher deployment costs¹
…than the IBM PureData System for Analytics
¹ ITG: Comparing Costs and Time to Value with Oracle Exadata Database Machine X3, June 2014.
Synergy with Data Integration and Reporting & Analysis Tools
Data In (SQL, ODBC, JDBC, OLE-DB): Data Integration
▪ IBM
▪ BigInsights
▪ Information Server
▪ InfoSphere Streams
▪ Ab Initio
▪ Hadoop
▪ Informatica
▪ Microsoft
▪ Oracle
▪ SAP
▪ SAS
▪ Others using standard ODBC/JDBC/OLE-DB/SQL
Data Out (SQL, ODBC, JDBC, OLE-DB): Reporting & Analysis
▪ IBM
▪ Cognos
▪ SPSS
▪ Campaign
▪ Hadoop
▪ Information Builders
▪ Microsoft
▪ MicroStrategy
▪ Oracle
▪ SAP
▪ SAS
▪ Tableau
▪ Others using standard ODBC/JDBC/OLE-DB/SQL
Note: Sample list, not all inclusive
PureData System for Analytics Overview: Model N3001
Scales up to 8 full Racks
Terabyte to Petabyte+ Capacity
▪ User Data Capacity: 192 TB¹
▪ Data Scan Speed: 478 TB/hr*
▪ Load Speed (per system): 10+ TB/hr
▪ Power Requirements: 7.5 kW
▪ Cooling Requirements: 27,000 BTU/hr
2 Hosts (Active-Passive)
▪ 2 Intel Ivy Bridge CPUs
▪ 5 x 600 GB SAS Self Encrypting Drives
▪ Red Hat Linux 6 64-bit
7 PureData for Analytics S-Blades™
▪ 2 Intel 10 Core Ivy Bridge CPUs
▪ 2 8-Engine Xilinx Virtex-6 FPGAs
▪ 128 GB RAM + 8 GB slice buffer
▪ Linux 64-bit Kernel
12 Disk Enclosures
▪ 288 600 GB SAS2 Self Encrypting Drives
• 240 for User Data
• 14 for S-Blades
• 34 Spare
▪ RAID 1 Mirroring
¹ Assuming 4X compression
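The drive and core counts on this slide are internally consistent, which the short sketch below verifies. The figure of 24 drives per enclosure is taken from the N3001-080 slide and assumed here to apply to the single-rack model as well; the variable names are illustrative.

```python
# Drive allocation as listed on the slide.
drive_roles = {"user_data": 240, "s_blades": 14, "spare": 34}
total_drives = sum(drive_roles.values())

# Assumption: 24 drives per enclosure, as stated for the 8-rack N3001-080.
ENCLOSURES = 12
DRIVES_PER_ENCLOSURE = 24
enclosure_total = ENCLOSURES * DRIVES_PER_ENCLOSURE

# 7 S-Blades, each with 2 ten-core CPUs.
S_BLADES = 7
blade_cores = S_BLADES * 2 * 10

# User data capacity is quoted after 4x compression.
USER_DATA_TB = 192
uncompressed_tb = USER_DATA_TB / 4

print(total_drives, enclosure_total, blade_cores, uncompressed_tb)
```

The 288 drives match 12 enclosures of 24, the 140 cores match the N3001-010 column of the family table, and the 192 TB headline corresponds to 48 TB before compression.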
Simplify …
Move Analytics into the Data Warehouse
▪ Integrate the
server, storage
and database
into one
optimized
package
▪ Move complex
analytics into
the database
▪ Leverage proven
technology that
accelerates
analytics with no
tuning or storage
administration
Server | Storage | Database | Analytics
INDUSTRY SPECIFIC
BENEFITS
Optimizing Offers and Cross Sell in Banking with… The New PureData System for Analytics N3001
Speed the analysis of customer data for improved insight
Optimizing Offers and Cross Sell Use Case Goals
• Improve response rates to offers for increased revenue
• Improve cross-selling for increased wallet share
• Improve customer advocacy through improved offer targeting
Capabilities Provided by PureData for Analytics
• Speed the analysis of customer data for improved insight
• Improve the cycle time for predictive models to continuously improve offer prediction accuracy
• Encrypt data at rest with self-encrypting disks for improved customer data security
Data Preparation | Predictive Analytics | Geospatial Analytics | Advanced Statistics
Business Outcomes
• Asian bank increased credit card marketing response rate more than 300%
• European bank increased key client interaction performance metrics by 98%
Fraud Detection and Mitigation in Banking with… The New PureData System for Analytics N3001
Speed the analysis of fraud data for better fraud detection and prevention
Fraud Detection and Mitigation Use Case Goals
• Reduce fraud losses and lower costs to fight fraud
• Improve fraud detection to stop fraud before significant impact
• Improve customer satisfaction with reduction in fraud false positives
Capabilities Provided by PureData for Analytics
• Speed the analysis of fraud data for improved fraud detection
• Improve the cycle time for predictive fraud models to continuously improve prediction accuracy
• Encrypt data at rest with self-encrypting disks to protect against security threats
Data Preparation | Predictive Analytics | Geospatial Analytics | Advanced Statistics
Business Outcomes
• Global securities exchange reduced the time to run market surveillance by 99%
• Japanese bank improved analysis speed by 90% for improved money laundering detection
WHERE THE N3001 FITS IN THE
LOGICAL DATA WAREHOUSE
Enterprise Data Warehouses have evolved into Logical Data Warehouses which optimize access and reduce costs
Data Sources: Transactional, Social, Application, User Generated, Journal, Video and Audio, Machine / Sensor, Documents, Third Party
Internal Insight
Reporting
Enterprise
Content
Discovery
Exploration
Decision
Management
Predictive
Analytics
Visualization
External-Facing
Applications
Web or Mobile
Systems of
Engagement
Information Governance
Real-time Analytics
NoSQL Doc
Store
Data Warehouse Deep Analytics,
Modeling
Transactional
Systems
Landing,
Exploration,
Archive
Reporting,
Analytics
Logical Data Warehouse
IBM Fluid Query – Powering the Logical Data Warehouse
On Premise | Cloud
▪ In the world of big data, can you
really afford to move all your data
to the analytics?
▪ Intelligently route queries to the
correct data store
▪ Simplify and unify information
access for end users and
applications
▪ Access all data within the logical
data warehouse for analytics and
business insight
Move the query to the data, not the data to the query
Question / Answer
Data stores: Hadoop, Data Warehouse, Data Mart, Operational, Other
IBM Fluid Query – Powering the Logical Data Warehouse
Within both PureData and BigInsights
Data Warehouse: PureData System for Analytics
▪ Run Hadoop queries from your data warehouse and move data to/from Hadoop via IBM Fluid Query 1.0
Hadoop: IBM BigInsights for Apache Hadoop
▪ SQL access to data across any system from Hadoop, including relational data via IBM Big SQL
Other Sources
IBM Fluid Query 1.0
Cross platform query & data movement
between PureData System for Analytics and Hadoop
Question
Answer
Unifying PureData System for Analytics with Hadoop
Hadoop Queries
Data Movement
IBM Big SQL adds to the capability of Fluid Query
Cross platform query & data movement
from Hadoop to PureData System for Analytics
Answer
Question
Unifying PureData System for Analytics and Hadoop
PDA Queries
Data Movement
Cloudera and Hortonworks can access PureData but only offer “Fluid Data” not Fluid Query
Queries move all data back to Hadoop and do the filtering there
This is inefficient
Big Data is about moving “Little Data”
Question / Answer
Always Data Movement
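The difference between moving the data to the query and moving the query to the data can be illustrated with a toy simulation. This sketch does not use Fluid Query or any Hadoop API; the row layout, the `region` filter and the function names are all hypothetical, and plain Python lists stand in for the two systems. It only counts how many rows would cross the network under each approach.

```python
from typing import Callable, Dict, List, Tuple

Row = Dict[str, object]


def make_warehouse_rows(count: int) -> List[Row]:
    """Build sample rows; one row in ten belongs to the 'EU' region."""
    return [{"id": i, "region": "EU" if i % 10 == 0 else "US"} for i in range(count)]


def move_all_then_filter(rows: List[Row], predicate: Callable[[Row], bool]) -> Tuple[List[Row], int]:
    """Ship every row to the requesting system, then filter there."""
    moved = list(rows)  # every row crosses the network
    result = [row for row in moved if predicate(row)]
    return result, len(moved)


def filter_then_move(rows: List[Row], predicate: Callable[[Row], bool]) -> Tuple[List[Row], int]:
    """Run the filter where the data lives and ship only the matches."""
    result = [row for row in rows if predicate(row)]
    return result, len(result)


def is_eu(row: Row) -> bool:
    return row["region"] == "EU"


rows = make_warehouse_rows(1000)
result_a, moved_a = move_all_then_filter(rows, is_eu)
result_b, moved_b = filter_then_move(rows, is_eu)

print(f"same answer: {result_a == result_b}, rows moved: {moved_a} vs {moved_b}")
```

Both approaches return the same answer, but filtering at the source moves only the matching rows, which is the "Little Data" point the slide makes.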
IBM Fluid Query Use Cases
Discovery, Exploration and Archive
▪ Land all data in Hadoop for discovery, exploration & “day 0” archive
▪ Queries originating on Hadoop can explore data stored on PureData (Big SQL)
▪ Queries originating from PureData can reach into Hadoop to combine Hadoop data with data in the data warehouse
Multi-temperature data management
▪ Run queries combining hot data from PureData with colder data from Hadoop
▪ Utilize Hadoop as part of the logical data warehouse
▪ The overall database can be split between PureData and Hadoop based upon frequency of access, with hot tables or hot data in tables on PureData and colder, less frequently accessed data residing on the Hadoop distribution
Data Warehouse Capacity Relief and Disaster Recovery
▪ Offload colder data from PureData to Hadoop to relieve resources on the data warehouse
▪ Copy data to Hadoop as a disaster recovery solution (can be queried in an emergency)
▪ Back up your database to Hadoop, in an immutable format
Queryable Archive
▪ Query archived data on Hadoop with Big SQL or from PureData
▪ Utilize IBM Big SQL to combine Hadoop data with other data sources
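The multi-temperature idea above, splitting tables between PureData and Hadoop by frequency of access, might be sketched as a simple placement rule. Everything here is hypothetical: the threshold, the table names and the access counts are invented for illustration, and a real deployment would base the decision on actual workload statistics.

```python
from typing import Dict, List

# Hypothetical cut-off: tables queried at least this often per month are "hot".
HOT_THRESHOLD = 100


def place_tables(access_counts: Dict[str, int], hot_threshold: int = HOT_THRESHOLD) -> Dict[str, List[str]]:
    """Assign hot tables to PureData and colder tables to Hadoop."""
    placement: Dict[str, List[str]] = {"PureData": [], "Hadoop": []}
    for table, count in sorted(access_counts.items()):
        target = "PureData" if count >= hot_threshold else "Hadoop"
        placement[target].append(table)
    return placement


# Invented monthly query counts per table.
placement = place_tables({
    "sales_2015": 950,
    "sales_2009": 3,
    "customers": 400,
    "clickstream_raw": 12,
})
print(placement)
```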
How do I get Fluid Query 1.0?
Included free as a feature in Netezza Platform Software (NPS) for PureData System for Analytics appliances
IBM Fluid Query download
Specifications
Data Warehouse Appliance Requirements
Machine   Models         Minimum Software requirements
TwinFin   N100x systems  NPS 7.0.2 and IBM Netezza Analytics 2.5
Striper   N2001          NPS 7.0.4 and IBM Netezza Analytics 2.5.4
          N2002          NPS 7.1 and IBM Netezza Analytics 3.0
Mako      N3001          NPS 7.2 and IBM Netezza Analytics 3.02
Supported Hadoop Providers
▪ IBM BigInsights 2.1, 3.0
▪ Cloudera 4.7 and 5.3
▪ Hortonworks 2.1, 2.2
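As an illustration of how the minimum NPS levels in the requirements table could be checked programmatically, the sketch below compares an installed version string against the table. The dictionary keys and helper names are assumptions for this example, only the NPS column is covered, and this is not an IBM-provided tool.

```python
from typing import Tuple

# Minimum NPS release per model, copied from the requirements table.
MIN_NPS = {
    "N100x": "7.0.2",
    "N2001": "7.0.4",
    "N2002": "7.1",
    "N3001": "7.2",
}


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn a dotted version such as '7.0.4' into a tuple of integers."""
    return tuple(int(part) for part in text.split("."))


def meets_minimum(model: str, installed_nps: str) -> bool:
    """Return True if the installed NPS release is at or above the minimum."""
    required = parse_version(MIN_NPS[model])
    installed = parse_version(installed_nps)
    # Pad with zeros so that '7.1' and '7.1.0' compare as equal.
    width = max(len(required), len(installed))
    required += (0,) * (width - len(required))
    installed += (0,) * (width - len(installed))
    return installed >= required


print(meets_minimum("N2002", "7.1.0"), meets_minimum("N2001", "7.0.2"))
```

Comparing versions as integer tuples rather than strings avoids the mistake of ranking "7.10" below "7.2".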
WAP12710-USEN-04