présentation ibm infosphere mdm 11.3
DESCRIPTION
Présentation d'Aomar BARIZTRANSCRIPT
InfoSphere User Group France
MDM v11.3 New Features
Aomar BARIZ Information Governance Client Technical Professional Mobile:+33 6 73 48 40 72 E-mail: [email protected]
InfoSphere MDM – Release Timeline
eGA: June 2013
Target eGA:
Q4 2014
eGA: Oct. 2013
This Release
Target eGA: June 27, 2014
v11.0 refresh
vNext
v11.0
V11.3
• Continued Integration of Development
organization (all MDM Flavors)
• Simultaneous delivery of all supported
platforms
• Simultaneous release with a number of
InfoSphere offerings (incl. Information
Server)
2
• InfoSphere MDM probabilistic matching engine for BigInsights:
Enhances bulk search and performance for BigInsights matching.
• Salesforce.com integration: Improves Salesforce.com experience by
leveraging InfoSphere MDM for search, remediation, and integration with
external data. Invokes virtual MDM.
• IBM Health Care Provider data warehouse: Enables patient-centric
analytics by pushing a 360-degree view of the patient into the
warehouse.
• Clinical data services: Integrate patient clinical data for research, care
coordination, and wellness initiatives.
• Application-based licensing: Offers new licensing and pricing options
when acquiring MDM in support of other IBM products.
• InfoSphere DataStage integration: Reduces implementation time and
cost by speeding and simplifying integration development both for initial
load and ongoing production integration.
MDM Industry
/LOB Solutions
• Collaboration Server free text search: Improves user experience with a
more intuitive search facility powered by IBM Watson Explorer.
• Virtual MDM performance: Enhances control of record linking to reduce
the risk of system performance issues.
Core MDM
Enhancements
Overview of Key InfoSphere MDM v11.3 Features
MDM for
Big Data
IIG Portfolio
Integration
3
InfoSphere MDM v11.3 (Bali)
Collaborative MDM Free Text Search with
Watson Explorer
4
Positioning – Free Text Search with Watson Explorer
5
What are the business or technical benefits of
solving this problem(s)?
What problem(s) and for whom (role, industry)
does this capability solve?
How is this capability different in solving this
problem than competitive offerings?
How does this capability solve this problem(s)?
• Searching PIM systems is not intuitive and
often requires training
• Search performance becomes an issue when
dealing with large volumes
• Providing external access to PIM data is
complex and costly
• All roles across all industries are impacted: • Analysts, Admins, Stewards, Executives, etc
• Retail, Banking, Industrial, Telco, All
• Intuitive access to data with Google-like
capabilities
• Improved search performance with larger
volumes
• Externalize search and access outside of the
PIM system
• Data is pushed to Watson Explorer and indexed
for high performance
• New static search bar on every screen allows
users to search by key words and phrases
• Results page allows for opening a single record
or multiple records for mass edit
• Competitors are not currently offering a free
text search capability
• Watson search leverages best in class Big
Data capabilities for high performance
• Only offering to provide a true web experience
for search.
Free Text Search powered by Watson Explorer
6
Search Results
Open for single edit
or select multiple
for bulk edit
7
Type Ahead
Spelling correction
Search can be externalized for Enterprise access
Search directly on
Watson Explorer
10
Capabilities
Enable or disable through configuration file
Static search bar on every screen
Search examples
–Term
–Term1 OR Term2
–Term1 AND Term2 (Term1 Term2 also works)
–Catalog:CatalogName Term
–Combine the above for complex searches Search across specific catalogs
Launch in single edit or bulk edit
Limited use license of Watson Explorer included in the v11.3 bundle
11
InfoSphere MDM v11.3 (“Bali”)
Big Match
12
Matching Approaches
Deterministic
Rules-based
fuzzy matching
Apply logical rules
sequentially or
hierarchically
Deterministic
Rules-based
exact matching
Deterministic
Scoring-based
matching
Probabilistic
Self-learning
algorithms
Compare records attribute
by attribute.
Assign a score for each
attribute match.
If the total score is high
enough, they match.
How do you decide how
much a partial date match
should be worth? An edit
distance of 2? A nickname
match?
Account for misspellings
and typographical errors
Metaphone
Edit distance
The algorithm learns how
to score attributes from
the data itself
How common are
partial date matches
within your data?
How common are
nicknames?
Tuned to your data
Big Data needs more sophisticated
capability
13
Using out of the box fuzzy functions to enable accurate data
searching/matching in your Hadoop environment
Nov 6,
Phonetics
Mohammed vs.
Mahmoud
Synonyms
Andrew = Andy
George = Jorge
1st = First
Abbreviations
AIG = American
International
Group
Road = Rd
Concatenation
Van de Velde =
Vandevelde
Misalignment
Kim Jung-il =
Kim il Jung
Edit Distance
867-5309 ~ 876-
5309
Region Specific
トヨダ =
トヨタ株式会社
Date Similarity
01/01/1973 ~
01/03/1973
Proximity
Geocodes and
great-circle
distance
Noise Words
Roadster Inc. =
Roadster
Typographical
Errors
John Smith vs.
John Snith
14
C. Johnson
123 Main Street
512-545-1234
CRM Supply Chain
Fulfillment
Support Ticketin
g
External Sources
3rd Party
Chris Johnston
123 Main Street
512-554-1234
Shipping:
456 Pine Ave
Christine. Johnson
123 Main Street
Call length
Semi-structured notes
Satisfaction
C. Johnson
Main Street
512-554-1234
C. Johnson
125 Main Street
512-554-1234
ChrisJohnson65
“Likes” Clothes,
Camping Gear @ChristyJohnson65 Christy65
Circle / Network data
Order Mgmt.
Internal / Structured
External / Unstructured
Web
Big Match empowers customer analytics at Hadoop scale
Big Match
matches all
these records
Big Match combines the MDM probabilistic matching engine & pre-built algorithms & BigInsights for customer matching natively
within Hadoop
Increased Value of Customer only if…
Christine Johnson
Married
1 child
4/15/74 Christy65
Mail Order responder
Specialty Apparel
Partner Sales data
VIP: Gold
Customer Sat: 80%
Influence Score: 8/10
15
What is Big Match
Big Match allows you to run the MDM probabilistic
matching engine natively within IBM’s open source
Hadoop distribution (Infosphere BigInsights)
Your clients are implementing customer analytics projects using Hadoop today
Use Big Match to differentiate the IBM stack – no other vendor has it
Infosphere Master Data Management (Advanced
Edition, Standard Edition, Collaborative Edition)
16
Big Match as a foundation of your customer analytics in
Hadoop
Accurate – Matches via statistical learning algorithms based on your data
(customer see improvements between 5-15%)
Simple & Fast Time to Value - Hours to use configurable pre-built customer
algorithms, instead of weeks or months of developing code
Performance - Hours to match initial data sets of big data volumes via use
of MapReduce distributed processing
Proven - Leverages the experience of over 10 years and 900 customers
across worldwide deployments dealing with individuals and organizations
Is your client using Hadoop within customer analytics?
Then they need Big Match
17
InfoSphere MDM v11.3 (Bali)
IBM Stewardship Center for
- Physical MDM, Individual Domain
Three takeaways
1. Deliver a differentiating, prescriptive user experience for LOB users
WHY?
1. LOB users need to explore and discover how master data can help their business
2. Knowledgeable LOB users make the most informed data quality decisions & their
involvement increases their confidence in master data
20
Stewardship Center is a physical MDM application for LOB
users and stewards
Positioning – IBM Stewardship Center
21
What are the business or technical benefits of
solving this problem(s)?
What problem(s) and for whom (role, industry)
does this capability solve?
How is this capability different in solving this
problem than competitive offerings?
How does this capability solve this problem(s)?
• Data quality decisions are low quality because
they do not include LOB user insight
• LOB users do not trust master data because they
struggle to understand how their system’s data
contributes to the golden record
• Stewardship managers struggle to show their
team’s value & contribution to the business
• All roles across all industries are impacted: • Analysts, Admins, Stewards, etc
• Retail, Banking, Industrial, Telco, All
• LOB discovers MDM value by browsing and
investigating MDM, gaining new insights
• If issues are identified, LOB users can make master
data updates directly
• Direct data quality decisions to the right LOB users
at the right time
• Stewardship dashboard enables stewardship
managers to demonstrate efficacy and make
informed decisions to improve team performance
• IBM Design Thinking brings the prescriptive UX
necessary to leverage knowledge workers
• Enable data quality users to collaborate using
social and mobile features
• Business rules tailor which user is assigned a
given task based on task type and entity
segment
• The dashboard displays task breakdown and
team/individual performance by reporting from
the Stewardship Center’s data warehouse
• Prescriptive OOTB UX allows Stewards and LOB
users to commune on data quality decisions using
social collaboration and mobile
• Keep the business connected to master data with
mobile stewardship, approval, & notifications
• Dashboard allows for the quick assessment the
data quality metrics, team monitoring and rerouting
• Intelligent Inbox prioritizes work w/ auto-escalation
• Quickly extend or customize the Stewardship
Center’s WF or UI using MDM AT & IBM BPM
• Options for Cloud deployment
22
LOB Knowledge Workers LOB Owners & Governance Team IT Stewards
Web Mobile Social Collaboration
Data Quality Application
Business Processes Analytics
Stewardship Center
Customer Centricity Know Your Customer Operational Excellence
Dashboard Workflow & Rules
Comprehensive data quality application delivering business
confidence
Capabilities needed to deliver data quality to the business?
Ensure most knowledgeable LOB users contribute in quality decisions
Provide LOB users business context and prescriptive experience
Align stewards and LOB users to efficiently remediate data quality tasks
Include the right participant at the right time within the data quality process
Ensure task ownership and accountability with traceability
Demonstrate team performance and provide management insight
23
Stewardship Center keeps the business connected to master
data, driving ownership
LOB users
Explore, learn, and discover master data
– Discover relationships
– Data quality root cause analysis, take corrective action
– Master data survivorship
– Review/approval and notifications for critical data issues
– Stay connected with mobile stewardship
– Only view appropriate information
24
Stewardship Center increases data quality through LOB user
and steward collaboration
Stewards and LOB user
Cross team connectedness with social collaboration
Data quality workflow assigns tasks to the right user at the right time
Automate common decisions reduce time and cost of human involvement
Increase throughput by including LOB users while infusing business
knowledge into DQ decisions
Increase business confidence and ownership of master data
Prescriptive business tools for data maintenance and matching records
25
Stewardship Center provides visibility and insight to ensure
effective team performance
Data Steward Manager
Dashboard optimized for data steward manager activities
Quickly assess areas of risk and take corrective action
Identify active stewards and commune
Track at risk/high priority tasks for better team mgmt and resource loading
Identify data quality trend/bottlenecks and make informed improvements
Ensure task ownership and accountability along with traceability
26
Complete task
visibility
Identify data quality trends
View team and status
Commune with team
View team’s tasks and manage
Data Steward Management Dashboard
27
InfoSphere MDM v11.3 (“Bali”)
Powering Salesforce CRM Initiatives using
InfoSphere MDM
All CEOs; n = 229
Which technology-enabled capabilities will be an important area of investment to improve your business over the next five years?
Source: Gartner Report - CRM in a Sea of Change 2013
29
Gartner Technology Investment Survey 2013
What are the primary objectives of your 2013 CRM programs?
0 10 20 30 40 50
Improve customer data quality
Increase customer loyalty
Improve lead quality and conversion
Increase customer retention
Create a single view of the customer
Enhance cross-sell or upsell of products and services
Increase customer satisfaction
Increase sales revenue
Increase acquisition of new customers
Enhance customer experience
Percentage of Respondents
Revenue
Information
Loyalty/ Satisfaction
Revenue
Information
Loyalty/ Satisfaction
Source: Gartner Report - CRM in a Sea of Change 2013
n = 190
Gartner CRM Survey 2013: Top 10 CRM Objectives in the U.S.
30
Source: Aberdeen Group, July 2011
A CRM assessment report published by Aberdeen in 2011 showed that Peak CRM Performance is directly related to the accuracy
and availability of customer records
n = 261
Gartner believes that
bad data quality is the #1 reason
why CRM projects to fail
Data Quality and Accessibility – by Best in Class
31
CRM Magic Quadrant
32
The Solution
33
Customer Relationship Mgmt
Improve win-rates & seller productivity
Contact
• Identify duplicate customer and prospect records & reduce duplication at the point of entry
• Find the right customer faster by leveraging advanced search capabilities from MDM
• Enrich customer data in Salesforce with collective knowledge from internal & external data sources
• Identify relationships between customers/entities
InfoSphere MDM can help organizations optimize client-
focused initiatives by delivering a Single View of Customer
34
An InfoSphere MDM powered Salesforce initiative can deliver
real business benefits
35
InfoSphere MDM – SFDC solution capabilities
36
InfoSphere MDM Powered Probabilistic Search
Publish enriched master information from InfoSphere MDM to SFDC
Event Notifications
Search for Accounts from SFDC as well as from other sources
Enrich Account information in SFDC by leveraging the broader enterprise
master information in InfoSphere MDM
Automatically receive MDM events in SFDC whenever a SFDC record gets
added, updated or linked in MDM
Enhanced Security using WebSphere Cast Iron
Secure data and calls between InfoSphere MDM (on-premise) and SFDC (an
external SaaS application)
Perform Bulk data ops using InfoSphere Information Server (SF-Pack)
Perform Bulk data movement from MDM to SFDC using Information Server
(SFPack)
IBM MDM
No In-out bound No Inbound
IS-SP
HTTPS
JMS
Queue HTTPS
SSL Tunnel
1
2
3
1 Real-time Sync
2 Near Real-time Sync with reliable
queuing
1
3 Batch Import/Export using InfoSphere
Server Pack (IS-SP) for SFDC
Cu
sto
mer F
irew
all
InfoSphere MDM-Salesforce Integration Architecture
37
InfoSphere MDM v11.3 (Bali)
InfoSphere MDM and Information Server
Integration
User Technologies 41
An MDM Hub is only useful when integrated with the “Extended” enterprise. MDM projects require a high performance ETL solution for loading data
into and extracting data from an MDM hub. The ETL solution should be integrated so as to hide the complexity of the
underlying MDM data model and to make integration easy.
IBM MDM by itself, not provide a fully integrated, end –to-end integration , quality and data governance for master data
The bundled 3rd party Clover ETL is just a point solution for ETL Clover ETL is not integrated with other IBM IIS components MDM bundles only IIS for Data Quality
IBM MDM by itself, does not provide end-to-end metadata management for
MDM data Changes in the MDM model are not automatically shared across
integration components and governance tools. MDM assets are not „natively‟ available to the Information Server
Information Governance Catalog (IGC)
The Problem?
41
42
IBM IIS Enterprise Edition bundled with IBM MDM (with license restrictions) providing end to end ETL, governance and data quality
Out-of-the-box MDM Connector Stage within InfoSphere DataStage/QualityStage Designer that simplifies MDM load and extract for DS/QS developers
InfoSphere MDM metadata in IIS drives automation and makes integration configurable so very little development is required for MDM data load and/or extract
Enables configurable data integration for MDM Provides governance for the MDM information supply chain Provides design lineage through MDM metadata in IGC
Clover ETL is removed from MDM V11.3 as a bundled component
Customers can obtain licensing and support directly from Javelin in order to use existing Clover Graphs with MDM V11.3
The Solution?
42
User Technologies
Build a MDM metadata model to describe assets in the IS repository Enables all tools in the suite, particularly IMAM, IGC, DataStage and DataClick to
work with MDM assets Build export functions into the MDM workbench
Allows individual MDM users to document and manage their projects metadata assets for use by IS repository
Build a MDM Connector that can consume the MDM metadata and enable the memget and memput MDM interactions
memget and memput interactions are broadly used to read and write data resp. Implement the MDM Connector over the Java Integration Stage Leverage the Parallel engine capability
Bundle Information Server Enterprise Edition with MDM Limited license terms Gives MDM customers the relevant Information Server functions out-of-the-box
How are we integrating?
43
Metadata Admin DataStage/Quality Stage
Job Developer
XMeta (MDM, ASCL)
MDM Developer
MDM
Workbench
Exports the project
metadata as XMI
files for use by MDM
Connector
XMI Files
SCM or DevOps Repository
Import XMI files,
analyze, preview and
upload to Metadata
Server
IMAM Asset
Manager
[ MDM Design MetaData ]
DataStage/
QualityStage
Designer
Configure MDM
Stage with MDM Hub
Connection and other
settings
Query MDM
Model in
XMeta
MDM Model Bridge
XMeta (DSX)
Persist Stage
configuration in
DSX Model
Compile,
Deploy & Test
Jobs [ Job MetaData ]
MDM Connector Usage (Design time)
45
Questions?
46
47
Notable:
No support for Power Linux
Windows only supported for
Standard Edition
WAS only (started
previously)
No FireFox support for
Collaborative Edition
Information Server v11.3
release has two delivery
tiers (platforms)
Latest information online – Refer to InfoSphere MDM System Requirements on ibm.com
Component Flavor Version(s) Notes
Operating
System
AIX (Power)
Solaris (SPARC)
Linux (RHEL)
Linux (SLES)
zLinux (RHEL)
zLinux (SLES)
Windows
v6.1 and v7.1
v10
v6
v11
v6
v11
2008R2, 2012
X86-64 only
X86-64 only
SE ONLY
App Server WAS v8.5.5
Database DB2 LUW
DB2 for z/OS
Oracle
v10.1, v10.5
v10.1, v11
11g R2, 12c
Web Browser IE
FireFox
9, 10
ESR 24
Not CE
Other Information Server
MQ
BPM
Portal Server
v11.3
v7.5
v8.5.0.1
V8.0.0.1
Tiered Release
InfoSphere MDM v11.3 – Supported Platforms
48
Makes it easier for customers to
deploy an MDM environment (e.g.
Development)
There should not be an
expectation that all deployment
scenarios are supported with
the Supporting Programs …
Additional licensing (e.g. RAD)
is often required!
Limitations/Restrictions:
– Primary Limitation – Only when in
support of InfoSphere MDM
– See LI for others
Important Note: The LIs for the
Supporting Programs is also in
effect
Supporting Programs for InfoSphere MDM v11.3
IBM Rational Application Developer for WebSphere Software v9.0
IBM DB2 Enterprise Server Edition V10.5
IBM Content Integrator 8.6
IBM Cognos Business Intelligence V10.1.1
IBM Cognos Business Intelligence Modeling v10.1.1
IBM Cognos Business Intelligence Samples v10.1.1
IBM Cognos Supplementary Language Documentation v10.1.1
IBM Process Server Standard v8.5
IBM Process Server Standard for Non-production Environment v8.5
IBM Process Center Standard v8.5
IBM Process Designer v8.5
IBM InfoSphere Information Server Enterprise Edition v11.3 (for
MDM Editions)
IBM InfoSphere Information Server for Data Quality v11.3 (for RDM
and CDH Stand-alone)
IBM InfoSphere Data Explorer v9.0
IBM InfoSphere BigInsights Standard Edition v2.1.2
IBM InfoSphere Blueprint Director v2.2
IBM WebSphere Message Broker v8.0.0.3
IBM WebSphere Message Broker Connectivity for Healthcare v8.0
IBM WebSphere Application Server Network Deployment V8.5.5
IBM WebSphere Application Server Base 8.5.5
IBM WebSphere MQ V7.5
IBM WebSphere Portal Server 8.0.0.1
IBM Installation Manager & IBM Packaging Utility for Rational Software
Development Platform v1.7
IBM Security Directory Server v6.3.1
IBM Support Assistant Data Collector v2.0.1
Supporting Programs – v11.3