high precision matching at the heart of mdm
TRANSCRIPT
High precision matching at the heart of Master Data Management
Holger Wandt, Director Thought Leadership & Education, Neopost/Human Inference
Gartner MDM Summit, 12 March 2015
Human Inference –a Neopost company
• More than 25 yearsexperience
• Experts dedicated to DQM and MDM
• European Company
• Solutions based on naturallanguage processing
Human Inference helps organizations to benefit from personal and relevant interaction, based on trustworthy
information.
0
1
0
1st
Qtr
E
W
N
3
4
Single customer view 1960
5
Single customer view 2015
6
A combination of technology, processes and services to deliver an accurate, timely and complete view of the customer across multiple channels, lines of business, departments and divisions drawing customer data from multiple sources and systems.
Matching is key!
MDM for customer data
Matching is a “catch-all” term for comparing records, detecting duplicates and combining them into the “true” version of a record (Golden record)
Matching is a complicated process:
• Different data formats
• Volatility of customer data
• Human errors
• Expanding internationalisation, i.e. linguistic diversity
• Industry- or company-specific requirements
Matching customer data across heterogeneous sources
• Deterministic matching: Country- and subject-specific knowledge, business rules, phonetic conversion and comparison, algorithms:
EVO AG Energieversorgungsgesellschaft Ost
• Probabilistic matching: Fuzzy logic, fault-tolerance algorithms, contextual frequency rules, patterns
Jack London Ltd., Thompson London Ltd.
To deliver the best required result, matching engines must combine both approaches. The better a matching is able to determine what is what in a particular context, the better the probability calculation of a certain match or a certain non-match. It is, in essence, the same way we humans apply matching.
High precision matching: combining adeterministic and a probabilistic approach
Mimicking human interpretation of customer data is the road to successful high precision matching: What is what?
We saw the girl with the binoculars…
Genco International Ltd.Orchard Brae House30 Queensferry Road
Edinburgh EH4 2UZ
+44 (0)131 343 7176
4518 Madame Françoise Durand
Int. Transp. Comp. J. Blackpool Aberdeen Ltd.
JJCAD John James Computer Aided Design
CQCS International Consulting Ltd.
16
Peter Arnold Frank
Mohammad Ouazzani Benhaddou
Demis Archontakis
John J. Farren jr. PhD
Dr. John J. Pharran jr.
Natural Language Processing
• Test with a lot of of real data
• Profile data sources in order to assess accuracy and completeness of data
• Compare and benchmark matching results
• Combine deterministic and probabilistic approaches
• Consider the interface for data stewardship (doubtful matches)
• Consider current AND future requirements
• Think across the border
To keep in mind when assessing matching capabilities for your MDM solution
The quality of your MDM solution is only as powerful as the quality of your matching engine
•A closer look…….
More information?
Visit us at booth PL6
Utrechtseweg 310, Building H31 6812 AR ArnhemThe Netherlands
Dr. Holger WandtDirector Thought Leadership& Education [email protected]
Phone: + 31 (26) 355 06 55 Fax: + 31 (26) 355 06 66
Mobile +31 (6) 22477594 www.humaninference.com