Introducing Data Quality Services and its role in an Enterprise Information Management (EIM) Process James Beresford Group Manager, Avanade DBI217

Post on 17-Jan-2016

Introducing Data Quality Services and its role in an Enterprise Information Management (EIM) Process

James Beresford
Group Manager, Avanade

DBI217

Who am I?… and what am I doing here?

Over a decade of MS BI & DW Experience
Blogger @ www.bimonkey.com
Tweeter @BI_Monkey

Introducing Data Quality Services and its role in an Enterprise Information Management (EIM) Process

Goals of session:

• Introduce DQS
  • Concepts
  • Terminology

• Showcase
  • Automation
  • Integration into an EIM process

Data Quality Services and its role in an Enterprise Information Management (EIM) Process

Outcomes for you:

• Manage data quality
• Automate data cleansing
• Support Master Data Management

The Data Quality Problem

!=

Hands up if you’ve been 2 years old

Your first data quality problem

Data Quality = Shape Sorting

There is Good Data

There is Bad Data

There is Repairable Data

Data Quality Services Terms
Collection of Shapes = Knowledge Base

Data Quality Services Terms
A Shape = Domain

Data Quality Services Terms
Shape Sorting = Domain Rule
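
The shape-sorter analogy can be read as a small data model. The Python sketch below is purely illustrative and does not use any DQS API: the class names `Domain` and `KnowledgeBase`, the "Toys" knowledge base and the "Colour" rules are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Domain:
    """One 'shape': a single kind of data, e.g. a colour or a postcode."""
    name: str
    # Domain rules: each returns True when a value fits the shape.
    rules: List[Callable[[str], bool]] = field(default_factory=list)

    def is_valid(self, value: str) -> bool:
        # A value fits the domain only if every rule accepts it.
        return all(rule(value) for rule in self.rules)


@dataclass
class KnowledgeBase:
    """The 'collection of shapes': a named set of domains."""
    name: str
    domains: Dict[str, Domain] = field(default_factory=dict)

    def add(self, domain: Domain) -> None:
        self.domains[domain.name] = domain


# Hypothetical knowledge base with one domain and two simple rules.
kb = KnowledgeBase("Toys")
kb.add(Domain("Colour", rules=[lambda v: v.isalpha(), lambda v: len(v) <= 10]))

print(kb.domains["Colour"].is_valid("Red"))  # True
print(kb.domains["Colour"].is_valid("R3d"))  # False
```

In this reading, "shape sorting" is simply asking each domain whether a value passes its rules.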

The Data Quality Client

Allows you to:

• Create or Maintain
  • Knowledge Bases
  • Data Quality Projects

demo

Create a Knowledge Base using Knowledge Discovery

The Data Quality Client
Create a Knowledge Base

What did we see?

• Create a Knowledge Base from Data using Knowledge Discovery
• Set values as Correct, Error or Invalid
• Teach the KB some simple auto correction using Domain Rules
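
As a rough mental model of what the demo set up, each domain value carries a status and, where known, a replacement. The sketch below is an assumption-laden illustration in plain Python, not DQS code: the `Status` enum, the `DomainValue` tuple and the sample "Country" values are hypothetical.

```python
from enum import Enum
from typing import Dict, NamedTuple, Optional


class Status(Enum):
    CORRECT = "Correct"
    ERROR = "Error"      # a known wrong value, e.g. a misspelling
    INVALID = "Invalid"  # a value that does not belong in the domain


class DomainValue(NamedTuple):
    status: Status
    correct_to: Optional[str] = None  # replacement, if one is known


# Hypothetical contents of a "Country" domain after knowledge discovery.
country_values: Dict[str, DomainValue] = {
    "Australia": DomainValue(Status.CORRECT),
    "Austrlia": DomainValue(Status.ERROR, correct_to="Australia"),
    "N/A": DomainValue(Status.INVALID),
}


def cleanse(value: str) -> str:
    """Return the corrected value when the domain knows one, else the input."""
    known = country_values.get(value)
    if known is not None and known.correct_to is not None:
        return known.correct_to
    return value


print(cleanse("Austrlia"))  # Australia
print(cleanse("N/A"))       # N/A (flagged invalid, no correction known)
```

Values the domain has never seen pass through unchanged here; in practice those are the ones a data steward would review and classify.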

The Data Quality Client
Create a Knowledge Base

What can we do?

• Keep our knowledge about data quality in one location
• Accept, Correct or Reject values
• Update it as an ongoing process

The Data Quality Client
Pop Quiz: Who owns a Knowledge Base?

Maintaining KBs is done by:

a) Qualified Data Professionals
b) A BI Developer
c) Users
d) Microsoft

The Data Quality Client
Create a Composite Domain

+ n

demo

Create a Composite Domain with a Domain Rule

The Data Quality Client
Composite Domains

What did we see?

• Use values from one domain to interact with another

The Data Quality Client
Composite Domains

What can we do?

• Manage interdependent data fields for quality purposes
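
To make "interdependent data fields" concrete, here is a minimal Python sketch of a cross-field rule. It is not how DQS implements composite domains; the City and State fields, the `CITY_TO_STATE` lookup and the function name are hypothetical examples.

```python
from typing import Dict, Optional

# Hypothetical cross-domain rule: which State each City must belong to.
CITY_TO_STATE: Dict[str, str] = {"Sydney": "NSW", "Melbourne": "VIC"}


def apply_composite_rule(record: Dict[str, str]) -> Dict[str, str]:
    """Correct State from City when the rule knows the city."""
    fixed = dict(record)  # copy so the input record is left untouched
    expected: Optional[str] = CITY_TO_STATE.get(record.get("City", ""))
    if expected is not None and record.get("State") != expected:
        fixed["State"] = expected
    return fixed


print(apply_composite_rule({"City": "Sydney", "State": "VIC"}))
# {'City': 'Sydney', 'State': 'NSW'}
```

Neither field is wrong in isolation ("Sydney" is a valid city, "VIC" a valid state); only the combination is, which is the case a single-domain rule cannot catch.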

The Data Quality Client
Other Domain Functions

Things that I won’t demonstrate

• Term Based Relations – autocorrect substrings (e.g. Inc. > Incorporated)

• Reference Data – Validation against external sources on Azure DataMarket, e.g. Melissa Data
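
The term-based relation idea (replace one term with another wherever it appears in a value) can be sketched in a few lines of Python. This is an illustration only, not the DQS implementation; the `TERM_RELATIONS` table and the sample company names are hypothetical.

```python
from typing import Dict

# Hypothetical term-based relations: term -> replacement.
TERM_RELATIONS: Dict[str, str] = {"Inc.": "Incorporated", "Ltd.": "Limited"}


def apply_term_relations(value: str) -> str:
    """Replace whole whitespace-separated terms, leaving other words alone."""
    return " ".join(TERM_RELATIONS.get(term, term) for term in value.split())


print(apply_term_relations("Contoso Inc."))  # Contoso Incorporated
print(apply_term_relations("Incline Ltd."))  # Incline Limited
```

Matching on whole terms rather than raw substrings keeps a word such as "Incline" from being rewritten. Note that this sketch also collapses repeated whitespace as a side effect of splitting and rejoining.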

The Data Quality Client
Data Quality Projects

Application of a Knowledge Base through the DQS Client

• Interactively process data
• Output results

demo

Create a Data Quality Project

The Data Quality Client
Create a Data Quality Project

What did we see?

• Interactive Cleansing of data
• Export of cleansed data set

The Data Quality Client
Create a Data Quality Project

What can we do?

• Clean Data
• Export results

The Data Quality Client
Other DQS Features

Things that I won’t demonstrate

• Matching
• De-Duplication
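
For readers unfamiliar with the idea behind matching and de-duplication, the following Python sketch shows the general technique of grouping near-identical values by a similarity score. It uses the standard library's `difflib` as a stand-in; DQS matching is configured through its own matching policies and this is not a description of how it works internally. The 0.9 threshold and the sample names are assumptions.

```python
from difflib import SequenceMatcher
from typing import List


def similarity(a: str, b: str) -> float:
    """Case-insensitive similarity score between 0.0 and 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def deduplicate(values: List[str], threshold: float = 0.9) -> List[str]:
    """Keep the first value of each group of near-duplicates."""
    survivors: List[str] = []
    for value in values:
        # Drop the value if it is close enough to one we already kept.
        if not any(similarity(value, kept) >= threshold for kept in survivors):
            survivors.append(value)
    return survivors


print(deduplicate(["Contoso Ltd", "Contoso Ltd.", "Fabrikam"]))
# ['Contoso Ltd', 'Fabrikam']
```

The threshold is the main design choice: set it too low and distinct records merge, too high and obvious duplicates survive.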

Automation with SSIS

Automation with SSIS
The DQS Cleansing Task

demo

Automation with SSIS and the DQS Cleansing Task

Automation with SSIS
The DQS Cleansing Task

What did we see?

• Automatic Cleansing of data
• Managing different results
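
"Managing different results" amounts to routing each cleansed row according to the status the cleansing step assigned to it, much as a conditional split would in a data flow. The Python sketch below illustrates that routing logic only; it is not SSIS code. The "Record Status" column name, the status labels and the three destination names are assumptions to check against what your own package emits.

```python
from collections import defaultdict
from typing import Dict, List

Row = Dict[str, str]

# Assumed status labels and hypothetical destinations for each.
ROUTES: Dict[str, str] = {
    "Correct": "load",
    "Corrected": "load",
    "Auto suggest": "review",
    "New": "review",
    "Invalid": "reject",
}


def route(rows: List[Row]) -> Dict[str, List[Row]]:
    """Group rows by destination; unknown statuses go to review."""
    out: Dict[str, List[Row]] = defaultdict(list)
    for row in rows:
        out[ROUTES.get(row["Record Status"], "review")].append(row)
    return dict(out)


rows = [
    {"Country": "Australia", "Record Status": "Correct"},
    {"Country": "Australia", "Record Status": "Corrected"},
    {"Country": "Atlantis", "Record Status": "New"},
    {"Country": "N/A", "Record Status": "Invalid"},
]
for destination, batch in route(rows).items():
    print(destination, len(batch))
```

Sending unrecognised statuses to review rather than dropping them is a deliberately cautious default for this sketch.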

Automation with SSIS
The DQS Cleansing Task

What can we do?

• Clean data in an integrated manner
• User input to data quality can affect DW
• Updating DQ is independent of updating ETL

Data Quality Services

Master Data Services

Integration Services

EIM: Credible, Consistent Data

Master Data Services (MDS)

MDS Provides

• A central store of reliable data
• Web and Excel UIs
• Easy access to Master Data

Loading MDS
Creating our Entity

Updating MDS
Handling new, trusted data

Feeding from MDS
Publishing with Subscription Views

?

demo

Integrating DQS processes with MDS processes

DQS + MDS
As part of an EIM process

What did we see?

• Knowledge from DQS captured in MDS using Excel
• New knowledge
  • captured in DQS
  • transferred to MDS using SSIS

MDS
As part of an EIM process

What can we do?

• Use DQS to provide the foundation for Master Data
• Use DQS as a means of providing updated Master Data
• Have a user-driven EIM process

Introducing Data Quality Services and its role in an Enterprise Information Management (EIM) Process

Goals of session:

• Introduce DQS
  • Concepts
  • Terminology

• Showcase
  • Automation
  • Integration into an EIM process

Introducing Data Quality Services and its role in an Enterprise Information Management (EIM) Process

You can find me at the speaker meet & greet – 11.00-11.30, speakers lounge, Thursday 13th

And follow me at:

• Blogger @ www.bimonkey.com
• Tweeter @BI_Monkey
• MSDN SSIS & DQS Forums

Introducing Data Quality Services and its role in an Enterprise Information Management (EIM) Process

Q&A

© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.