context is everything - aairaair.org.au/app/webroot/media/sigs/sig forum 2013... · • the data...

29
www.certussolutions.com Context is Everything Using policies, data quality and stewardship to govern information Ashwin Sinha Practice Manager – Information Management [email protected] Date: August 2013

Upload: others

Post on 30-May-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

www.certussolutions.com

Context is Everything Using policies, data quality and stewardship to govern information

Ashwin Sinha Practice Manager – Information Management [email protected] Date: August 2013

Page 2: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

• Information Governance

• Business Glossary

• University example

• Context of Data Quality

• Data Quality Framework

• Certus in University

• Questions

Agenda

Page 3: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Six Steps to Governance

1. Set your Goals - the core statements that guide the operation and development of the

information supply chain.

2. Define Your Metrics - the set of measurements used to assess the ongoing effectiveness of

the program and associated governance processes.

3. Make Decisions - the organisational structure and changing ideological model to analyse and

make policy decisions.

4. Communicate Policy - the tools, skills and techniques used to communicate policy decisions to

the organisation.

5. Measure Outcomes – Compare policy results with goals, inputs, decision models, and

communication to provide constant feedback on policy effectiveness.

6. Audit results – the tool you use to benchmark everything.

Page 4: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Information Governance – Key areas

Page 5: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Information Governance – Operational

Planning Design

Taxonomy Identify

Stewards Glossary Training

Monitor Quality and Compliance

Scoping

Business Terms

Data Quality

Security and Classification

MDM

Define Framework

Profiling Assess Issues

Implement Rules

Implement Scorecard

Planning

Planning Define Process

Publish Dictionary

Publish Policies

Pilot MDM Profiling

Reference Data Management

Import Policies

Import Rules

Connect Rules to Terms

Implement Metrics

Implement Scorecard

Business Case

Charter Organisation

Structure Deploy

Stewards Reference

Architecture Governance

Import Terms

EDW Planning Define Process

Scope Industry Model

Customise Industry Model

Deploy Model to Glossary

Stewardship and Review

Deploy KPI Monitoring

BAU

Page 6: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Definition of a Glossary

• What is a Glossary?

• Most Glossaries are labels and descriptions stored in PDF, Word or HTML format.

• There is a lack of consistency, ownership and re-use across these Glossaries. They are expensive to build and maintain.

Page 7: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Glossary for Universities

• Student ?

• EFTSL – Equivalent Full Time Student Load. That is, a full time student with 24 subject units will have an EFTSL of 1, and a half time student will have an EFTSL of 0.5

• An EFTSL - or Equivalent Full Time Student Load - is a representation of the amount of load a student would have when studying full time for one year. At Curtin 200 credit points equates to 1.0 EFTSL.

• The measure used to determine a student's enrolled load. One 'EFTSL' is the amount of student load determined by the University to be equal to a full-time load for one student for one year, and is expressed at the University as 36 units.

Page 8: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

More on same term

At the University of Sydney:

• 1 EFTSL is equivalent to 48 credit points (1 year of full-time study)

How do I find out the EFTSL of a unit of study?

• The EFSTL a unit of study can be determined by dividing the unit credit point value by 48

Can we have a consistent meaning of above terms always, in absence of context?

Page 9: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Why Use a Glossary?

Page 10: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Central Nervous System

Data Integration

Data Quality

Master Data Management

Big Data Analytics

Information Lifecycle

Management

Privacy & Security

Policies Glossary

• The Central Nervous System receives information from all parts of the body and coordinates activity.

Page 11: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Giving Context to Information

• Under a traditional documentation and development approach artefacts decline in value

• In a Metadata Driven architecture Business Glossary and data lineage becomes more valuable over time.

Contain inaccuracies

Become Stale Get Lost

Somewhere Represent old

standards

Retains SME Knowledge

Shared across projects

Reflects the real world

Drives Change Management

Page 12: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

DQM – Why?

Benefits of undertaking a dedicated data quality management (DQM) initiative as part of the Information Management programme include:

– Engagement on DQM as a strategic issue throughout the enterprise

– Cost-effective actions, targeted at the most critical issue

– Supports treating information as a strategic asset that supports business process

execution

– Measurable outcomes. Providing context to Data Quality issues.

– Sustainability for the long term

– Assurance that business value & benefit meets the initial business case

– Clear roadmap for future progress

Page 13: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Data Quality Framework

Information Analyzer

Data Rule and Exception

Management

Data Cleansing

And Mastering

Data Transformation

validate

cleanse &

enrich

“master”

assess &

discover define

objectives

report monitor /

track

Data Profiling

And Analysis

Exception Reporting

Business Glossary

Policies

Data Quality Scorecard

• Mobilising People, Processes and Tools to improve the quality of information

Page 14: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

1

4

Improving Data Quality Safeguards Data Quality Scorecards are part of a framework to reduce risk:

• The Data Quality Scorecard is a measure of problems, the overall Information Management framework of Business Glossary,

Metadata reporting and exception handling provides the support for handling these problems.

• A Scorecard is the ongoing audit that data is safe to use and the safeguard against complacency.

Data Lineage: where did the problem

occur?

Stewardship: who is responsible for this

data?

Business Glossary: what do we know about this data?

Exception Management: what do we do about

this problem?

Page 15: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

DQM – How (Scalable Approach)

Scope

• Define DQM and boundaries

• Ensure stakeholder buy-in

• Define data domains

• Identify DQ Issues

Measure

• Establish DQ analysis environment

• Profile data

• Capture business rules

• Root cause analysis

• Quantify business impacts

• Qualitative

• Quantitative

• Agree action plan

Action

• Establish DQ resolution environment

• Address DQ issues

• People

• Process

• Technology

• Establish DQ prevention activity

• People

• Process

• Technology

Operate

• Embed DQM as BAU

• Extend and evolve DQM across organisation

• Extend and evolve as part of approach to Information Management

Page 16: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

DQM – How (Scalable Approach)

Page 17: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

DQM – Keys to Success

There are many factors to ensure the success of establishing and operating a DQM capability:

– DQM needs to be clearly linked to business objectives with business sponsorship

– Clearly articulated business impacts both quantitative as well as qualitative

– Clearly understand root causes and costs to remediate / prevent

• People, Process and Technology

– Start with high value and low cost to demonstrate value and ensure ‘right-sized’

approach

– Define KPI’s that leverage D.A.T.A.

• Digestible,

• Actionable

• Timely

• Auditable

Page 18: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Certus provides the framework

What rule sets are available? A combined set of over 250 data quality rules and over 200 Business Glossary terms for Education sector developed and provided by Certus and IBM is available to kick start a project. Examples of the rules available include:

• Party/person rules - validation of an actual person's name, gender and age. Some examples of this are reasonableness checks on whether name is in a valid format, age range is valid, etc.

• Address rules - validation of the various parts that make up the address, and reference checks of cities and post codes.

• Email and web rules - validate the format of email addresses, email domain names, host names, URLs and IP addresses.

• Field format rules- A large number of generic field format validation rules that cover a range of different field types such as indicators, codes, dates, numeric formats, etc.

Page 19: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Certus DQF Overview

Page 20: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Data Quality Framework

The artefacts within the Data Quality Framework provide the client with a top down understanding of the implementation of the Data Quality Process. The Framework gives you an understanding of the roles, responsibilities, tasks and processes involved in the implementation.

The DQF Hierarchy Tree defines the high level steps required for the deployment of the framework. These steps then break down into supporting artefacts at the median level, providing the client with:

– Business Process Diagram for each main level in the Hierarchy tree.

– RACI document which details the tasks and technologies involved in each level of the hierarchy tree and roles and responsibilities across each task.

– Project Plan template

Technical artefacts are also included that detail each step involved in the process.

Page 21: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Certus DQF Framework

Page 22: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

DQF Framework Components

• Level 1 CDQF Hierarchy Tree - Top level processes involved in the framework.

• Level 2 Detailed artefacts supporting the top level processes

• RACI Document Roles – An outline of the roles required within your organisation for this initiative. Responsibilities – responsibilities outlined and assigned by role. Technologies – technology components required in each step.

• Project Plan The project plan provides a template for deployment covering each task within the Framework broken down by duration and role involved.

• Business Process Method Map

Provides a visual representation of the flow of tasks throughout each business process.

Page 23: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

DQF Technical Components

DQF Data Model • The DQF data model provides the underlying store for the metadata that is

transferred throughout the framework.

Framework Integration Components The components below make up the technical plumbing of the Framework.

• ETL (IBM DataStage)

• Database Scripts (DB2, Oracle, SQL Server)

• Operating System Script (Linux, Windows)

Certus DQ Dashboard and Active Reporting Solution • High Level Dashboard

• Reporting that gives you the ability to drill down into the high level metrics.

• Available for iPad for mobility

Data Rule Sets • Data rules sets are available to assist in the development of your own internal data

rules. They give you a head start with common rules found in separate data sets i.e. Name and Address.

Page 24: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Educational Experience

Who we’ve been working with...

James Cook University

NSW Board of Studies

Christchurch Polytechnic

Institute of Technology

Edith Cowan University

Deakin University

Griffith University

Massey University

Ministry of Education

NSW Department of Education & Communities

NSW TAFE

Open University

Study Group International

Southern Cross University

University of Auckland

University of South Australia

University of Technology Sydney

University of Wollongong

Two examples

James Cook University

As part of Information to Analytics project

Certus team has worked with JCU to establish

an information management framework, BI

reporting and Budgeting & Planning. Solution. It

also addresses some of the early Information

Governance requirements.

Open University

The key development work was on student

enrolments and the cost of running particular

courses. It was enrolment-driven, looking at

the individual subject units, and the student

profiles which may contribute to additional

course requirements.

Page 25: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

What we do

ASSET MANAGEMENT

INFORMATION MANAGEMENT

INFRASTRUCTURE SERVICES

MIDDLEWARE SOLUTIONS

COLLABORATION SERVICES

WEB SOLUTIONS

SUPPORT SERVICES

LICENSING

Enhance operational efficiency and optimise management processes in your asset intensive business

Create, utilise and maintain trusted information assets with IM strategies and solutions

Maintain system performance and unleash productivity with reliable, on-demand business infrastructure

Build enterprise applications and remove information silos for improved business agility

Connect people, share information and empower your organisation to work smarter

Drive improved business performance with unique user-centred design methodologies

Subscribe to flexible IBM software support and access expert technical resources to maintain operational efficiency

Extract maximum value from your IBM software with effective, compliant licence management

BUSINESS ANALYTICS

ENTERPRISE ARCHITECTURE & SDLC

Align corporate strategy with business process, information, applications and technology

Deliver complete, consistent and accurate information to decision-makers for improved business performance

Page 26: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

Thanks & Questions

Page 27: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

FAQ

• How dependent is the solution on customer having key components of IBM InfoSphere and IBM Cognos? Can the solution be ported to other technologies with a limited effort?

To realise the full benefits of the solution the customer would need to be an IBM Information Server and Cognos user.

The framework is also useful to clients who wish to implement a DQ project on separate technology. The architectural and project management components give the customer a generic framework to implement there project.

Page 28: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

FAQ

• How flexible is the DQ rule creation and editing for future maintenance?

The data rules that are created for the project are easily edited from the IA interface.

• What sort of support would be needed to keep the solution running?

Once the technical components of the project are productionised a number of roles will have to be filled to ensure an ongoing data quality improvement. A technical resource to monitor the execution and deal with issues that arise. A business resource from each area to ensure the data quality results are monitored and initiatives are taken to ensure the improvement of that data over time.

• What will justify the ROI business case for a customer buying this?

In our experience full implementation of the Data Quality Framework would save the customer up to 12 months of development effort getting it off the ground from scratch.

Page 29: Context is Everything - AAIRaair.org.au/app/webroot/media/SIGs/SIG Forum 2013... · • The Data Quality Scorecard is a measure of problems, the overall Information Management framework

FAQ

• What the pre-requisites on software, hardware and discovery to finish this activity?

At a generic level the framework is a tool to manage and plan a data quality project.

At the full technical level the Framework requires IBM Information Server Components;

IBM Information Analyzer V8.7 min

IBM Datastage V8.7 min

IBM Business Glossary V8.7 min

IBM Cognos User licences V10.1 min

• How do we ensure that the solution is flexible for customers with different data quality needs and different types of data?

The framework has been generically designed it provides a structure to implement over an industry. It is not s rigid structure or formulated from an industry specific point of view.