data analytics tools - amazon s3 · 2016-06-06 · what is data analytics? •analytics is not a...

28
Data Analytics Tools

Upload: others

Post on 25-May-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Data Analytics Tools

Page 2: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

What is Data Analytics?

• Analytics is not a technology – it’s a CONCEPT

• It refers to the use of certain technologies, skill sets and processes for the exploration, evaluation and investigation of business operations

• It is the use of raw data to produce insights or conclusions that can be acted upon

• The practice of Analytics make extensive use of data, statistical and quantitative analysis, confirmatory data analysis, as well as data management

2

Page 3: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Capabilities of Data Analytics

Hindsight

• What happened and why?

Insight

• Where is the problem and what action needs to be taken to solve it?

Foresight

• What will happen if these trends continue?

3

Facts Understanding Knowledge

Page 4: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Data Analytics: What is It?

Traditional Business

Intelligence

Increasing business valueTransactional

An

alyt

ical

Mat

uri

ty

Strategic

• Data integrity & quality

• Basic employee lists & extracts

• Understanding business

parameters

• Visualizing transactions

• Integrated analytics

• Multiple sources of data

• Forecasting and predicting

future outcomes

• Modelling and

understanding correlations

and causalities

• Simulate and

experiment with

possible scenarios

• Optimize efficiency

and resource usage

What is happening?

Why is it happening?

What might be happening?

Basic KPI Reporting

Visual data exploration

Segmentation

PredictiveModelling

Simulation &Optimization

• Understand groups

and outliers

• Discover and target

opportunities

Page 5: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Data analytics can be broadly categorized into a three layered process

5

ETL (Extract, Transform, Load) AnalyzePresent

Page 6: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

ETL (Extract, Transform & Load) Process

6

Page 7: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Example of ETL layer Analytics Tool

7

Page 8: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

What do Consultants need to know about ETL layer

8

To be honest, not much!

Unless you are a system design consultant, chances are you will never have to deal directly with an ETL Tool. In most cases, clients will already have their ETL setup and will just provide you with the data.

Key Learning points:

• Know the language. Next time your client says their ETL system is not robust, you know what they are talking about

• Ask them about the data format- it is important to understand the format in which data is extracted from the ETL system. Is it text based, Is it excel, is it already in tabular format, what additional transformations are required, etc.

Page 9: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

A few new ETL Tools (Will add, maybe)

9

Page 10: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Analysis process

10

Page 11: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Example of Analysis layer Analytics Tool

11

Page 12: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Factors to consider while choosing Analysis Analytic tool

12

• Data size and format

• In memory vs. HD storage

• Usage Complexity vs. available talent

• Cost

• Ad-hoc vs. continuous usage

• Integration with ETL & Presentation layer

Page 13: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Comparison of some most used Analysis Tools

13

Comparison points based on previous slide

Page 14: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Sample Scenario1- In house analytics for limited data size and ad-hoc use

14

Page 15: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Sample Scenario 2- Continuous analytics for a client for large sized data

15

Page 16: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Analysis Tools to look out for

16

Page 17: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

The Presentation Layer

17

Page 18: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Example of Presentation Layer Analytics Tool

18

Page 19: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Factors to consider while choosing Presentation tool

Visual analysis Information delivery

Used by analysts Used by operational workers

Test hypotheses Predefined subjects and metrics

User driven interaction Restricted interaction

Linked charts show multiple data aspects Individual familiar charts

Smaller learning curve Build from prepackaged components

Preferable for smaller teams Push reports to the enterprise

Visualization focused vendors Part of an enterprise BI stack

Used for exploration Used for monitoring

Some statistical and manipulation capabilities Analysis outside the tool

Requires data exploration skills Requires developers, SQL skills

19

Exploratory Analysis BI & Dashboard Reporting

Choosing the appropriate tool requires considering the purpose of the visuals, the users who consume the data and integration with advanced analytical components. These considerations can be mapped to the spectrum attributes to target the tool search process.

Page 20: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

20

Where do we place each visualization tool?

20

AnalyticVisualization

DashboardReporting

BI Platforms Spotfire

QlikView

MicroStrategy

Tableau

Exploratory Analysis BI & Dashboard Reporting

• The spectrum region where QlikView lies overlaps each category, since it also supports dashboards

• Tableau’s strength is analytic visualization, while its BI functionality is more limited

Page 21: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Sample Scenario 1 with Visualization example for Tableau

21

Page 22: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Sample Scenario 2 with Visualization example for QlikView

22

Page 23: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Hybrid Tools to look out for

23

Page 24: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Best practices

• Know what you want : Have a detailed program enlisting all the procedures

• Get the data format right: Provide list of key fields, layout

• Agree on a source document: A financial report, MIS report, System control totals – screen shots should

be determined before hand to check for completeness of data

• Extract Data much before: Check what happens when huge data is extracted. Truncation,

Inconsistent/Incorrect values

• Convert data into useable format: Have your tools ready to convert data into a format that can be

analyzed. Check for data consistency (related/paired fields) and data accuracy (data integrity of key fields)

before starting your analysis

24

If you don’t have right data;

You don’t have right answers

Page 25: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Challenges

•Clients: Why is data required? What do I get in return?

•System: Getting entire data becomes a challenge due to slow systemperformance and/or lack of knowledge on data extraction techniques

•Format of Data: Data files get extracted in a format which cannot beanalyzed. E.g. Report files in text, PDF files, Excel files (these too are notalways the best)

•Volume of Data: Data files are provided in multiple files with differentformat and layouts; incomplete files; truncated data

25

Page 26: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Challenges (contd.)

•How would you know if your data is:

• Complete (nothing was missed in transition – client system to data file to your system)

• Accurate (Represents what you want to test – has requisite key fields with the values you expect)

• Consistent (Paired/Related Fields have values that makes sense – is consistent with business logic)

26

Know your needs Get the Right Data

Page 27: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Analytics – a journey and not a destination

• Start where you are: Assess your current capabilities and get a clear picture of gaps

• Know which questions matter most to your industry, strategy and priorities

• Accelerate insights through automation. Automate delivery of the information – to management, operational and analytical processes

• Engage and visualise: Output must deliver insights people need – in whatever forms required – to make fact based decisions

• Develop a fact driven culture. Embed analytics capabilities into decision making processes matching your approach with your style

• Practice “right fit” analytics: Match statistical and analytics techniques to the job at hand

27

Page 28: Data Analytics Tools - Amazon S3 · 2016-06-06 · What is Data Analytics? •Analytics is not a technology –it’s a CONCEPT •It refers to the use of certain technologies, skill

Conclusion