data mining (part i)

11
Rodrigo Ramos Dornel www.rdornel.com (Site/Blog/Videos) @rdornel Microsoft MCP, MCTS, MCITP MCT SolidQ – Data Platform Engineer http://www.solidq.com/br-pt

Upload: rodrigo-dornel

Post on 17-Dec-2014

663 views

Category:

Technology


1 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Data mining (Part I)

Rodrigo Ramos Dornelwww.rdornel.com (Site/Blog/Videos)

@rdornel

Microsoft MCP, MCTS, MCITP e MCTSolidQ – Data Platform Engineerhttp://www.solidq.com/br-pt

Page 2: Data mining (Part I)

Data Mining

• Reference

http://msdn.microsoft.com/en-us/library/bb510516

Introducing Data Mining Concepts and Tools with SQL Server 2012

Page 3: Data mining (Part I)

Data Mining

• Data Mining Concepts

• Data Mining Algorithms

• Mining Structures

• Mining Models

• Testing and Validation

• Data Mining Queries

• Data Mining Solutions

• Data Mining Architecture

• Data Mining Tools

Introducing Data Mining Concepts and Tools with SQL Server 2012

Page 4: Data mining (Part I)

Data Mining

• Data Mining Concepts

• Is Data Mining part of BI – Business Intelligence?

• Is Data Mining part of BA – Business Analytics?

Reference and Recommendation:

http://timoelliott.com/blog/2011/03/business-analytics-vs-business-intelligence.html

http://en.wikipedia.org/wiki/Business_analytics

Introducing Data Mining Concepts and Tools with SQL Server 2012

Page 5: Data mining (Part I)

Data Mining

• Data Mining Concepts

• In other words, querying, reporting, OLAP, and alert tools can answer questions such as

what happened, how many, how often, where the problem is, and what actions are

needed.

(Summarize)

• Business analytics can answer questions like why is this happening, what if these trends

continue, what will happen next (that is, predict), what is the best that can happen (that is,

optimize)

(Tendency)

Introducing Data Mining Concepts and Tools with SQL Server 2012

Page 6: Data mining (Part I)

Data Mining

• Data Mining Concepts

• Data mining is the process of discovering information, trend and knowledge from large

sets of data (any data).

• Uses statistical and mathematic techniques to derive patterns and trends that exist in

data.

• This task cannot be resolved with the traditional database query's, OLTP or OLAP.

• In Data Mining world you want recommendations, sequences, groups and risk.

• You have not structured decision´s.

Introducing Data Mining Concepts and Tools with SQL Server 2012

Page 7: Data mining (Part I)

Data Mining

• Data Mining Concepts

• First Step: What do you want? Oh God, it´s hard to define it!!!

• What are you looking for? What types of relationships are you trying to find?

• Do you want to make predictions from the data mining model, or just look for interesting

patterns and associations?

This is very important: “To answer these questions, you might have to conduct a data

availability study, to investigate the needs of the business users with regard to the available

data. So, if the data does not support the needs of the users, you might have to redefine the

project.”

Introducing Data Mining Concepts and Tools with SQL Server 2012

Page 8: Data mining (Part I)

Data Mining

• Data Mining Concepts

• Second Step: Ok, I have the data and now ?! …

• We need to standardize, normalize, discretize, clean, and correct this data. Put this data in

one place.

• How can we do this?

• SQL Server 2012 and older versions can help you:

– Integration Services in Business Intelligence Development Studio

– Master Data Services

– Data Quality Services

Introducing Data Mining Concepts and Tools with SQL Server 2012

Page 9: Data mining (Part I)

Data Mining

• Data Mining Concepts

• Integration Services in Business Intelligence Development Studio

– Master Data Services

– http://msdn.microsoft.com/en-us/sqlserver/ff943581.aspx

– Data Quality Services

– http://technet.microsoft.com/en-us/sqlserver/hh780961.aspx

Introducing Data Mining Concepts and Tools with SQL Server 2012

Page 10: Data mining (Part I)

Data Mining

• Data Mining Concepts

• Fisrt Demonstration

• Discretizing data

• Normalizing data

• SSIS Look Up

Introducing Data Mining Concepts and Tools with SQL Server 2012

Page 11: Data mining (Part I)

Rodrigo Ramos Dornelwww.rdornel.com (Site/Blog/Videos)

@rdornel

Microsoft MCP, MCTS, MCITP e MCTSolidQ – Data Platform Engineerhttp://www.solidq.com/br-pt

Little Tip:

(Basic Data Mining Tutorial) http://msdn.microsoft.com/en-us/library/ms167167