data mining and knowledge

Post on 15-Feb-2017

118 Views

Category:

Engineering

1 Downloads

Preview:

Click to see full reader

TRANSCRIPT

1

Presented by :1.Kartik N. Kalpande.

The data mining is the process in Data Knowledge

Discovery in Database that produces useful patterns or the modules from the data, Database.

Data mining can be used to mine understandable meaningful patterns from large databases and these patterns may then be converted into knowledge.

The KDD stands for Knowledge Discovery in Database.

It refers to the overall process of discovering useful knowledge from the data. 2

Data Mining and KDD

Classification:-

Mining patterns that can classify future data.

Association Rule Mining:-Mining any rule of the form X Y,

where X and Y are sets of data items.

Clustering:-Identifying a set of similarity groups in

the data.3

Main Data Mining Tasks

Sequential Pattern Mining:A sequential rule: A B, says that event A will

be immediately followed by event B with a certain confidence.

Deviation detection: Discovering the most significant changes in

Data.

Data visualization: - Using graphical or Diagrammatically methods to show patterns in data. 4

Main data mining tasks (cont. …)

Valid: generalize to the future. Novel: what we don't know. Useful: be able to take some action. Understandable: leading to insight. Iterative: takes multiple passes. Interactive: human in the loop .

5

What is data mining?

Data Mining process :-

OriginalData

TargetData

PreprocessedData

TransformedData

Patterns

KnowledgeSelection

PreprocessingTransformation

Data Mining

Interpretation

7

Regression:- Assign a new data record to one of several predefined categories or classes. Regression deals with predicting real-valued fields. Also called supervised learning. Clustering: Partition the dataset into subsets or groups such that elements of a group share a common set of properties.

Data Mining Techniques

1. Selection: Selecting data relevant to the analysis task from

the database 2. Preprocessing: Removing noise and inconsistent data;

combining multiple data sources 3. Transformation: Transforming data into appropriate forms

to perform data mining 4. Data mining: Choosing a data mining algorithm which is

appropriate to pattern in the data; Extracting data patterns 5. Interpretation/Evaluation : Interpreting the patterns into

knowledge. 8

KDD Methods..

Related Areas of Data Mining And KDD

Database technologyand data warehouses efficient storage,

access and manipulationof data

DM

statistics

machinelearning

visualization

text and Web mining

softcomputing pattern

recognition

databases

Conti…

Statistics, machine learning, pattern recognition and soft computing:-

Techniques forclassification and knowledge extractionfrom data.

DM

Statistics

MachineLearning

Visualization

Text and web mining

SoftComputing Pattern

Recognition

Databases

Conti…Text And Web

Mining:- Web page analysis, Text categorization, filtering and structuring of textual information Natural language

processing

DM

Statistics

MachineLearning

Visualization

Text and web mining

SoftComputing Pattern

Recognition

Databases

Text and web mining

Knowledge discovery can be broadly defined

as the automated discovery of novel and useful information from commercial databases. Data mining is one step at the core of the knowledge discovery process, dealing with the extraction of patterns and relationships from large amounts of data. Data Mining Techniques are used to analyze data and extract useful information from large amount of data.

Conclusion..

Thank You…..!!!

Any Queries …??

13

top related