using neural networks in database mining tino jimenez cs157b mw 9-10:15 february 19, 2009

15
Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

Upload: bertha-newton

Post on 29-Dec-2015

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

Using Neural Networks in Database Mining

Tino JimenezCS157B

MW 9-10:15February 19, 2009

Page 2: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

Data Mining Overview

What is Data Mining? The process of extracting values from a

database

Why do we need/use it? Predictive technology Allows for automated decision making

Page 3: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

Data Mining Overview (continued)

What problems does it solve? Stock Market prediction Credit card fraud Loan approval/denial

How does it work? Data analysis of a given set of

information

Page 4: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

Data Mining Tools

Decision Trees A series of rules that allows for

automated decision. Common use: credit card and health insurance approvals

Regression Analysis of the association between a

dependent variable and an independent variable. Common use: prediction

Neural Networks

Page 5: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

The Basis of Neural Networks

Adapted from the research of Artificial Intelligence

Based loosely on the biological functionality of neurons

Mimics the ability to “learn”A neuron is a specialized cell that sends

an electrochemical signal

Page 6: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

The Basis of Neural Networks (cont.)

Each neuron has a specific function and is grouped with other neurons to be able to perform complex tasks

Each neuron has a “weight” which is a determining factor in the importance of the specific function being processed

Page 7: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

How Neural Networks Work

An individual neuron has a step activation function which means that it can have either a -1,0 or 1 value. A value of -1 means that it is an inhibitor and

will lessen the weight of the combined neuronsThe individual neurons are the connected

to each other as inputs and outputs. The inputs carry the values of variables of

interest The outputs form predictions or control signals

Page 8: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

How Neural Networks Work (cont.)

Feedforward Structure The most useful in solving real-world problems Signals flow from inputs through hidden units,

eventually to the output units Input layer is used only to introduce the values

of the input variables The hidden and output layer neurons are each

connected to the all of the units of the preceding layer

Page 9: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

How Neural Networks Work (cont.)

When the network is used, the variable values are placed in the input units and each subsequent layer, calculates the weighted sum of the outputs of the preceding layer until reaching the final layer.

Page 10: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

How Do You Apply a Neural Network

Exact nature of inputs and outputs will be unknown

Large quantities of data are necessary Data can be “noisy”

2 ways to set-up the network Supervised Learning Unsupervised Learning

Page 11: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

Supervised Learning

Data involves historical data sets containing input variables, which correspond to an output

Uses training and testing data to build a model The training data is what the neural network

uses to “learn” how to predict the known output. Also used for validation Famous algorithm is back propagation Uses the data to adjust the weights to minimize the error

in its predictions.

Page 12: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

Unsupervised Learning

Very uncommon to use

Attempts to locate clusters within the input data regardless of variable Supervised Learning only uses input variables

from a training set

Page 13: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

Advantages to Using a Neural Network

High Accuracy Able to approx. complex non-linear mapping

Noise Tolerance Flexible with respect to missing and noisy data

Ease of maintenance Can be implemented in parallel hardware Can be updated with new data, making them

dynamic

Page 14: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

Disadvantages to Using a Neural Network

Poor Transparency Operate as “black boxes” with little/no

knowledge of the algorithms used

Trial-and-Error Design The selection of hidden nodes and training

parameters are heuristic

Data Hungry Requires large amounts of data to be accurate

which also means more computing power

Page 15: Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009

Applications of Neural Networks

Detection of medical phenomena Recognizes predictive patterns to prescribe

appropriate treatment

Stock market prediction Large numbers of factors are introduced and

used by technical analysts

Credit assignment Identifies most relevant characteristics and

classifies applicants as good or bad credit risks