data mining teaching experience at the fib. what is data mining? a broad set of techniques and...

10
Data Mining Teaching experience at the FIB

Upload: lee-riley

Post on 02-Jan-2016

218 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics

Data Mining

Teaching experience at the FIB

Page 2: Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics

What is Data Mining?

A broad set of techniques and algorithms brought from machine learning and statistics to make decisions based on data … plus a lot of experience and common sense

Page 3: Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics

What subset is taught at the FIB?

Introduction to DM Multivariate statistics Clustering Association rules Regression and GLMs Decision trees Bayesian gaussian classifiers Nearest neighbours Neural networks Support vector machines

Page 4: Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics

Additional subjects

Feature selection and extraction Model evaluation, selection and combination Data mining in the real world ….

– Does it need professional software?– A talk by a professional

Page 5: Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics

Practical side

No exam Three practical homeworks:

– 1. Multivariate statistics (15%)– 2. Clustering & association rules (15%)– 3. Full Data Mining Project (70%)

Involves a 20’ oral defense!

Lab sessions (2 hours/week) Language selected: R

Page 6: Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics

Now for the funny part …

Why two teachers?– One from Software department (a computer

scientist)– One from Statistics department (a statistician)

Page 7: Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics

An equivalence table

Page 8: Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics

Some tips for the students

A successful data mining project has four components:

1. Good data2. Clean goals3. Good algorithms4. Human expertise

Page 9: Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics

The students’ replies (and subtext)

R is not a programming language– (it can’t be programmed as if C)

The results are not as good as we expected– (although we are very clever)

Too much theory– (but later on, we shall be missing it)

All in all, we have enjoyed it– (?)

Page 10: Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics

In conclusion

The students like the course, specially the practical work

They tend to work autonomously, but not always for the good

In the end, no two results are identical This is a course with much room to improve

on the teacher’s part