a tutorial on support vector machines for pattern recognition asli taŞÇi christopher j.c. burges,...

15
A TUTORİAL ON SUPPORT VECTOR MACHİNES FOR PATTERN RECOGNİTİON ASLI TAŞÇİ Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

Upload: francine-horton

Post on 29-Jan-2016

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

A TUTORİAL ON SUPPORT VECTOR MACHİNES FOR PATTERN RECOGNİTİON

ASLI TAŞÇİ

Christopher J.C. Burges, Data Mining and

Knowledge Discovery 2, 121-167, 1998

Page 2: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

OUTLİNE

• Introduction• Linear Support Vector Machines• Nonlinear Support Vector Machines• Limitations• Conclusion

Page 3: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

INTRODUCTİON

Classification and Regression tool

Supervised Learning

Linear and non-linear classification performance

Page 4: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

APPLİCATİON AREAS

Handwritten Digit Recognition

Object Recognition

Speaker Identification

Text Categorization

Face Detection in Images

Page 5: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

LİNEAR SUPPORT VECTOR MACHİNES

Simplest Case: Seperable Data

SVM Equaiton:

Lagranian:

Page 6: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

KARUSH-KUHN-TUCKER CONDİTİONS

Constraint optimization

Page 7: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

NON-SEPERABLE CASEIntroducing Slack variables for a feasible solution with linear SVM

Lagranian for non-seperable data:

Page 8: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

NONLİNEAR SUPPORT VECTOR MACHİNES

Mapping data to a feature space

Example:

Kernel Function:

Page 9: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

MERCER’S CONDİTİON

Positive Semi-definite

Page 10: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

OPTİMİZATİON PROBLEM

Quadratic programming optimizaiton

Page 11: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

TRAİNİNG

Decomposition algorithms for larger problems• Chunking method• Osuna’s decomposition algorithm

Page 12: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

LİMİTATİONS

• Choice of the Kernel• Speed• Size• Discrete Data• Multi-class classification

Page 13: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

PERFORMANCE OF SVMThe Virtual Support Vector Method• Training the system than creating a

new data by distorting the resulting support vectors.

The reduced set method• Increases the speed of SVM

Page 14: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

CONCLUSİON

• New approach to the problem of pattern recognition

• SVM training always find a global minimum• Largely characterized by the choice of its

Kernel

Page 15: A TUTORIAL ON SUPPORT VECTOR MACHINES FOR PATTERN RECOGNITION ASLI TAŞÇI Christopher J.C. Burges, Data Mining and Knowledge Discovery 2, 121-167, 1998

THANK YOU FOR LİSTENİNG