1 pattern classification all materials in these slides were taken from pattern classification (2nd...

32
1 Pattern Classificatio n All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley & Sons, 2000 with the permission of the authors and the publisher

Post on 19-Dec-2015

231 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

1

Pattern Classification

All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley & Sons, 2000 with the permission of the authors and the publisher

Page 2: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

2

Intro to Pattern Recognition Chapter 1: Sections 1.1-1.6

Machine Perception An Example

Pattern Recognition Systems The Design Cycle

Learning and Adaptation Conclusion

Page 3: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

3

Machine Perception

Build a machine that can recognize patterns:

Speech recognition

Fingerprint identification

OCR (Optical Character Recognition)

DNA sequence identification

Page 4: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

4

An Example

“Sorting incoming Fish on a conveyor,

according to species using optical sensing”

Sea bassSpecies

Salmon

Page 5: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

5

Page 6: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

6

Problem Analysis

Take sample images … to extract features Length Lightness Width Number and shape of fins Position of the mouth etc…

= set of features to use in our classifier!

Page 7: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

7

Use a segmentation operation to isolate fish from one another and from the background

Information from each single fish is sent to a feature extractor, whose purpose is to reduce the data by measuring certain features

The features are passed to a classifier

Preprocessing

Page 8: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

8

Page 9: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

9

Select the length of the fish as a possible feature for discrimination

Classification

Page 10: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

10

Page 11: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

11

The length is a poor feature alone!

Select the lightness as a possible feature.

Page 12: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

12

Page 13: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

13

Move decision boundary toward smaller values of lightness, to minimize cost reduce number of sea bass classified as salmon!

Threshold decision boundary and cost relationship

Task of decision theory

Page 14: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

14

Adopt the lightness and add the width of the fish

Fish xT = [x1, x2]

Lightness Width

Page 15: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

15

Page 16: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

16

Perhaps add other features not correlated with current ones Warning: “noisy features” will reduce

performance

Best decision boundary == one that provides an optimal performance Eg …

Page 17: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

17

“Optimal Performance” ??

Page 18: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

18

Goal: Optimal performance on NOVEL data Performance on TRAINING DATA !=

Performance on NOVEL data

Objective: Dealing with Novel Data

Issue of generalization!

Page 19: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

19

Page 20: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

20

Pattern Recognition Systems

Sensing Using transducer (camera, microphone, …) PR system depends of the bandwidth, the

resolution sensitivity distortion of the transducer

Segmentation and grouping Patterns should be well separated

(should not overlap)

Page 21: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

21

Page 22: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

22

Feature extraction Discriminative features Want features INVARIANT wrt translation,

rotation, scale. Classification

Using feature vector (provided by feature extractor) to assign given object to a category

Post Processing Exploit context

info not from the target pattern itself to improve performance

Page 23: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

23

The Design Cycle

Data collection Feature Choice Model Choice Training Evaluation

Computational Complexity

Page 24: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

24

Page 25: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

25

How do we know when we have collected an adequately large and representative set of examples for training and testing the system?

Data Collection

Page 26: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

26

Depends on characteristics of problem domain.

Ideally… Simple to extract Invariant to irrelevant transformation Insensitive to noise.

Feature Choice

Page 27: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

27

If not satisfied with performance (EG, of our fish classifier)

Consider another class of model

Model Choice

Page 28: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

28

Use data to obtain good classifier identify best model determine appropriate parameters

Many procedures for training classifiers and choosing models

Training

Page 29: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

29

Measure error rate ~= performance

May suggest switching from one set of features to another one

Evaluation

Page 30: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

30

Trade-off between computational ease and performance?

How algorithm scales as function of number of features, patterns or categories?

Computational Complexity

Page 31: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

31

Learning and Adaptation

Supervised learning

A teacher provides a category label or cost for each pattern in the training set

Unsupervised learning

The system forms clusters or “natural groupings” of the input patterns

Page 32: 1 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John

32

Conclusion

Reader seems to be overwhelmed by the number, complexity and magnitude of the sub-problems of Pattern Recognition

Many of these sub-problems can indeed be solved

Many fascinating unsolved problems still remain