csci 5521: pattern recognition - university of...

43
CSCI 5521: Pattern Recognition Prof. Paul Schrater

Upload: lamngoc

Post on 08-Mar-2018

241 views

Category:

Documents


6 download

TRANSCRIPT

Page 1: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

CSCI 5521: Pattern Recognition Prof. Paul Schrater

Page 2: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 2

Business

• Course web page:

http://gandalf.psych.umn.edu/~schrater/schrater_lab/courses/PattRecog07/PattRecog.html

Page 3: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 3

Syllabus• Reading materials:

– Pattern Classification, 2nd Ed. Duda, Hart, Stork– Statistical Pattern Recognition, 2nd Ed. Andrew Webb– Elements of Statistical Learning: Data Mining, Inference, Prediction,

Trevor Hastie, Robert Tibshirani, and Jerome Friedman. Springer,2001.

– Gaussian Processes for Machine Learning, Carl Rasmussen andChristopher K.I. Williams, MIT Press, 2006.

– Papers posted on the web site.– Downloads will be password protected.

• Grading• 60% on the homework assignments• 40% on the final project.

Page 4: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 4

Syllabus cont’d• Final Project 12-15 page paper involving:1) Simulation or experiments. For example, implement a pattern

recognition system for a particular application, e.g. digitclassification, document clustering, etc.

2) Literature survey (with critical evaluation) on a given topic.3) Theoretical work (detailed derivations, extensions of existing

work, etc)

Important dates:• Sept. 28: Topic selection. One or two pages explaining the

project with a list of references.• Nov. 7: Partial report (3 to 5 pages).• Dec. 20: Final report (12 to 15 pages).• Students may work in groups of 2-4.

Page 5: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 5

Policies/Procedures

• DO NOT CHEAT.• Do NOT work in groups for homework

(consulting with fellow students permitted,but group submissions are not).

• Electronically submit homework.• Homework must be submitted by Midnight

on the day it is due.

Page 6: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Introduction to PatternRecognition

• Syllabus• What are Patterns?• Pattern Recognition• An Example• Pattern Recognition Systems• The Design Cycle• Learning and Adaptation• Conclusion

Page 7: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 7

Examples of Patterns

Page 8: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 8

Examples of Patterns

Natural orNot?

How can wedescribe thesepatterns?

Page 9: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 9

Shape Patterns

D’arcy Thompson’s suggestion ofspecies change throughcontinuous deformation

This figure shows the effects ofAlzheimer's Disease on the ventricularexpansion rate measured from serial MRI.

Page 10: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 10

Explaining patterns Voice Puppetry, M. Brand;

Siggraph’99

Page 11: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 11

Pattern Examples

Page 12: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 12

Pattern Examples

Natural Language is a pattern

Page 13: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 13

What is a Pattern?• A set of instances that:

– Share some regularities and similarities.– Are Repeatable.– Are Observable, sometimes partially, using sensors

with noise and distortions.• How do we define “regularity”?• How do we define “similarity”?• How do we define “likelihood” for the repetition

of a pattern?• How do we model the sensors?• What is not a pattern?

Page 14: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 14

Defining PatternsIn a mathematical notation, Ulf Grenander (1976-1995)propose to define patterns as follows:Regularity R=<G, S, ρ, Σ >• G --- a set/space of generators (the basic elements in a

pattern), each generator has a number of “bonds” thatcan be connected to neighbors.

• S --- a transformation group (such as similarity transform)for the generators

· ρ --- a set of local regularities (rules for the compatibilityof generators and

· Σ --- a set of global configurations (graphs with generatorsbeing vertices and connected bonds being edges).

Page 15: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 15

ConceptIllustration

S

ρ

Σ

Basic description - Points with connections

Page 16: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 16

Two Schools of Pattern Rec.• Generative methods:

Bayesian school, pattern theory. The goal is to learn a complete modelfor the pattern - one capable of generating new instances.

1). Specify patterns in terms of measurable signals2). Specify likelihood model for how signals are generated from hidden

structures3). Learning probability models on hidden structures from ensemble of

signals4). Infer generating causes from particular signals.

• Discriminative methods:– The goal is to tell apart a number of patterns, say 100 people, 10

digits, directly, without causes or mathematical description ofpattern generating mechanisms.

– “Don’t oversolve your problem”.

Page 17: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 17

Pattern Recognition applications• Build a machine that can recognize patterns:

– Speech recognition

– Fingerprint identification

– OCR (Optical Character Recognition)

– DNA sequence identification

– Text Classification

Page 18: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 18

An Example

• “Sorting incoming Fish on a conveyoraccording to species using optical sensing”

Sea bassSpecies

Salmon

Page 19: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 19

• Problem Analysis

– Set up a camera and take some sample images to extract features

• Length• Lightness• Width• Number and shape of fins• Position of the mouth, etc…

• This is the set of all suggested features to explore for use in our classifier!

Page 20: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 20

PreprocessingSegment fish fromBackground

Feature Extraction Image data from each fish Summarized by feature extractor whose purpose is to reduce the data by measuring certain features

ClassificationThe features are passedto a classifier, that uses thefeatures to decide whichclass the instance belongsto.

Page 21: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 21

Page 22: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 22

The length is a poor feature alone!

Select the lightness as a possiblefeature.

Page 23: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 23

Page 24: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 24

• Threshold decision boundary and cost relationship

– Move our decision boundary toward smaller values oflightness in order to minimize the cost (reduce thenumber of sea bass that are classified salmon!)

Task of decision theory

Page 25: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 25

• Adopt the lightness and add the width of thefish

Fish xT = [x1, x2]

Lightness Width

Page 26: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 26

Page 27: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 27

• We might add other features that are not correlatedwith the ones we already have. A precaution shouldbe taken not to reduce the performance by addingsuch “noisy features”

• Ideally, the best decision boundary should be theone which provides an optimal performance such asin the following figure:

Page 28: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 28

Page 29: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 29

• However, our satisfaction is premature becausethe central aim of designing a classifier is tocorrectly classify novel input

Issue of generalization!

Page 30: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 30

Page 31: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 31

Pattern Recognition Systems

• Sensing

– Use of a transducer (camera or microphone)– PR system depends of the bandwidth, the resolution

sensitivity distortion of the transducer

• Segmentation and grouping

– Patterns should be well separated and should notoverlap

Page 32: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 32

Page 33: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 33

• Feature extraction– Discriminative features– Invariant features with respect to translation, rotation and scale.

• Classification– Use a feature vector provided by a feature extractor to assign the

object to a category

• Post Processing– Exploit context: input dependent information other than from the

target pattern itself to improve performance

Page 34: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 34

The Design Cycle

• Data collection• Feature Choice• Model Choice• Training• Evaluation• Computational Complexity

Page 35: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 35

Page 36: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 36

• Data Collection

– How do we know when we have collected anadequately large and representative set ofexamples for training and testing the system?

Page 37: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 37

• Feature Choice

– Depends on the characteristics of the problemdomain.

– Feature desirata:• Simple to extract,• invariant to irrelevant transformations• insensitive to noise.

Page 38: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 38

• Model Choice

– Unsatisfied with the performance of our fishclassifier and want to jump to another class ofmodel

Page 39: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 39

• Training

– Use data to determine the classifier. Manydifferent procedures for training classifiers andchoosing models

Page 40: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 40

• Evaluation

– Measure the error rate (or performance) andswitch from one set of features to another

Page 41: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 41

• Computational Complexity

– What is the trade-off between computationalease and performance?

– (How an algorithm scales as a function of thenumber of features, patterns or categories?)

Page 42: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 42

Learning and Adaptation• Supervised learning

– A teacher provides a category label or cost for eachpattern in the training set

• Unsupervised learning– The system forms clusters or “natural groupings” of the

input patterns

• Reinforcement Learning/Semi-supervised– Environment provides feedback indicating the quality

of decisions, but no labeled examples.

Page 43: CSCI 5521: Pattern Recognition - University of …vision.psych.umn.edu/users/schrater/schrater_lab/courses/PattRecog...Prof. Paul Schrater Pattern Recognition CSCI 5521 20 Preprocessing

Prof. Paul Schrater Pattern Recognition CSCI 5521 43

Conclusion

• The number, complexity and magnitude of thesub-problems of Pattern Recognition areformidable.

• Many of these sub-problems can indeed be solved

• Many fascinating unsolved problems still remain