Lecture 8 - University of California, Berkeley
University of California, Berkeley
College of Engineering
Department of Electrical Engineering and Computer Sciences
Professors: N. Morgan / B. Gold
EE 225D, Spring 1999
Pattern Classification
Lecture 8
Speech Pattern Recognition
• Soft pattern classification plus temporal sequence integration
• Supervised pattern classification: class labels used in training
• Unsupervised pattern classification: class labels not available or used
[Block diagram: Pattern → Feature Extraction → Feature Vector (x1, x2, ..., xd) → Classification → class ωk, 1 ≤ k ≤ K]
• Training: learning parameters of classifier
• Testing: classify independent test set, compare with labels and score
Feature Extraction Criteria
• Class discrimination
• Generalization
• Parsimony (efficiency)
[Figure: energy contours E(t) vs. t for plosive + vowel energies at 2 different gains]
The time derivative of the log energy is invariant to a constant gain C:

∂/∂t log(C·E(t)) = ∂/∂t (log C + log E(t)) = ∂/∂t log E(t)
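This gain invariance can be checked numerically; a minimal NumPy sketch (the energy contour here is a hypothetical stand-in, not data from the lecture):

```python
import numpy as np

# Hypothetical smooth energy contour E(t) > 0.
t = np.linspace(0.01, 1.0, 100)
E = t**2 + 0.5

C = 10.0  # constant gain applied to the signal energy

# Finite-difference approximation of d/dt log E(t), with and without the gain.
d_logE = np.diff(np.log(E))
d_logCE = np.diff(np.log(C * E))

# The constant log C term differences away, so the two derivatives agree.
print(np.allclose(d_logE, d_logCE))  # True
```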
Feature Vector Size
• Best representations for discrimination on training set are large (highly dimensioned)
• Best representations for generalization to test set are (typically) succinct
Dimensionality Reduction
• Principal components (i.e., SVD, KL transform, eigenanalysis ...)
• Linear Discriminant Analysis (LDA)
• Application-specific knowledge
• Feature Selection via PR Evaluation
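As a sketch of the first item, principal components can be computed from the SVD of the mean-centered data matrix; the data here is synthetic and purely illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))  # hypothetical feature vectors, one per row

# Center the data, then take the SVD; rows of Vt are the principal directions.
Xc = X - X.mean(axis=0)
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)

k = 3  # keep the k leading components
X_reduced = Xc @ Vt[:k].T  # project each feature vector onto the top-k directions

print(X_reduced.shape)  # (200, 3)
```

The singular values `s` come out in descending order, so the retained directions are those with the most variance.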
[Figure: two classes, plotted as x's and o's, forming separate clusters in the (f1, f2) feature plane]
PR Methods
• Minimum Distance
• Discriminant Functions
• Linear Discriminant
• Nonlinear Discriminant (e.g., quadratic, neural networks)
• Statistical Discriminant Functions
Minimum Distance
• Vector or matrix representing element
• Define a distance function
• Choose the class of stored element closest to new input
• Choice of distance equivalent to implicit statistical assumptions
• For speech, temporal variability complicates this
z_i = template vector (prototype)
x = input vector

Choose i to minimize distance:

argmin_i (x − z_i)^T (x − z_i) = argmin_i (x^T x + z_i^T z_i − 2 x^T z_i)
                              = argmax_i (z_i^T z_i − 2 x^T z_i) / (−2)
                              = argmax_i (x^T z_i − (1/2) z_i^T z_i)

(the x^T x term is the same for every i, so it can be dropped)

If z_i^T z_i = 1 for all i  ⇒  argmax_i (x^T z_i)
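The equivalence between minimum distance and maximum inner product for unit-norm templates can be demonstrated in a few lines; the templates and input here are random stand-ins:

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical template vectors z_i, normalized so z_i^T z_i = 1 for all i.
Z = rng.normal(size=(5, 8))
Z /= np.linalg.norm(Z, axis=1, keepdims=True)

x = rng.normal(size=8)  # input vector

# Minimum-distance choice ...
i_dist = np.argmin(np.sum((x - Z) ** 2, axis=1))
# ... matches the maximum-inner-product choice when templates are unit norm.
i_dot = np.argmax(Z @ x)

print(i_dist == i_dot)  # True
```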
Problems with Min Distance
• Proper scaling of dimensions (size, discrimination)
• For high dim, sparsely sampled space
Decision Rule for Min Distance
• Nearest Neighbor (NN): in the limit of infinite samples, at most twice the error of optimum classifier
• k-Nearest Neighbor (kNN)
• Lots of storage for large problems; potentially large searches
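A minimal kNN classifier along these lines might look as follows; the training points and the function name are illustrative, not from the lecture:

```python
import numpy as np
from collections import Counter

def knn_classify(x, train_X, train_y, k=3):
    """Label x by majority vote among its k nearest training samples."""
    dists = np.sum((train_X - x) ** 2, axis=1)  # squared Euclidean distances
    nearest = np.argsort(dists)[:k]             # indices of the k closest
    return Counter(train_y[nearest]).most_common(1)[0][0]

# Hypothetical 2-D data: class 0 near the origin, class 1 near (5, 5).
train_X = np.array([[0.0, 0.1], [0.2, -0.1], [-0.1, 0.0],
                    [5.0, 5.1], [4.9, 5.0], [5.1, 4.8]])
train_y = np.array([0, 0, 0, 1, 1, 1])

print(knn_classify(np.array([0.1, 0.1]), train_X, train_y))  # 0
print(knn_classify(np.array([5.0, 5.0]), train_X, train_y))  # 1
```

Note the storage cost the slide warns about: every training sample is kept, and every query scans all of them.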
Some Opinions
• Better to throw away bad data than to reduce its weight
• Dimensionality-reduction based on variance often a bad choice for supervised pattern recognition
Discriminant Analysis
• Discriminant functions max for correct class, min for others
• Decision surface between classes
• Linear decision surface for 2-dim is line, for 3 is plane; generally called hyperplane
• For 2 classes, surface at ω^T x + ω0 = 0
• 2-class quadratic case, surface at x^T W x + ω^T x + ω0 = 0
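For the 2-class linear case, classification reduces to checking which side of the hyperplane an input falls on; the weights below are arbitrary illustrative values:

```python
import numpy as np

# Hypothetical linear discriminant g(x) = w^T x + w0.
# The decision surface g(x) = 0 is a hyperplane; the sign of g picks the class.
w = np.array([1.0, -1.0])
w0 = 0.5

def classify(x):
    g = w @ x + w0
    return 1 if g > 0 else 2

print(classify(np.array([2.0, 0.0])))  # g = 2.5 > 0, class 1
print(classify(np.array([0.0, 2.0])))  # g = -1.5 < 0, class 2
```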
Training Discriminant Functions
• Minimum distance
• Fisher linear discriminant
• Gradient learning
Generalized Discriminators - ANNs
• McCulloch-Pitts neural model
• Rosenblatt Perceptron
• Multilayer Systems
The Perceptron
McCulloch-Pitts Neuron - Rosenblatt Perceptron
[Figure: inputs x1, x2, ..., xd weighted by w1, w2, ..., wd, summed (+) together with a bias term to produce output yo]
Perceptron Convergence
If classes are linearly separable, the following rule will converge in a finite number of steps:

For each pattern x at time step k:
  if x(k) ∈ class 1 and ω^T(k) x(k) ≤ 0  ⇒  ω(k+1) = ω(k) + c·x(k)
  if x(k) ∈ class 2 and ω^T(k) x(k) ≥ 0  ⇒  ω(k+1) = ω(k) − c·x(k)
  else ω(k+1) = ω(k)
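The update rule above can be sketched directly in code; the data, the bias-augmentation trick, and the function name are assumptions for illustration:

```python
import numpy as np

def train_perceptron(X, labels, c=1.0, max_epochs=100):
    """Perceptron rule: add c*x on class-1 errors, subtract c*x on
    class-2 errors, leave w unchanged otherwise. A constant 1 is
    appended to each x so the last entry of w acts as the bias."""
    Xa = np.hstack([X, np.ones((len(X), 1))])
    w = np.zeros(Xa.shape[1])
    for _ in range(max_epochs):
        errors = 0
        for x, cls in zip(Xa, labels):
            if cls == 1 and w @ x <= 0:
                w = w + c * x
                errors += 1
            elif cls == 2 and w @ x >= 0:
                w = w - c * x
                errors += 1
        if errors == 0:  # converged: every pattern on the correct side
            break
    return w

# Hypothetical linearly separable data.
X = np.array([[2.0, 1.0], [1.5, 2.0], [-1.0, -2.0], [-2.0, -1.0]])
labels = [1, 1, 2, 2]
w = train_perceptron(X, labels)
# After convergence, w^T x > 0 for class 1 and w^T x < 0 for class 2.
```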
Multilayer Perceptron
• Heterogeneous, "hard" nonlinearities: (DAID, 1961)
  [Block diagram: feature subsets, Gaussian class estimates, Perceptron]
• Homogeneous, "soft" nonlinearity ("modern" MLP)
f(y) = 1 / (1 + e^(−y))   (sigmoid)

0 < f(y) < 1

[Figure: f(y) plotted vs. y]
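A minimal sketch of the sigmoid and its bounds, using only the standard library:

```python
import math

def sigmoid(y):
    """Soft nonlinearity of the 'modern' MLP: f(y) = 1 / (1 + e^-y)."""
    return 1.0 / (1.0 + math.exp(-y))

# Output is bounded, 0 < f(y) < 1, and monotonically increasing; f(0) = 0.5.
print(sigmoid(0.0))  # 0.5
print(0.0 < sigmoid(-5.0) < sigmoid(5.0) < 1.0)  # True
```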