Lirong Xia
Approximate inference: Particle filter
Tue, April 1, 2014
Reminder

• Friday April 4
– Project 3 due
– Project 4 (HMM) out
Recap: Reasoning over Time

• Markov models: p(X1), p(Xt|Xt-1)
• Hidden Markov models: additionally an emission model p(Et|Xt), e.g.:

X      E            p
rain   umbrella     0.9
rain   no umbrella  0.1
sun    umbrella     0.2
sun    no umbrella  0.8

• Filtering: given time t and evidence e1,…,et, compute p(Xt|e1:t)
Filtering algorithm

• Notation
– B(Xt-1) = p(Xt-1|e1:t-1)
– B'(Xt) = p(Xt|e1:t-1)
• Each time step, we start with B(Xt-1) = p(Xt-1 | previous evidence)
• Elapse of time: B'(Xt) = Σxt-1 p(Xt|xt-1) B(xt-1)
• Observe: B(Xt) ∝ p(et|Xt) B'(Xt)
• Renormalize B(Xt) so that it sums to 1
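The elapse/observe/renormalize loop can be sketched in Python for the umbrella example. The emission probabilities match the recap table; the transition model and the uniform prior are assumed for illustration (they are not given on the slide).

```python
# Exact filtering (forward algorithm) for the two-state rain/umbrella HMM.
STATES = ["rain", "sun"]

# p(X_t | X_{t-1}) -- assumed values, for illustration only
TRANSITION = {
    "rain": {"rain": 0.7, "sun": 0.3},
    "sun":  {"rain": 0.3, "sun": 0.7},
}

# p(umbrella | X_t) from the recap table
EMISSION = {"rain": 0.9, "sun": 0.2}

def filter_step(belief, saw_umbrella):
    """One forward-algorithm step: elapse time, observe, renormalize."""
    # Elapse of time: B'(X_t) = sum_{x_{t-1}} p(X_t | x_{t-1}) B(x_{t-1})
    predicted = {
        x: sum(TRANSITION[prev][x] * belief[prev] for prev in STATES)
        for x in STATES
    }
    # Observe: B(X_t) proportional to p(e_t | X_t) B'(X_t)
    def likelihood(x):
        return EMISSION[x] if saw_umbrella else 1.0 - EMISSION[x]
    unnormalized = {x: likelihood(x) * predicted[x] for x in STATES}
    # Renormalize so that B(X_t) sums to 1
    z = sum(unnormalized.values())
    return {x: p / z for x, p in unnormalized.items()}

belief = {"rain": 0.5, "sun": 0.5}  # assumed uniform prior p(X_1)
for saw_umbrella in [True, True]:   # two days of umbrella sightings
    belief = filter_step(belief, saw_umbrella)
print(belief)                       # p(rain | +u, +u) ≈ 0.883
```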
Prior Sampling (without evidence)

• Given a Bayesian network (graph and CPTs)
• Generate values of the random variables sequentially
– According to an order compatible with the graph (a topological order)
– In each step, generate the value of the next unvalued variable given the previously generated values
• This sampling procedure is consistent

Samples:
+c, -s, +r, +w
-c, +s, -r, +w
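As a sketch, here is prior sampling on the classic Cloudy/Sprinkler/Rain/WetGrass network that produces assignments like the samples above. The CPT values below are assumed for illustration; only the sampled assignments appear on the slide.

```python
import random

# Prior sampling: generate C, S, R, W in a topological order,
# each conditioned on its already-sampled parents.
def prior_sample():
    c = random.random() < 0.5                   # p(+c) = 0.5 (assumed)
    s = random.random() < (0.1 if c else 0.5)   # p(+s | c) (assumed)
    r = random.random() < (0.8 if c else 0.2)   # p(+r | c) (assumed)
    if s and r:
        pw = 0.99
    elif s or r:
        pw = 0.90
    else:
        pw = 0.01
    w = random.random() < pw                    # p(+w | s, r) (assumed)
    return c, s, r, w

random.seed(0)
samples = [prior_sample() for _ in range(10000)]
# Consistency check: the fraction of samples with +r should approach
# p(+r) = 0.5*0.8 + 0.5*0.2 = 0.5
frac_r = sum(1 for _, _, r, _ in samples if r) / len(samples)
print(frac_r)
```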
Rejection Sampling

• We want p(C|+s)
– Do prior sampling, but ignore (reject) samples which don't have S = +s
– It is also consistent for conditional probabilities (i.e., correct in the limit)
• Problem
– If p(+s) is too small, then many samples are wasted

Samples:
+c, -s, +r, +w
+c, +s, +r, +w
-c, +s, +r, -w
+c, -s, +r, +w
-c, -s, -r, +w
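A minimal sketch of rejection sampling for p(C | +s), on assumed CPT values for the same Cloudy/Sprinkler network; the acceptance rate shows how many samples are wasted.

```python
import random

# Rejection sampling for p(C | +s): do prior sampling over (C, S),
# but keep only the samples with S = +s.
def prior_sample_cs():
    c = random.random() < 0.5                   # p(+c) = 0.5 (assumed)
    s = random.random() < (0.1 if c else 0.5)   # p(+s | c) (assumed)
    return c, s

random.seed(0)
draws = [prior_sample_cs() for _ in range(100000)]
kept = [c for c, s in draws if s]               # reject samples with S = -s
estimate = sum(kept) / len(kept)                # estimate of p(+c | +s)
accept_rate = len(kept) / len(draws)            # ≈ p(+s); the rest are wasted
print(estimate, accept_rate)
```

With these CPTs the exact answer is p(+c | +s) = 0.5·0.1 / (0.5·0.1 + 0.5·0.5) = 1/6, and only about 30% of the draws survive rejection; the smaller p(+s) is, the worse that rate gets.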
Likelihood Weighting

• Step 1: sample the unvalued variables z (given evidence e)
• Step 2: fix the evidence variables to their observed values
• Step 3: the generated sample has weight
w(z, e) = Π_{i=1}^{m} p(e_i | Parents(E_i))
• Likelihood weighting is consistent

Example: z = {C, R}, e = {S, W}
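The three steps can be sketched for z = {C, R} and evidence e = {S = +s, W = +w}, again on assumed CPT values for the Cloudy/Sprinkler/Rain/WetGrass network:

```python
import random

# Likelihood weighting: sample only the unvalued variables (C, R);
# fix the evidence S = +s, W = +w and accumulate its likelihood as a weight.
def weighted_sample():
    # Step 1: sample z in topological order
    c = random.random() < 0.5                   # p(+c) (assumed)
    r = random.random() < (0.8 if c else 0.2)   # p(+r | c) (assumed)
    # Steps 2-3: fix the evidence and multiply in its likelihood
    w = 0.1 if c else 0.5                       # p(+s | c) (assumed)
    w *= 0.99 if r else 0.90                    # p(+w | +s, r) (assumed)
    return c, w

random.seed(0)
samples = [weighted_sample() for _ in range(50000)]
# Weighted estimate of p(+c | +s, +w)
estimate = sum(w for c, w in samples if c) / sum(w for _, w in samples)
print(estimate)
```

Under these CPTs the exact posterior is p(+c | +s, +w) ≈ 0.175, which the weighted average approaches as the number of samples grows (consistency).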
Today

• Particle filtering
• Dynamic Bayesian networks
• Viterbi algorithm
Particle Filtering

• Sometimes |X| is too big to use exact inference
– |X| may be too big to even store B(X)
– E.g., X is continuous
– |X|² may be too big to do updates
• Solution: approximate inference
– Track samples of X, not all values
– The samples are called particles
– Time per step is linear in the number of samples
– But: the number of samples needed may be large
– In memory: a list of particles
• This is how robot localization works in practice
Representation: Particles

• p(X) is now represented by a list of N particles (samples)
– Generally, N << |X|
– Storing a map from X to counts would defeat the point
• p(x) is approximated by the fraction of particles with value x
– Many x will have p(x) = 0!
– More particles, more accuracy
• For now, all particles have weight 1

Particles: (3,3) (2,3) (3,3) (3,2) (3,3) (3,2) (2,1) (3,3) (3,3) (2,1)
Particle Filtering: Elapse Time

• Each particle is moved by sampling its next position from the transition model:
x' = sample(p(X'|x))
– This is like prior sampling: the samples' frequencies reflect the transition probabilities
– Here, most samples move clockwise, but some move in another direction or stay in place
• This captures the passage of time
– If we have enough samples, the result is close to the exact values before and after (consistent)
Particle Filtering: Observe

• Slightly trickier:
– Don't do rejection sampling (why not?)
– Instead, use likelihood weighting:
w(x) = p(e|x),  so B(X) ∝ p(e|X) B'(X)
– Note that, as before, the weights don't sum to one, since most have been downweighted
• In fact they sum to an approximation of p(e)
Particle Filtering: Resample

• Rather than tracking weighted samples, we resample
• N times, we choose from our weighted sample distribution (i.e., draw with replacement)
• This is analogous to renormalizing the distribution
• Now the update is complete for this time step; continue with the next one

Old particles:
(3,3) w=0.1
(2,1) w=0.9
(2,1) w=0.9
(3,1) w=0.4
(3,2) w=0.3
(2,2) w=0.4
(1,1) w=0.4
(3,1) w=0.4
(2,1) w=0.9
(3,2) w=0.3

New particles:
(2,1) w=1
(2,1) w=1
(2,1) w=1
(3,2) w=1
(2,2) w=1
(2,1) w=1
(1,1) w=1
(3,1) w=1
(2,1) w=1
(1,1) w=1
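One complete particle-filter update (elapse time, observe, resample) can be sketched on the two-state rain/sun model. The emission probabilities match the recap table; the transition model and the uniform initial particles are assumed for illustration.

```python
import random

TRANSITION = {"rain": {"rain": 0.7, "sun": 0.3},   # assumed p(X'|X)
              "sun":  {"rain": 0.3, "sun": 0.7}}
EMISSION = {"rain": 0.9, "sun": 0.2}               # p(umbrella | X)

def particle_filter_step(particles, saw_umbrella):
    # Elapse time: move each particle by sampling the transition model
    moved = [random.choices(list(TRANSITION[x]),
                            weights=list(TRANSITION[x].values()))[0]
             for x in particles]
    # Observe: weight each particle by the likelihood of the evidence
    weights = [EMISSION[x] if saw_umbrella else 1.0 - EMISSION[x]
               for x in moved]
    # Resample: draw N new particles with replacement, proportional to weight
    return random.choices(moved, weights=weights, k=len(moved))

random.seed(0)
particles = [random.choice(["rain", "sun"]) for _ in range(5000)]
for saw_umbrella in [True, True]:          # two umbrella observations
    particles = particle_filter_step(particles, saw_umbrella)
estimate = particles.count("rain") / len(particles)
print(estimate)  # close to the exact filtered p(rain | +u, +u) ≈ 0.883
```

After resampling, every particle again has weight 1, which mirrors the renormalization step of the exact forward algorithm.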
Forward algorithm vs. particle filtering

Forward algorithm:
• Elapse of time: B'(Xt) = Σxt-1 p(Xt|xt-1) B(xt-1)
• Observe: B(Xt) ∝ p(et|Xt) B'(Xt)
• Renormalize: so that B(xt) sums to 1

Particle filtering:
• Elapse of time: move each particle, x ---> x'
• Observe: weight each particle, w(x') = p(et|x')
• Resample: resample N particles from the weighted distribution
Robot Localization

• In robot localization:
– We know the map, but not the robot's position
– Observations may be vectors of range-finder readings
– The state space and readings are typically continuous (it works basically like a very fine grid), so we cannot store B(X)
– Particle filtering is a main technique
Dynamic Bayes Nets (DBNs)

• We want to track multiple variables over time, using multiple sources of evidence
• Idea: repeat a fixed Bayes net structure at each time step
• Variables from time t can condition on those from t-1
• DBNs with evidence at the leaves are HMMs
Exact Inference in DBNs

• Variable elimination applies to dynamic Bayes nets
• Procedure: "unroll" the network for T time steps, then eliminate variables until p(Xt|e1:t) is computed
• Online belief updates: eliminate all variables from the previous time step; store factors for the current time only
DBN Particle Filters

• A particle is a complete sample for a time step
• Initialize: generate prior samples for the t=1 Bayes net
– Example particle: G1^a = (3,3), G1^b = (5,3)
• Elapse time: sample a successor for each particle
– Example successor: G2^a = (2,3), G2^b = (6,3)
• Observe: weight each entire sample by the likelihood of the evidence conditioned on the sample
– Likelihood: p(E1^a | G1^a) · p(E1^b | G1^b)
• Resample: select prior samples (tuples of values) in proportion to their likelihood
SLAM

• SLAM = Simultaneous Localization And Mapping
– We do not know the map or our location
– Our belief state is over maps and positions!
– Main techniques:
• Kalman filtering (Gaussian HMMs)
• Particle methods
• http://www.cs.duke.edu/~parr/dpslam/D-Wing.html
HMMs: MLE Queries

• HMMs are defined by:
– States X
– Observations E
– Initial distribution: p(X1)
– Transitions: p(Xt|Xt-1)
– Emissions: p(Et|Xt)
• Query: most likely explanation:
x*_{1:t} = argmax_{x1:t} p(x1:t | e1:t)
State Path

• Graph of states and transitions over time
• Each arc represents a transition xt-1 → xt
• Each arc has weight p(xt|xt-1) p(et|xt)
• Each path is a sequence of states
• The product of the weights on a path is that sequence's probability
• Forward algorithm: computes the sum over all paths
• Viterbi algorithm: computes the best path

X1 → X2 → … → XN
Viterbi Algorithm
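The Viterbi recurrence, m_t(x) = p(e_t|x) · max_{x'} p(x|x') m_{t-1}(x'), can be sketched on the rain/umbrella model. The emission table matches the example slide below; the transition probabilities and the uniform prior are assumed for illustration.

```python
# Viterbi: compute the most likely state sequence argmax_{x_{1:t}} p(x_{1:t} | e_{1:t}).
STATES = ["+r", "-r"]
PRIOR = {"+r": 0.5, "-r": 0.5}                 # assumed p(X_1)
TRANSITION = {"+r": {"+r": 0.7, "-r": 0.3},    # assumed p(X_t | X_{t-1})
              "-r": {"+r": 0.3, "-r": 0.7}}
EMISSION = {"+r": {"+u": 0.9, "-u": 0.1},      # p(E | X) from the example table
            "-r": {"+u": 0.2, "-u": 0.8}}

def viterbi(evidence):
    # m[x] = probability of the best path ending in state x
    m = {x: PRIOR[x] * EMISSION[x][evidence[0]] for x in STATES}
    back = []  # back-pointers: best predecessor of each state at each step
    for e in evidence[1:]:
        prev = {x: max(STATES, key=lambda xp: m[xp] * TRANSITION[xp][x])
                for x in STATES}
        m = {x: m[prev[x]] * TRANSITION[prev[x]][x] * EMISSION[x][e]
             for x in STATES}
        back.append(prev)
    # Reconstruct the best path by following the back-pointers
    last = max(STATES, key=lambda x: m[x])
    path = [last]
    for prev in reversed(back):
        path.append(prev[path[-1]])
    return list(reversed(path))

print(viterbi(["+u", "+u", "-u"]))
```

Replacing the max over predecessors with a sum recovers the forward algorithm, which is exactly the sum-over-paths vs. best-path distinction on the state-path slide.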
Example

X    E    p
+r   +u   0.9
+r   -u   0.1
-r   +u   0.2
-r   -u   0.8