robust combination of local controllers

Robust Combination of Local Controllers

Carlos Guestrin

Dirk OrmoneitStanford University

Planning

Planning is central in real-world systems;

However, planning is hard: Motion planning is PSPACE-

hard [Reif 79]; State and Action spaces are

often continuous; Uncertainty is ubiquitous:

Imprecise actuators; Noisy sensors.

Global versus Local Controllers

Designing a global controller is hard, but… Many real-world domains allow us to design

good local controllers with no global guarantees:

How can we combine local controllers to obtain a global solution ?

Combining Local Controllers

Randomized algorithm: Nonparametric combination of local

controllers; Generalizes probabilistic roadmaps: [Hsu et

al.99] stochastic domains; Discounted MDPs;

Theoretical analysis: Characterizing local goodness of controllers; polynomial number of milestones is sufficient.

Motion Planning Case

Deterministic motion planning: Given some start and goal configurations,

find a collision free path; Stochastic motion planning:

Given some start and goal configurations, find a high probability of success path.

Start Goal

Nonparametric Combination of Local Controllers

Use simulation to estimate quality of local controllers

Quality: prob. controller reaches neighbor without collisions

Nonparametric Combination of Local Controllers

Finding a high success probability path Sample milestones uniformly at random:

X1, …, XN-1 ; Set start as X0 and goal as XN;

Simulation to estimate local connectivity: Estimate pij for j in the K nearest neigbors of i;

Shortest path algorithm to find most

probable path from X0 to XN:

Edge weights become –log pij .

Example: Maximum Success Probability Path

What About Costs ? MDPs find path with lowest expected

cost: Implicit trade-off: cost of hitting obstacles

and reward for goal; In Robotics, a successful path often more

important than a short path: Robotic museum guide; Manufacturing;

Thus, we make the trade-off explicit: What is the lowest cost path with success

probability of at least pmin ?

Restricted Shortest Path Lowest cost path with success prob. at least

pmin: Restricted shortest path problem; NP-hard, however, FPAS algorithms [Hassin 92];

Dynamic programming algorithm: Discretize [pmin,1] into S+1 values;

q(s) = (pmin)s/S, s = 0, …, S;

V(s,xi): minimum cost-to-go starting at xi, reaching

goal with success probability at least q(s).

Examples:Restricted Shortest Paths

Success prob.: 0.99Path length: 1.75

Examples:Restricted Shortest Paths

Theoretical Analysis:Characterizing quality of local controllers

Probabilistic roadmaps (PRMs): [Hsu et al. 99] Deterministic motion planning; Characterize space as (,,)-good; Bound number of milestones;

Extension to stochastic domains: Characterize space and controller as (,,,pp)-

good.XRX

RX – points reachable using controller from X with probability of success pp

Space is (,pp)-good if:Volume(RX) . Volume(free space)

Theorem For any >0, a roadmap with

N=28ln(8/)/+3/+2 milestones, with probability at least 1-, will contain a path between any two milestones in the same connected component and this path will have success probability of at least pp

Complete with probability at least 1-; Number of milestones poly(ln(1/), 1/, 1/, 1/); Final path has success probability of at least pp

In words:

Related Work Macro actions in discrete discounted

MDPs: Hauskrecht et al. 1998, Parr 1998;

Probabilistic Roadmaps (PRMs) for deterministic motion planning: Hsu et al. 1999;

Continuous state, discrete actions discounted MDPs: Rust 1997.

Centralized Control of Two Holonomic Robots

Success prob.: 0.99Total path length: 3.53

5 dof Robot Arm

7 dof Snake

Shortest: Most Success Probaility:

Conclusions Algorithm for planning in stochastic domains

with continuous state and action spaces: Nonparametric combination of local controllers;

Motion planning: Theoretical analysis quantifies local quality of

controllers; Proposed alternative objective function; Qualitative and quantitative properties demonstrated;

Also applicable for discounted MDPs: Describe methods for robustly combining local

controllers.

http://robotics.stanford.edu/~guestrin/Research/RobustLocalControl/

robust combination of local controllers

path length

probability of success

short path

probable path

successful path

final path

local controllersdesigning

local connectivity

Documents

optimal and robust controller...

design of robust fuzzy controllers for … of robust fuzzy...

ge nema rated full voltage starters cr306 nonreversing...

robust control tools for validating uas flight...

robust control methodology for the design of ... -...

cdem controllers’ development programme · the cdem...

synthesis of robust active queue management controllers...

combination of robust adaptive beamforming with...

combining optimal and neuromuscular controllers for agile...

robust video watermarking using secret sharing, svd, … ·...

containermaster+ the ultra-robust solution...the new...

research article robust nonfragile controllers...

robust hand tracking with refined camshift based on...

fisher 2502 controllers - emerson · fisher™ 2502...

systems with uncertainty. what are “stochastic, robust,...

robust ensemble classifier combination based on noise...

lqg/ltr, h-infinity and mu robust controllers design...

design of robust adaptive unbalance response controllers...

design of robust fuzzy controllers for … · design of...

l. celentano, robust tracking controllers design with ...l....