greedy on-line planning - sven koenigidm-lab.org/slides/greedyonline-tutorial-sven.pdf · greedy...
TRANSCRIPT
SA3 - 1 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy On-Line Planning
Sven Koenig
http://www.cc.gatech.edu/fac/Sven.Koenig/
Collaborators:
David Furcy, Yaxin Liu, Yuri Smirnov(Additional Programming: Colin Bauer, William Halliburton)
Craig Tovey, Maxim Likhachev,
SA3 - 2 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy On-Line Planning
- abstract overview: what is greedy on-line planning?
- greedy on-line planning makes planning tractable
- greedy on-line planning is reactive to the current situation(plus other advantages)
- fast replanning for greedy on-line planning
example: greedy localization
example: greedy mappingexample: moving a robot to goal coordinates in unknown terrain
example: greedy mappingexample: moving a robot to goal coordinates in unknown terrain
example: symbolic planning
example: replanning of shortest paths
Part 1:
Part 2:
Part 3:
heuristic search-based replanningcalculating the heuristics for heuristic search-based planning
SA3 - 3 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Nondeterministic Planning - The Problem
goal
planning in nondeterministic domains is time consumingdue to the many contingencies
goal
start
SA3 - 4 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
planning in nondeterministic domains is time consumingdue to the many contingencies
Nondeterministic Planning - A Solution
start
goal
agent-centered search makes it more efficient byinterleaving planning with limited lookahead and plan execution
Agent-Centered Search[Koenig; 2001]
goal
state space can even becomedeterministic
[Nourbakhsh, 1997]
SA3 - 5 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
planning in nondeterministic domains is time consumingdue to the many contingencies
Nondeterministic Planning - A Solution
agent-centered search makes it more efficient byinterleaving planning with limited lookahead and plan execution
Agent-Centered Search
planning plan execution
traditional search
agent-centered search
small (bounded) planning time between plan executions (depends on search area)
small sum of planning and execution time
state space can even becomedeterministic
goal
SA3 - 6 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
goal
planning in nondeterministic domains is time consumingdue to the many contingencies
Nondeterministic Planning - Another Solution
start
goal
assumption-based planning makes it more efficient bymaking assumptions about the outcomes of action executions
Assumption-Based Planning
desiredtrajectory
actualtrajectory
state space can even becomedeterministic
[Nourbakhsh, 1997]
SA3 - 7 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
both agent-centered search and assumption-based planning are
Nondeterministic Planning:Greedy On-Line Planning
greedy planning methodsbecause they make simplifying assumptions to make planning tractable
on-line planning methodsbecause they interleave planning and plan execution
Note: without additional assumptions, it is not guaranteedthat greedy on-line planning methods achieve the goal!
SA3 - 8 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Nondeterministic Planning:Robot Navigation under Incomplete Information
robot knows the map but not its location- localization
robot knows its location but not the map- mapping- goal-directed navigation in unknown terrain
Sensor-Based Planning[Choset and Burdick, 1994]
SA3 - 9 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Part 1
Greedy On-line Planningmakes Planning Tractable
SA3 - 10 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
The robot is always in exactly one cell.*The robot has a compass on board.The robot has no sensor or actuator uncertainty and knows the map.The robot initially does not know where it is.
short-range sensordiscretized space
The robot always senses which of the four adjacent cells is empty.The robot can move to one of the four adjacent empty cells.
Greedy Localization
The task of the robot is to find out where it is with a shortesttravel distance in the worst case (that is, for the worst possiblestart location) or detect that this is impossible. (Example: 5 moves)
* We also have results for continuous terrain that are similar to the ones presented in the following for discretized terrain.
SA3 - 11 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
It is in NP to determine whether there exists a valid localiza-tion plan that executes no more movements than a givenvalue.
It is NP-hard to find a localization plan in gridworlds of size whose worst-case number of movements to localiza-
tion is within a factor of optimum, even inconnected gridworlds in which localization is possible.
m n×O mn( )log( )
Theorem[Tovey and Koenig, 2000]
contrast with: [Dudek, Romanik, Whitesides, 1995]
Hardness of (Approximately) Optimal Localization
SA3 - 12 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
e1
e2
e3
e4
e5
S1
S2
S3
number of elementsnumber of setsnumber of sets that form a smallest set cover
x = 5y = 3y* = 2
Set Cover
Theorem
It is NP-hard to find a set cover whose number of sets iswithin a factor of optimum (for sufficientlysmall constants).
O x( )log( )
To prove the theorem, we reduce set cover problems to ourlocalization problems.
[Lund and Yannakakis, 1994]
Hardness of (Approximately) Optimal Localization
SA3 - 13 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
star
tS1
S2
S3
e0 e1 e2 e3 e4 e5
Whenever ei ∈ Sj,we make the correspondinge1
e2
e3
e4
e5
S1
S2
S3
To localize,
horizonal corridor i cells shorter.
that correspond to a set cover.all the horizontal corridors
the robot has to visit
Hardness of (Approximately) Optimal Localization
e0
SA3 - 14 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
and so on
x3 re
plic
atio
ns fo
r a
tota
l of m
= 3
x3 y+
1 ce
lls
xy cells
(We
leav
e ou
t som
e sm
all t
echn
ical
det
ails
.)
n = (xy+2)(x+1) cells
star
t
sign
atur
e
vertical = column
horiz
onta
l = r
ow
SA3 - 15 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Consider the following localization plan: Find the closest signature (= gives the robot itscurrent column). Then move into all vertical corridors that correspond to a smallest setcover (= gives the robot its current row).
The number of movements of this localization plan is at most 3y*xy.
Thus, the number of movements of an optimal localization plan is at most 3y*xy.
Thus, the number of movements of a localization plan whose worst-case number of move-ments to localization is within a factor O(log(mn)) of optimum is at most O(log(mn))3y*xy = O(log(x)) 3y*xy ≤ O(3x3y).
Thus, such a plan cannot leave its current east-west corridor and can only localize by mov-ing into all corridors that correspond to a set cover. Let y’ denote the cardinality of this setcover. Then, the number of movements is at least (2y’-1)(xy-x-1).
Thus, the number of movements is at least (2y’-1)(xy-x-1) and at most O(log(x)) 3y*xy,implying that y’ = O(log(x)) y* and thus that the set cover is within a factor O(log(x)) ofminimum.
However, it is NP-hard to find a set cover whose number of sets is within a factorO(log(x)) of minimum.
qed
Hardness of (Approximately) Optimal Localization
SA3 - 16 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Theorem[Tovey and Koenig, 2000]
For every gridworld of size , there exists a valid local-ization plan that executes movements to localizationand that can be found in time .
This result is the best possible in the sense that there existgridworlds of size in which every valid localizationplan must execute movements to localization andcan only be found in time .
m n×O mn( )
O mn( )
m n×Ω mn( )
Ω mn( )
Cost of (Approximately) Optimal Localization
SA3 - 17 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Cost of (Approximately) Optimal Localization
qed
Map and Robot Trajectory Knowledge of the Robot
Matching the Map and Knowledge of the Robot
m cells
n ce
lls
start
SA3 - 18 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Cost of (Approximately) Optimal Localization
SA3 - 19 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy Localization
Greedy Localization repeatedly makes the robot execute a shortest(deterministic) movement sequence (subplan) that is guaranteed toreduce the number of possible robot cells by at least one.
Greedy localization uses new information right away.[Genesereth and Nourbakhsh, 1993][Koenig and Simmons, 1998]
ABCDEF
1 2 3 4 5 6 7 8A1,C1,E1,B4,D4
A2,B5 C2,E2,D5
D2,E5 F2
move east
move south
...
...
SA3 - 20 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy Localization = Agent-Centered Search
start
goal
Greedy Localization repeatedly makes the robot execute a shortest(deterministic) movement sequence (subplan) that is guaranteed toreduce the number of possible robot cells by at least one.
Thus, it plans in the deterministic part of the nondeterministic statespace until a plan is found that achieves a gain in information.
Note: Assume localization is possible. The state space is safely explorable.Greedy Localization always achieves a gain in information.
Thus, Greedy Localization localizes the robot.
goal
SA3 - 21 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Theorem
The planning and plan-execution times of Greedy Localiza-tion are guaranteed to be low-order polynomials in the sizeof the gridworld.
Cost of (Approximately) Optimal LocalizationGreedy
Greedy Localization makes planning tractable.
SA3 - 22 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Random Acyclic Mazes
gridworldsize
obstacledensity
av. numberof subplans
av. numberof steps per
subplan
av. totalnumber ofmovements
11 x 11 41.345.446.847.648.148.448.6
2.43.33.84.14.54.74.9
1.51.71.71.81.81.81.9
3.65.46.67.58.08.69.1
61 x 61
%%%%%%%
xxxxxxx
=======
21 x 2131 x 3141 x 4151 x 51
71 x 71
to localization to localization to localization
Cost of (Approximately) Optimal LocalizationGreedy
Greedy Localization is fast in practice.
(5041 cells)
SA3 - 23 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Example for a Corridor-Like Terrain[Tovey and Koenig, 2000]
The worst-case number of movements of Greedy Localiza-tion can be a factor worse than the optimal worst-case number of movements to localization in gridworlds ofsize , even in connected gridworlds in which localiza-tion is possible.
Ω mn3( )
m n×
Cost of (Approximately) Optimal LocalizationGreedy
However, its plan-execution time cannot be optimal.
SA3 - 24 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
star
t
and
so o
n
qed
Cost of (Approximately) Optimal LocalizationGreedy
SA3 - 25 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
gridworldsize
obstacledensity
av. numberof subplans
av. numberof steps per
subplan
av. totalnumber ofmovements
11 x 25 50.2 4.5 2.3 10.2% x =13 x 3615 x 4917 x 6419 x 8121 x 10023 x 12125 x 14427 x 16929 x 19631 x 22533 x 25635 x 289
50.250.250.250.250.150.150.150.150.150.150.150.1
%%%%%%%%%%%%
5.97.48.9
10.411.513.414.416.018.019.420.822.5
xxxxxxxxxxxx
2.93.23.44.04.44.54.95.25.45.75.86.1
==
=========
=
16.923.730.642.050.060.471.182.598.0
110.5121.5137.7
to localization to localization to localization
Our Acyclic Mazes
Cost of (Approximately) Optimal LocalizationGreedy
(5684 cells)(4563 cells)
SA3 - 26 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Example for a Room-Like Terrain*
The worst-case number of movements of Greedy Localiza-tion can be a factor worse than theoptimal worst-case number of movements to localization ingridworlds of size , even in connected gridworlds inwhich localization is possible.
Ω mn( ) mn( )log( )⁄( )
m n×
Cost of (Approximately) Optimal LocalizationGreedy
However, its plan-execution time cannot be optimal.
* We also have even better lower bounds (although in morecomplex gridworlds) and small upper bounds.
SA3 - 27 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Cost of (Approximately) Optimal LocalizationGreedy
start
qed
0 0 0 0 0 1 010 . . .
SA3 - 28 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
(Approximately) Optimal Localization
planning timeplan-execution time
Greedy Localization
low-order polynomiallow-order polynomial
(likely) exponentiallow-order polynomial
Cost of (Approximately) Optimal LocalizationGreedy
Summary
SA3 - 29 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
no sensor uncertainty, no actuator uncertaintyminimax model
Extension: Actuator and Sensor Noise
so far:
sensor uncertainty, actuator uncertaintyprobabilistic model
POMDP-based (“Markov”) Localization
more realistic on robots:
Mobile robots have- noisy actuators- noisy sensors
sonar ring occupancy grid
SA3 - 30 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Landmark-Based Navigation Metric-Based Navigation“sensitive to the environment” “sensitive to robot movements”
be sensitive to both the environment and the robot movements
discretize the locations, butallow arbitrary location distributions
restrict location distributions,but don’t discretize the locations
Kalman Filters POMDPs
maintain a probability distribution over all locations (location distribution)+
Extension: Actuator and Sensor Noise
0.20 0.100.10 0.200.10 0.10 0.050.05
(Partially Observable Markov Decision Process Models)
SA3 - 31 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
goal location
mapping from location distributions to directives (“policy”)
directive selection
policy generation
POMDP
motion generation
desired directive
motor commandsraw sonar data raw odometer data
occupancy grid [Elfes]
sensor
sensor report motion report
location estimation
current location distribution
topological mapprior actuator model
prior sensor modelprior distance model
POMDP
Navigation
Obstacle Avoidance
Real-Time Control
path planning
Destination Planner
path
model learning
interpretation
compilation
using GROW-BW(based on Baum-Welch)
(Bayes’ rule)
SA3 - 32 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
POMDP-Based Navigation on Xavier
Xavier
Extension: Actuator and Sensor Noise
now very popular with large amount of follow-up work
operated for three years with > 200 km travel distance
[Simmons and Koenig, 1995][Koenig and Simmons, 1998]
[Thrun, 2000]
SA3 - 33 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
- explicitly models all uncertainty using probabilities- maintains arbitrary probability distributions over the locations
- utilizes all available sensor data (landmarks, robot movements)- robust towards sensor errors (no explicit exception handling required)
- uniform, theoretically grounded framework for localization
POMDP-based (“Markov”) Localization
Extension: Actuator and Sensor Noise
SA3 - 34 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Extension: Actuator and Sensor Noise
no sensor uncertainty, no actuator uncertaintyminimax model
sensor uncertainty, actuator uncertaintyprobabilistic model
POMDP-based (“Markov”) Localization
0.1 0.10.1
0.1 0.1
0.1 0.1 0.1
0.2
50 2020
10
(simplified)
SA3 - 35 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
no sensor uncertainty, no actuator uncertaintyminimax model
sensor uncertainty, actuator uncertaintyprobabilistic model
POMDP-based (“Markov”) Localization
Extension: Actuator and Sensor Noise
It is NP-hard to find an optimal homing sequencefor a colored finite state automaton.
It is NP-hard to find an optimal localization sequencein a gridworld.
add more structurethe robot can only move north, east, south, or west
[Schapire, 1992]
It is PSPACE-hard to find an optimal policy for a POMDP.[Papadimitriou, Tsitsiklis, 1987]
?????
add more structurethe robot can only move north, east, south, or west
s
s
s
s
sn
nn
n
e
e
ew
w
w
n
a
b
aba
b
a
a
b ab
b
SA3 - 36 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Extension: Actuator and Sensor Noise
no sensor uncertainty, no actuator uncertaintyminimax model
sensor uncertainty, actuator uncertaintyprobabilistic model
POMDP-based (“Markov”) Localization
Greedy Localization repeatedly makes the robot execute a shortest(deterministic) movement sequence (subplan) that is guaranteed toreduce the number of possible robot cells by at least one.
Greedy Localization repeatedly makes the robot execute a shortest(deterministic) movement sequence (subplan) that is guaranteed toreduce the entropy of the probability distribution over the possiblerobot cells.
[Burgard, Fox, Thrun, 1997]
SA3 - 37 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Part 2
Greedy On-line Planningis Reactive to the Current Situation
(plus other advantages)
SA3 - 38 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy MappingGreedy Mapping always moves the robot on a shortest path to clos-estunobserved(or unvisited) cell.
3
4
4 4...
we assume here that the robot can move in eight directions
[Koenig, Tovey, Halliburton, 2001] [Thrun et al. 1998] [Romero, Morales, Sucar, 2001]
SA3 - 39 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy Mapping = Agent-Centered Search
start
goal
Greedy Mapping always moves the robot on a shortest path to clos-estunobserved(or unvisited) cell.
Thus, it plans in the deterministic part of the nondeterministic statespace until a plan is found that achieves a gain in information.
Note: Assume mapping is possible. The state space is safely explorable.Greedy Mapping always achieves a gain in information.
Thus, Greedy Mapping maps the terrain.
goal
SA3 - 40 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
4
can easily be integrated into robot architectures (“reactive planning”)
4 4
does not need to be in control of the robot at all times (“reactive planning”)
Greedy Mapping - Advantages
for example, our implementation combines greedy mapping andschema-based navigation (MissionLab)[Mackenzie, Arkin, Cameron, 1997]
we assume here that the robot can move in eight directions
SA3 - 41 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
4
utilizes prior map knowledge, if available
can be used by multiple robots that share their maps
Greedy Mapping - Advantageswe assume here that the robot can move in eight directions
SA3 - 42 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.28 feet
20 feet
Greedy Mapping - Robot Implementation
SA3 - 43 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy Mapping - Robot Implementation
SA3 - 44 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy Mapping - Travel Distance
SA3 - 45 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
0 200 400 600 800 1000 1200 1400 1600 1800 20000
200
400
600
800
1000
1200
1400
1600
1800
Number of vertices
Tra
vel d
ista
nce
Greedy Mapping - Travel Distance
number of vertices
trav
el d
ista
nce
SA3 - 46 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy Mapping - Travel Distancewe assume here that the robot can move in eight directions
SA3 - 47 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy Mapping - Travel DistanceHere: Greedy Mapping always moves the robot on a shortest path tothe closestunvisited cell. This version of Greedy Mapping workson any strongly connected undirected graph.
start
= visited (known) vertex
= unvisited known vertex
= known edge
= a shortest path to a closest unvisited vertex
1 2 3
4 5 6
7 8 9
10 11 12
= current vertex of the robot
SA3 - 48 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
The worst-case number of move-ments of Greedy Mapping isand , where is the numbervertices of the graph, even forundirected planar graphs.
Ω s( )O s2( ) s
Theorem:Trivial Theorem
The worst-case number of move-ments of Greedy Mapping is
and , where isthe number vertices of the graph,even for undirected planar graphs.
Ω slogsloglog
------------------s( ) O s slog( ) s
Theorem:[Koenig, Tovey, Smirnov, 2001]
More Interesting Theorem
robotstart
Here: Greedy Mapping always moves the robot on a shortest path tothe closestunvisited cell.
Greedy Mapping - Travel Distance
SA3 - 49 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
order of |V|
order oflog |V|
log log |V|lower bound for Greedy Mapping
tight bound for chronological backtracking
trav
el d
ista
nce
(logs
cale
)
|V| (logscale)
lower bound
identity function
n
3456789
10
travel distance
2072279
31253515085
9928271219130987
5448100629150617283953
|V|
80778
9612144014
254252851744018
119320130030753086422
travel distance|V|
2.592.933.253.583.904.234.574.90
|V|
Greedy Mapping - Travel Distance
can we use structure todecrease the travel distance?
order of upper bound for Greedy Mapping|V|log |V|
SA3 - 50 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
[Brumitt and Stentz, 1998] [Hebert, McLachlan, Chang, 1999] [Matthies et al., 2000] [Stentz and Hebert, 1995] [Thayer et al., 2000]
Planning with the Freespace AssumptionPlanning with the Freespace Assumption always moves the roboton a shortest potentially unblocked path to the goal cell.
3
4
4 4...
we assume here that the robot can move in eight directions
SA3 - 51 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
[Brumitt and Stentz, 1998] [Hebert, McLachlan, Chang, 1999] [Matthies et al., 2000] [Thayer et al., 2000]
Planning with the Freespace AssumptionPlanning with the Freespace Assumption always moves the roboton a shortest potentially unblocked path to the goal cell.
HMMWV that navigated 1,410 meters of natural outdoor terrain in 1995
- Demo Vehicles of the Darpa UGV II Program- Mars Rover Prototype- Prototypes of Urban Reconnaissance Robots
[Stentz and Hebert, 1995]
SA3 - 52 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Freespace Assumption = Assumption-Based PlanningPlanning with the Freespace Assumption always moves the roboton a shortest potentially unblocked path to the goal cell.
Thus, it makes assumptions about outcomes of actions that makethe nondeterministic state space deterministic.
goal
start
goal
desiredtrajectory
actualtrajectory
Note: Assume moving to the goal is possible. The state space is safely explorable.Planning with the Freespace Assumption always achieves a gain in information.
Thus, Planning with the Freespace Assumption moves to the goal.
SA3 - 53 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Freespace Assumption - Travel DistanceHere: Planning with the Freespace Assumption always moves therobot on a shortest (potentially unblocked) path to the goal vertex.
start goal
1 2 3
4 5 6
7 8
= edge known to be unblocked
= edge assumed to be unblocked
= a shortest potentially traversable path to the goal
= edge known to be blocked
robot can- move north- move east- move south- move west
SA3 - 54 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Freespace Assumption - Travel DistanceHere: Planning with the Freespace Assumption always moves therobot on a shortest (potentially unblocked) path to the goal vertex.
start goal
= unblocked edge / edge known to be unblocked
= blocked edge / edge known to be blocked
gridworld graph initial knowledge of graph
= edge assumed to be unblocked
start goalstart goal
robot can- move forward- move backward- turn 90 degree left (or right)
SA3 - 55 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Freespace Assumption - Travel Distance
Planning with the Freespace Assumption results in smalltravel distances if the freespace assumption is approxi-mately satisfied, that is, if the obstacle density is small.
However, the travel distances are also small if the freespaceassumption is not satisfied.
SA3 - 56 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
0 200 400 600 800 1000 1200 1400 1600 1800 20000
50
100
150
200
250
300
350
400
Number of vertices
Tra
vel d
ista
nce
Freespace Assumption - Travel Distance
number of vertices
trav
el d
ista
nce
SA3 - 57 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
The worst-case number of movements ofPlanning with the Freespace Assumptionis and , where is thenumber vertices of the graph, even forundirected planar graphs.
Ω slogsloglog
------------------s( ) O s3 2⁄( ) s
Theorem:[Koenig, Tovey, Smirnov, 2001]*
* we also have even better bounds
Freespace Assumption - Travel Distance
start
goal
Here: Planning with the Freespace Assumption always moves therobot on a shortest (potentially unblocked) path to the goal vertex.
SA3 - 58 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Part 3
Fast Replanningfor Greedy On-line Planning
SA3 - 59 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
3
4
4 4...
1 1111
21 1 1
122
23
1 1 1
332 2
1 1 12 23 3 3
11
222
333
33 3
3
Greedy Mapping - ImplementationGreedy Mapping always moves the robot on a shortest path to theclosestunobserved(or unvisited) cell.
we assume here that the robot can move in eight directions
SA3 - 60 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
32101
123
2223
333345
3
4
4 4...
5 4445
5
Freespace Assumption - Implementation
32101
123
2223
533345
5 4455
5
32101
123
2223
333345
5 4445
5
32101
123
2223
533345
5 4455
5
32101
123
2223
533345
5 4455
5
goal
Planning with the Freespace Assumption always moves the roboton a shortest potentially unblocked path to the goal cell.
we assume here that the robot can move in eight directions
SA3 - 61 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
original eight-connected gridworld
Path Planning - Examplewe assume here that the robot can move in eight directions
sstart sgoal
SA3 - 62 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
changed eight-connected gridworld
Path Planning - Examplewe assume here that the robot can move in eight directions
sstart sgoal
SA3 - 63 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
original eight-connected gridworld
Path Planning - Examplewe assume here that the robot can move in eight directions
1223
34
6
45
56
5
7 6
56
6 66 6
8 8
7 7 76 7
78
14159
8
12 11 10 10
7 866788
9
899
111111121212
9
10101010
1818
1111
10
12
14
14
1413
12 14
1414
15
151516161515
16
17
16
1516 1716
181718
18
1 23 4
5 6 71010 11
8
3 5 714
sstart sgoal128
55
77
8
3
33
4
66 6
7777
9
7
3
4
4
5
5
9
5
5 56
6
6
6
78 9
89
8 9
9
12
12
1111
11
99
10
13131313
12 131314
131516
1616
16
1717
11
13
13
18
16
16
SA3 - 64 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
changed eight-connected gridworld
Path Planning - Examplewe assume here that the robot can move in eight directions
1223
4
8
45
99
10
7 6
56
6 66 6 6
10 9
7 7 76 7
79
141510
8
12 11 10 10
7 866788
9
899
111111121212
10
10101010
1919
1111
10
12
15
15
1413
12 14
1414
16
161617161515
17
17
17
1617 1817171818
1918
1 23 4
5 6 71112 12
8
3 5 715
sstart 138
55
77
8
3
33
10
108 6
7889
10
7
3
4
4
5
5
9
5
5 56
6
6
6
9 9
89
8
10
12
12
1111
11
99
10
13131313
13 141415
141617
1716
16
1818
11
13
13
18
16
17
13
sgoal
SA3 - 65 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
heuristic incrementalsearchsearch
how to search efficientlyusing heuristic to guide the search
how to search efficientlyby reusing information
Path Planning - Example
from previous searches
Artificial Intelligence Algorithm Theory
SA3 - 66 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
uninformed search
Breadth-First Search
DynamicSWSF-FP
com
plet
e se
arch
incr
emen
tal s
earc
h
heuristic search
A*
Lifelong Planning A*
[Ramalingam, Reps, 1996]
[Hart, Nilsson, Raphael, 1968]
with early termination (our addition)
Path Planning - Lifelong Planning A*
SA3 - 67 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
uninformed search
com
plet
e se
arch
heuristic search
Lifelong Planning A*
original eight-connected gridworldPath Planning - Experimental Evaluation
incr
emen
tal s
earc
h
sgoalsstartsgoalsstart
sgoalsstart sgoalsstart
SA3 - 68 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
uninformed search
com
plet
e se
arch
heuristic search
Lifelong Planning A*
changed eight-connected gridworldPath Planning - Experimental Evaluation
incr
emen
tal s
earc
hsstart
sstart
sstart sgoal
sgoal
sstart sgoal
sgoal
SA3 - 69 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
ve =va =hp =
ve =va =hp =
ve=va=hp=
ve=va=hp=
1331.726207.25985.3
173.05697.4956.2
284.06177.31697.3
25.61235.9240.1
+/-+/-+/-
+/-+/-+/-
+/-+/-+/-
4.484.019.7
4.9167.026.6
+/-+/-+/-
5.9129.339.9
2.075.016.9
uninformed search
com
plet
e se
arch
heuristic search
Lifelong Planning A*
changed eight-connected gridworld - first implementationPath Planning - Experimental Evaluation
incr
emen
tal s
earc
h
ve = vertex expansions, va = vertex accesses, hp = heap percolates
(with the same tie-breaking as LPA*)
SA3 - 70 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
ve =
hp =
ve =
hp =
ve=
hp=
ve=
hp=
801.76
2359.60
115.95
561.48
172.20
724.60
18.80
182.15
uninformed search
com
plet
e se
arch
heuristic search
Lifelong Planning A*
changed eight-connected gridworld - second implementationPath Planning - Experimental Evaluation
incr
emen
tal s
earc
h
ve = vertex expansions, hp = heap percolates
(with the same tie-breaking as LPA*)
SA3 - 71 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Path Planning - Experimental Evaluation
ve=hp=t1=
ve=hp=t1=
68.17547.7213.62
18.80182.15
6.66
heuristic search
Lifelong Planning A*
ve = vertex expansions, hp = heap percolates, t1 = time in main search routine,
(with better tie-breaking than LPA*)
t2= 18.61
t2= 13.22
tie-b
reak
ing
mat
ters
A*
expa
nds
node
s fa
ster
than
LPA
*
time
spee
dup
=x1
.5 in
the
long
run
after the third replanning episode,the total planning time of LPA* over all episodes is less than that of A*
t2= total runtime (including maze generation etc.)
SA3 - 72 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
procedure CalculateKey(s)return [min(g(s), rhs(s)) + h(s,sgoal); min(g(s), rhs(s))];procedure Initialize()U = Ø;for all s∈ S rhs(s) = g(s) =∞rhs(sstart) = 0;U.Insert(sstart, CalculateKey(sstart)];procedure UpdateVertex(u)if (u ≠ sstart) rhs(u) = mins’ ∈ Pred(u)(g(s’)+c(s’,u));if (u ∈ U) U.Remove(u);if (g(u) ≠ rhs(u)) U.Insert(u, CalculateKey(u));
procedure ComputeShortestPath()while (U.TopKey < CalculateKey(sgoal) OR rhs(sgoal) ≠ g(sgoal))
u = U.Pop();if (g(u) > rhs(u))
g(u) = rhs(u);for all s∈ Succ(u) UpdateVertex(s);
elseg(u) =∞;for all s∈ Succ(u)∪ u UpdateVertex(s);
procedure Main()Initialize();forever
ComputeShortestPath();Wait for changes in edge costs;for all directed edges (u, v) with changed edge costs
Update the edge cost c(u,v);UpdateVertex(v);
Path Planning - Lifelong Planning A*
This version of LPA* can beoptimized further without changing
We also have versions of LPA* that- break ties differently- work with inconsistent heuristics
its overall operation.
U.TopKey() returns the smallest priorityof all vertices in the priority queue U.If U is empty, then U.TopKey() returns[∞; ∞]. U.Pop() deletes the vertex with thesmallest priority in priority queue U andreturns the vertex. U.Insert(s,k) insertsvertex s into priority queue U withpriority k. Finally, U.Remove(s) removesvertex s from priority queue U.
The heuristics need to be nonnegative and(forward) consistent:
for all vertices s∈ S and s’∈ Succ(s).and h(s,sgoal) ≤ c(s,s’) + h(s’,sgoal)h(sgoal,sgoal) = 0
- terminate earlier- contain several runtime optimizations.
[Koenig, Likhachev, 2001]
SA3 - 73 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Lifelong Planning A*- applies to the same finite search problems as A*
- produces the same (optimal) solution as A*- handles arbitrary edge cost changes
- is algorithmically very similar to A*- is more efficient than A* in many situations
- applies to- route planning problems (traffic, networking, ...)- robot control- symbolic artificial intelligence planning
- has nice theoretical properties
- ...
Path Planning - Lifelong Planning A*
SA3 - 74 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
start0 11
1
22
2
3 3
4 4
35 54 4
goal
Path Planning - Lifelong Planning A*
3
4
5
6
A
B
C
D
1 2 4 5 6
SA3 - 75 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
2
3
4
Path Planning - Lifelong Planning A*
3
4
5
3+1
5+1
minimum 4
A
B
C
...
...
...
g-valuerhs-value
g-value = rhs-value:
g-value > rhs-value:g-value < rhs-value:
cell is locally consistent
cell is locally overconsistentcell is locally underconsistent
the priority queue contains exactly the locally inconsistent vertices s
g-value ≠ rhs-value: cell is locally inconsistent
their priority is [min(g(s),rhs(s))+h(s,sgoal); min(g(s),rhs(s))]smaller priorities first, according to a lexicographic ordering
SA3 - 76 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
start0 11
1
22
2
3 3
4 4
35 54 4
goal
Path Planning - Lifelong Planning A*
3
4
5
6
A
B
C
D
1 2 4 5 6
SA3 - 77 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
start0 11
1
22
2
3 3
4 4
35 54 4
goal
Path Planning - Lifelong Planning A*
3
4
5
6min(2,4)+2min(2,4)
A
B
C
D
1 2 4 5 6
priority queueC3: [4;2]
SA3 - 78 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
start0 11
1
22
∞3 3
4 4
35 54 4
goal
Path Planning - Lifelong Planning A*
3
4
5
6min(∞,4)+2min(∞,4)
min(3,5)+1min(3,5)
A
B
C
D
1 2 4 5 6
priority queueD3: [4;3]; C3: [6;4]
SA3 - 79 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
start0 11
1
22
∞3 3
4 4
∞5 54 4
goal
Path Planning - Lifelong Planning A*
3
4
5
6min(∞,5)+1min(∞,5)
min(4,6)+0min(4,6)
4min(4,6)+2min(4,6)
A
B
C
D
1 2 4 5 6
priority queueD2: [4;4]; D4: [6;4]; D3: [6;5]
SA3 - 80 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
start0 11
1
22
∞3 3
4 4
∞5 5∞ 4
goal
Path Planning - Lifelong Planning A*
3
4
5
6min(∞,5)+1min(∞,5)
min(∞,6)+0min(∞,6)
min(4,6)+2min(4,6)
A
B
C
D
1 2 4 5 6
priority queueD4: [6;4]; D3: [6;5]; D2: [6;6]
SA3 - 81 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
start0 11
1
22
∞3 3
4 4
∞5 5∞ ∞goal
Path Planning - Lifelong Planning A*
3
4
5
6min(∞,6)+0min(∞,6)
min(∞,6)+2min(∞,6)
min(5,7)+3min(5,7)
A
B
C
D
1 2 4 5 6
priority queueD2: [6;6]; D5: [8;5]; D4: [8;6]
SA3 - 82 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
start0 11
1
22
∞3 3
4 4
∞5 56 ∞goal
Path Planning - Lifelong Planning A*
3
4
5
6min(∞,6)+2min(∞,6)
min(5,7)+3min(5,7)
min(∞,7)+1min(∞,7)
A
B
C
D
1 2 4 5 6
priority queueD5: [8;5]; D4: [8;6]; D3: [8;7]
SA3 - 83 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
start0 11
1
22
∞3 3
4 4
∞5 56 ∞goal
Path Planning - Lifelong Planning A*
3
4
5
6min(∞,6)+2min(∞,6)
min(5,7)+3min(5,7)
min(∞,7)+1min(∞,7)
A
B
C
D
1 2 4 5 6
priority queueD5: [8;5]; D4: [8;6]; D3: [8;7]
SA3 - 84 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
ComputeShortestPath() expands every vertex at most twice andthus terminates.
Theorem:[Likhachev and Koenig, 2001]
Path Planning - Lifelong Planning A*
After ComputeShortestPath() terminates, one can trace back ashortest path from the start to the goal by always moving from thecurrent vertex s, starting at the goal, to any predecessor s’ that min-imizes g(s’) + c(s’,s) until the start is reached (ties can be brokenarbitrarily).
Theorem:[Likhachev and Koenig, 2001]
SA3 - 85 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
In the worst case, replanning cannot be moreefficient than planning from scratch.[Nebel, Koehler, 1995]
Path Planning - Lifelong Planning A*
old search tree
new search tree
start goal
old search tree
new search tree
start goal
SA3 - 86 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
ComputeShortestPath() does not expand any vertices whose g-val-ues were equal to their respective start distances before Compute-ShortestPath() was called.
Theorem:[Likhachev and Koenig, 2001]
Path Planning - Lifelong Planning A*
= LPA* is efficient because it uses incremental search
ComputeShortestPath() expands at most those vertices s with [f(s);g*(s)] ≤ [f(sstart); g*(sstart)] or [gold(s)+h(s); gold(s)] ≤ [f(sstart);g*(sstart)], where f(s) = g*(s)+h(s) and gold(s) is the g-value of sdirectly before the call to ComputeShortestPath().
Theorem:[Likhachev and Koenig, 2001]
= LPA* is efficient because it uses heuristic search
SA3 - 87 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
The first search of Lifelong Planning A* is the same as that of A*.Afterwards, Lifelong Planning A* operates in a very similar way toA*. (The theorem makes this more concrete. For example, Com-puteShortestPath() expands locally overconsistent vertices withfinite f-values in the same order as A*.)
“Theorem:”[Likhachev and Koenig, 2001]
Path Planning - Lifelong Planning A*
SA3 - 88 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
32101
123
2223
333345
3
4
4 4...
5 4445
5
Freespace Assumption - Implementation
32101
123
2223
533345
5 4455
5
32101
123
2223
333345
5 4445
5
32101
123
2223
533345
5 4455
5
32101
123
2223
533345
5 4455
5
goal
Planning with the Freespace Assumption always moves the roboton a shortest potentially unblocked path to the goal cell.
we assume here that the robot can move in eight directions
SA3 - 89 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
goalrobot
Transforming Planning with the Freespace
here: search from the goal location towards the robot location- allows one to reuse parts of the search tree after the robot has moved- allows one to use heuristics to focus the search
(this additional argument holds for Greedy Mapping later)
Assumption to Path Planning
sstartsgoal
h(sstart,s) =g(s) =
approximation of the distance from the robot to vertex sapproximation of the goal distance of vertex s
SA3 - 90 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Transforming Planning with the Freespace
here: search from the goal location towards the robot location- makes incremental search efficient
Assumption to Path Planning
old search tree
new search tree
robot sstart goal sgoal
SA3 - 91 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
procedure CalculateKey(s)return [min(g(s), rhs(s)) + h(sstart, s); min(g(s), rhs(s))];procedure Initialize()U = Ø;for all s∈ S rhs(s) = g(s) =∞rhs(sgoal) = 0;U.Insert(sgoal, CalculateKey(sgoal);procedure UpdateVertex(u)if (u ≠ sgoal) rhs(u) = mins’ ∈ Succ(u)(c(u,s’)+g(s’));if (u ∈ U) U.Remove(u);if (g(u) ≠ rhs(u)) U.Insert(u, CalculateKey(u));
procedure ComputeShortestPath()while (U.TopKey < CalculateKey(sstart) OR rhs(sstart) ≠ g(sstart))
u = U.Pop();if (g(u) > rhs(u))
g(u) = rhs(u);for all s∈ Pred(u) UpdateVertex(s);
elseg(u) =∞;for all s∈ Pred(u)∪ u UpdateVertex(s);
procedure Main()Initialize();ComputeShortestPath();while (sstart≠ sgoal)
/* if (g(sstart) = ∞) then there is no known path */sstart = arg mins’ ∈Succ(sstart) (c(sstart,s’)+g(s’))Move to sstart;Scan graph for changed edge costs;
Freespace Assumption - D* Lite (Basic Version)
U.TopKey() returns the smallest priorityof all vertices in the priority queue U.If U is empty, then U.TopKey() returns[∞; ∞]. U.Pop() deletes the vertex with thesmallest priority in priority queue U andreturns the vertex. U.Insert(s,k) insertsvertex s into priority queue U withpriority k. Finally, U.Remove(s) removesvertex s from priority queue U.
if any edge costs changedfor all directed edges (u,v) with changed edge costs
Update the edge cost c(u,v);UpdateVertex(u);
for all s∈ UU.Update(s, CalculateKey(s));
ComputeShortestPath();
The heuristics need to be nonnegative andbackward consistent
for all vertices s∈ S and s’∈ Pred(s).and h(sstart,s)≤ h(sstart, s’)+c(s’,s)h(sstart,sstart) = 0no matter what the start vertex is:
[Koenig, Likhachev, 2002]
SA3 - 92 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
When the robot moves, the goal of the search (sstart) moves.This influences the priorities of the vertices in the priority queue
(but not which vertices are in the priority queue).
vertex s is locally inconsistent iffvertex s is in the priority queue
with priority [min(g(s),rhs(s))+h(soldstart,s); min(g(s),rhs(s))].
This value changes when the robot moves from soldstart to snewstart.Thus, one needs to reorder the priority queue.[Stentz, 1994]
Freespace Assumption - D* Lite (Basic Version)
h(snewstart,s)
Idea
SA3 - 93 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
priority queue A: [8;5]; B: [8;6]; C: [8;7]
priority queue C: [7;7]; B: [8;6]; A: [9;5]
Freespace Assumption - D* Lite (Basic Version)Fictitious Example
SA3 - 94 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
procedure CalculateKey(s)return [min(g(s), rhs(s)) + h(sstart, s) + km; min(g(s), rhs(s))];procedure Initialize()U = Ø;km = 0;for all s∈ S rhs(s) = g(s) =∞rhs(sgoal) = 0;U.Insert(sgoal, CalculateKey(sgoal);procedure UpdateVertex(u)if (u ≠ sgoal) rhs(u) = mins’ in Succ(u)(c(u,s’)+g(s’));if (u ∈ U) U.Remove(u);if (g(u) ≠ rhs(u)) U.Insert(u, CalculateKey(u));
procedure ComputeShortestPath()while (U.TopKey < CalculateKey(sstart) OR rhs(sstart) ≠ g(sstart))
kold = U.TopKey();u = U.Pop();if (kold < CalculateKey(u))
U.Insert(u, CalculateKey(u));else if (g(u) > rhs(u))
g(u) = rhs(u);for all s∈ Pred(u) UpdateVertex(s);
elseg(u) =∞;for all s∈ Pred(u)∪ u UpdateVertex(s);
procedure Main()slast = sstart;Initialize();ComputeShortestPath();
Freespace Assumption - D* Lite (Final Version)
while (sstart≠ sgoal)/* if (g(sstart) = ∞) then there is no known path */sstart = arg mins’∈Succ(sstart) (c(sstart,s’)+g(s’))Move to sstart;Scan graph for changed edge costs;if any edge costs changed
km = km + h(slast,sstart);slast = sstart;for all directed edges (u,v) with changed edge costs
Update the edge cost c(u,v);UpdateVertex(u);
ComputeShortestPath();
The heuristics need to be nonnegative andforward-backward consistent:
for all vertices s,s’,s’’∈ S.h(s,s’’) ≤ h(s,s’)+h(s’,s’’)
The heuristics also need to be admissibleno matter what the goal vertex is:h(s,s’)≤ shortest distance from s to s’for all vertices s,s’∈ S.
[Koenig, Likhachev, 2002]
ss’
s’’
triangle inequality
SA3 - 95 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Freespace Assumption - D* Lite (Final Version)
Reordering the priority queue is time consuming.
vertex s is locally inconsistent iffvertex s is in the priority queue
with priority [min(g(s),rhs(s))+h(soldstart,s); min(g(s),rhs(s))].
[Stentz, 1995]
We use lower bounds on the new priorities instead of the new priorities themselves.[min(g(s),rhs(s))+h(soldstart,s); min(g(s),rhs(s))]
≤ [min(g(s),rhs(s))+h(soldstart,snewstart)+h(snewstart,s); min(g(s),rhs(s))][min(g(s),rhs(s))+h(soldstart,s)-h(soldstart,snewstart); min(g(s),rhs(s))]
≤ [min(g(s),rhs(s))+h(snewstart,s); min(g(s),rhs(s))]The term h(soldstart,snewstart) is the same across vertices in the priority queue.Instead of deletingit from the all vertices in the priority queue,we addit to the vertices added to the priority queue in the future.When ComputeShortestPath() selects a vertex for expansion,it checks first whether its priority is correct.If so, it expands the vertex.If it is a lower bound, it calculates the correct priority and reinserts the vertex into the queue.
h(snewstart,s)
Idea[Stentz, 1995]]
SA3 - 96 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Freespace Assumption - D* Lite (Final Version)
priority queue A: [8;5]; B: [8;6]; C: [8;7]add vertex D with priority [10;5]
priority queue A: [6;5]; B: [6;6]; C: [6;7]add vertex D with priority [10;5]
priority queue A: [8;5]; B: [8;6]; C: [8;7]add vertex D with priority [12;5]
priority queue A: [8;5]; B: [8;6]; C: [8;7]
priority queue B: [8;6]; C: [8;7]; A: [9;5]
correct priority is A: [9;5]
correct priority is B: [8;6]
expand B
Fictitious Example
SA3 - 97 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Freespace Assumption - D* Lite
knowledge before the movement sequence of the robot
23
22
3
6
3
5
776
56
5
8
127
1213131378
14
1481414
121212
1314131313
1214
14
18
14
14
1414
14
13
1313
12
1212
4
6
4
333
3345
3
2
4
8
2112
3
5
76
56
43
2
4
8
1
3
6
76
66
66
6
8
1
1312
99
99
99
6
1212
13
14sstart
777
3
7
77
77
77
7
7
8
199
16
11
1312
10
101010
10
11
712
1010
1010
10
10
14
4111111
11
11
712
1111
1111
11
11
15
5
612
3
5
76
56
43
2
4
8
1112
3
5
76
56
43
2
4
8
1222
3
5
76
56
43
2
4
8
2333
3
5
76
56
43
3
4
8
3444
4
5
76
56
44
4
4
8
4555
5
5
76
56
55
5
5
8
5
6888
3
5
76
88
88
8
4
8
299
sgoal
6
we assume here that the robot can move in eight directions
SA3 - 98 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Freespace Assumption - D* Lite
knowledge after the movement sequence of the robot
23
22
3
6
3
5
776
56
5
819
157
14141415720
8
15
1581414
121212
1314131313
1214
14
21
14
14
1515
15
13
1313
12
1212
4
6
4
333
3345
3
2
4
8
2112
3
5
76
56
43
2
4
8
1
6
3
6
76
66
66
6
8
1
1413
6
99
14
1615
99
99
13
17
101010
12
14
715
1010
1010
11
13
17
4111111
12
14
715
1111
1111
12
13
18
56
1313
14
12
3
5
76
56
43
2
4
8
1112
3
5
76
56
43
2
4
8
1222
3
5
76
56
43
2
4
8
2333
3
5
76
56
43
3
4
8
3444
4
5
76
56
44
4
4
8
4555
5
5
76
56
55
5
5
8
5
6
sstart
sgoal777
3
7
77
77
77
7
7
8
19
1110
888
3
5
76
88
88
2
4
8
26
we assume here that the robot can move in eight directions
SA3 - 99 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Freespace Assumption - D* Lite
uninformed search
com
plet
e se
arch
heuristic search
D* Lite (Lifelong Planning A*)
before the movement sequence of the robot
incr
emen
tal s
earc
h
sgoal
sstart
sgoal
sstart
sgoal
sstart
sgoal
sstart
SA3 - 100 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Freespace Assumption - D* Lite
uninformed search
com
plet
e se
arch
heuristic search
D* Lite (Lifelong Planning A*)
after the movement sequence of the robot
incr
emen
tal s
earc
hsgoal
sstart
sgoal
sstart
sgoal
sstart
sgoal
sstart
SA3 - 101 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Freespace Assumption - D* Lite
A = overhead of D* Lite without incremental Search (A*)
va
ve
hp
B = overhead of D* Lite without heuristic searchSA3 - 102 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Freespace Assumption - D* Lite
overhead of Focussed D* =
va
ve
hp
[Stentz, 1995]
probably the first truly incremental heuristic search method(note: Focussed D* is likely a bit faster than D* Lite per vertex expansion)
SA3 - 103 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
3
4
4 4...
1 1111
21 1 1
122
23
1 1 1
332 2
1 1 12 23 3 3
11
222
333
33 3
3
Greedy Mapping - ImplementationGreedy Mapping always moves the robot on a shortest path to clos-estunobserved(or unvisited) cell.
we assume here that the robot can move in eight directions
SA3 - 104 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
RR
new vertex
Transforming Greedy Mapping toPlanning with the Freespace Assumption
= goal vertex
RR
SA3 - 105 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
knowledge before the movement sequence of the robot
Greedy Mapping - D* Lite
000
3
0
5
176
52
1
216
1616
161717171717
18
18
18181818
161616
1718171717
1618
18
18
18
18
1818
18
17
1717
16
1616
4
2
0
0
3
0
0
2
0112
3
5
76
53
43
2
4
3
1
106
1010
106
10
1716
1413
13
1313
1313
1313
13
13
151414
14
14
1414
1414
1414
15
14
14
16151515
15
15
1515
1515
1515
15
15
15
161616
17
5
45
44
53
4
5
55
55
53
5
5
66
66
63
6
5
77
77
73
7
5
88
88
84
8
5
99
99
95
9
sstart
777
3
7
11
1111
117
7
7
11
115
1516
151413
14
12
1212
1212
1212
15
13
12
1617
14
1011
0
0
112
32
4
1∞∞∞
∞∞
∞
∞∞∞∞
∞∞
∞
∞∞∞∞
∞∞
∞
∞∞∞∞
∞∞
∞
∞∞∞∞
∞∞
∞
∞∞∞∞
∞∞
∞
∞∞∞∞
∞∞
∞
∞
we assume here that the robot can move in eight directions
SA3 - 106 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
knowledge after the movement sequence of the robot
Greedy Mapping - D* Lite
16
1616
161717171717
18
18
18181818
192021
1919202122
2322
20
18
23
21
1818
18
23
1817
22
1718
0
0
112
3
5
543
2
4
1
6
1716
171819
14
14
1414
2223
2120
15
14
14
16171819
15
15
1515
2223
2120
15
15
15
161616
17
5
3
5
3
5
3
5
3
5
4
5
5
sstart
1617
2223
212223
222323
232424
242525
252626
262727
272828
28
29
5
76
530
43
2
3
31
32
12
76
31010
1045
455
566
677
788
899
91111
11
1819
13
1313
2223
2120
13
13
777
3
7
7
7
117
1514
16171819
14
12
1212
2223
2120
15
13
12
16
7
∞∞∞
∞∞
∞
∞∞∞∞
∞∞
∞
∞∞∞∞
∞∞
∞
∞∞∞∞
∞∞
∞
∞∞∞∞
∞∞
∞
∞∞∞∞
∞∞
∞
∞∞∞∞
∞∞
∞
∞
we assume here that the robot can move in eight directions
SA3 - 107 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
uninformed search
com
plet
e se
arch
heuristic search
D* Lite (Lifelong Planning A*)
before the movement sequence of the robot
sstart sstart
sstart sstart
Greedy Mapping - D* Lite
incr
emen
tal s
earc
h
SA3 - 108 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
uninformed search
com
plet
e se
arch
heuristic search
D* Lite (Lifelong Planning A*)
after the movement sequence of the robot
sstart sstart
sstart sstart
Greedy Mapping - D* Lite
incr
emen
tal s
earc
h
SA3 - 109 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy Mapping - D* Lite
A = overhead of D* Lite without incremental Search (A*)
va
ve
hp
B = overhead of D* Lite without heuristic searchSA3 - 110 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
overhead of Focussed D* =
Greedy Mapping - D* Lite
va
ve
hp
[Stentz, 1995]
probably the first truly incremental heuristic search method(note: Focussed D* is likely a bit faster than D* Lite per vertex expansion)
SA3 - 111 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
- world changes over time
- what-if analyses- model of the world changes over time
emergency management
planning task 1slightly differentplanning task 2
slightly differentplanning task 3
...
Other Examples of Lifelong Planning
replanning (and plan reuse) is important!
SA3 - 112 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Other Examples of Lifelong Planning
- symbolic planning (with HSP)- continual planning- one-time planning
- mobile robotics- mapping- goal-directed navigation in unknown terrain
- computer games
- reinforcement learning and on-line dynamic programming- control (with the Parti-Game algorithm)
- route planning- in traffic networks- in computer networks
SA3 - 113 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Game Playing
Total Annihilation
SA3 - 114 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
- plan adaptation- repair-based planning- learning search control knowledge- case-based planning
- lifelong planning
- transformational planning- iterative repair methods in scheduling
CHEF, GORDIUS, LS-ADJUST-PLAN, MRL,NoLimit, PLEXUS, PRIAR, SPA...
plan quality of replanning is as good asplan quality of planning from scratch
plan quality of replanning is usually worse thanplan quality of planning from scratch
SHERPA
Symbolic Planning (with HSP) - Continual Planning
SA3 - 115 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
start goal
STRIPS-type planning in the elevator domain
Operators:
• The elevator moves from floor fi to floor fj with i ≠ j.• Person pk boards the elevator on floor fi provided that the elevator
is currently on floor fi and floor fi is the origin of person pk.• Person pk gets off the elevator on floor fi, provided that person pk
is in the elevator, the elevator is currently on floor fi, and floor ri isthe destination of person pk.
Symbolic Planning (with HSP) - Continual Planning
SA3 - 116 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
planning problem 1 planning problem 2 planning problem 3
SHERPASpeedy HEuristic search-based RePlAnner
[S. Koenig, D. Furcy, C. Bauer, 2002]
...
...
Symbolic Planning (with HSP) - Continual Planning
note: in the following, we consider only finding shortest plans
SA3 - 117 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
first search in theelevator domain
similar to HSP 2.0
[Bonet, Geffner, 2001]
with the hmax heuristic
g=5
h=0
rhs=5
g=1
h=3
rhs=1
g=2
h=2
rhs=2
g=2
h=2
rhs=2
g=1
h=2
rhs=1
g=3
h=2
rhs=3
g=∞
h=3
rhs=3
g=3
h=2
rhs=3
g=3
h=2
rhs=3
g=0
h=3
rhs=0
GOAL
9
876
54
3 2
1
g=∞
h=3
rhs=3
[6;3][6;3]
g=∞
h=2
rhs=4
[6;4]
g=∞
h=2
rhs=4
[6;4]
g=4
h=1
rhs=4
10
groundoperator
deleted afterthe search
generatedvertex
expandedvertex
iorder ofvertex
expansion
start
goal
Symbolic Planning (with HSP) - Continual Planning
using SHERPA
SA3 - 118 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
second search in the
using SHERPA from scratchelevator domain
similar to HSP 2.0with the hmax heuristic[Bonet, Geffner, 2001]
GOAL
12
11
76
9
54
3 2
1
10
8
g=0
h=3
rhs=0
g=1
h=3
rhs=1
g=1
h=2
rhs=1
g=2
h=2
rhs=2
g=2
h=2
rhs=2
g=3
h=2
rhs=3
g=3
h=2
rhs=3
g=3
h=3
rhs=3
g=4
h=2
rhs=4
g=4
h=2
rhs=4
g=6
h=0
rhs=6
g=5
h=1
rhs=5
g=∞
h=2
rhs=5
[7;5]
g=∞
h=3
rhs=4
[7;4]
generatedvertex
expandedvertex
iorder ofvertex
expansion
Symbolic Planning (with HSP) - Continual Planning
SA3 - 119 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
second search in theelevator domainusing SHERPA
GOAL
2
15
4
g=0
h=3
rhs=0
g=1
h=3
rhs=1
g=1
h=2
rhs=1
g=2
h=2
rhs=2
g=2
h=2
rhs=2
g=3
h=2
rhs=3
g=3
h=2
rhs=3
g=3
h=3
rhs=3
g=4
h=2
rhs=4
g=4
h=2
rhs=4
g=6
h=0
rhs=6
g=5
h=1
rhs=5
g=∞
h=2
rhs=5
[7;5]
g=∞
h=3
rhs=4
[7;4]
6
7
3 8
untouchedvertex
generatedvertex
expandedvertex
iorder ofvertex
expansion
Symbolic Planning (with HSP) - Continual Planning
SA3 - 120 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
savi
ngs
perc
enta
genumber of people
ve for elevator (5 floors)
80%
SHERPA achieves speedups up to 80 percent
planning from scratch with SHERPA
Symbolic Planning (with HSP) - Continual Planning
SA3 - 121 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
old search tree
new search tree
start goal
old search tree
new search tree
start goal
Symbolic Planning (with HSP) - Continual Planning
SA3 - 122 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
number of edges between the deleted edge in the plan and the goal state
savi
ngs
perc
enta
ge
ve for blocksworld
speedupbecomes negative
Symbolic Planning (with HSP) - Continual Planning
SA3 - 123 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
number of deleted ground operators
savi
ngs
perc
enta
ge
ve for blocksworld
Symbolic Planning (with HSP) - Continual Planning
SA3 - 124 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
planning problem 1 planning problem 2 planning problem 3
PINCHPrioritized, INCremental Heuristics calculation
...calcu-lateheu-risticvalue
1
calcu-lateheu-risticvalue
n
calcu-lateheu-risticvaluen-1
calcu-lateheu-risticvalue
2
calcu-lateheu-risticvalue
7
calcu-lateheu-risticvalue
6
calcu-lateheu-risticvalue
5
calcu-lateheu-risticvalue
4
calcu-lateheu-risticvalue
3
tens of thousands of calculations of heuristic values for each planning problem
[Liu, Koenig, Furcy, 2002]
...
...
Symbolic Planning (with HSP) - One-Time Planning
SA3 - 125 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
PINCHPrioritized, INCremental Heuristics calculation
hadd(state) =
gstate(proposition) = 0 if proposition in state
minoperator with proposition in add list(1 + gstate(operator)) otherwise
gstate(operator) =
Σproposition in goal state gstate(proposition)
Σproposition on precondition list of operatorgstate(proposition)
g=0
h=3
rhs=0
g=1
h=3
rhs=1
g=1
h=2
rhs=1
order of state expansions
here: for HSP 2.0 with the hadd heuristic[Bonet, Geffner, 2001]
Symbolic Planning (with HSP) - One-Time Planning
SA3 - 126 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
PINCH achieves speedups up to (another!) 80 percent.
problem size
savi
ngs
perc
enta
ge
method used by HSP 2.0 (value iteration)
generalized Bellman Ford
PINCH
80%
Symbolic Planning (with HSP) - One-Time Planning
SA3 - 127 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Reinforcement Learning and On-Line DP
while there exists at least one state with g(s) = rhs(s)pick a state s with g(s) = rhs(s) and then set g(s) := rhs(s)
Prioritized Sweeping[Moore and Atkeson; 1993]
Minimax LPA*- chooses the g-value of which state to update- updates the g-value of the chosen state in a particular way
- chooses the g-value of which state to update- updates the g-value of the chosen state in a particular way- minimizes the expected orworst-case plan-execution cost for MDPs
- minimizes the worst-case plan-execution cost for MDPs- uses heuristics to focus the search- terminates immediate once a shortest path is found
SA3 - 128 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Reinforcement Learning and On-Line DPPrioritized Sweeping[Moore and Atkeson; 1993]
s1
g=102∆=0
s3
g=101∆=2
s2
g=103∆=0
10
2
1
1
g=100∆=0s4
g=99∆=0s5
1
s1
g=102∆=2
s3
g=103∆=0
s2
g=103∆=2
10
2
1
1
g=100∆=0s4
g=99∆=0s5
1
s1
g=104∆=0
s3
g=103∆=2
s2
g=103∆=2
10
2
1
1
g=100∆=0s4
g=99∆=0s5
1
s1
g=104∆=2
s3
g=103∆=2
s2
g=105∆=0
10
2
1
1
g=100∆=2s4
g=99∆=0s5
1
s1
g=104∆=2
s3
g=105∆=0
s2
g=105∆=2
10
2
1
1
g=100∆=2s4
g=99∆=0s5
1
s1
g=106∆=0
s3
g=105∆=2
s2
g=105∆=2
10
2
1
1
g=100∆=2s4
g=99∆=0s5
1
1
and so on, for a total of 22 g-value updates. Minimax LPA* needs only 6.Note: Minimax LPA* expands every state at most twice.
SA3 - 129 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Control (with the Parti-Game algorithm)
state spaces of control problems areoften continuous and sometimes high-dimensional
might not be able to find a plan is very inefficientcoarse-grained discretization fine-grained discretization
SA3 - 130 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Control (with the Parti-Game algorithm)
avoids these problemsnonuniform discretization
Parti-Game algorithm [Moore and Atkeson; 1995]
SA3 - 131 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Control (with the Parti-Game algorithm)
goal
goal
goal
goal
goal
goal
goal
goal
goal
goal
here: using a deterministic state space for illustration
SA3 - 132 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
At1
t2
t0
A
A
O1
O2
O3
Control (with the Parti-Game algorithm)
the state space is really nondeterministicwe thus use Minimax LPA* instead of LPA*
SA3 - 133 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Control (with the Parti-Game algorithm)
Implementation
Uninformed Search from ScratchInformed Search from ScratchUninformed Incremental SearchInformed Incremental Search (Minimax LPA*)
Planning Time
362 minutes135 minutes14 minutes13 minutes
55 seconds15 seconds53 seconds53 seconds
terrains of size 2000 x 2000
SA3 - 134 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
References (in order of their appearance)
S. Koenig, Agent-Centered Search, Artificial Intelligence Magazine, 22(4), 2001, 109-131.I. Nourbakhsh, Interleaving Planning and Execution for Autonomous Robots, Kluwer, 1997.H. Choset, J. Burdick, Sensor-based planning and nonsmooth analysis. In Proceedings of the International Confer-ence on Robotics and Automation, 1994, 3034-3041.C. Tovey and S. Koenig, Gridworlds as Testbeds for Planning with Incomplete Information, Proceedings of theNational Conference on Artificial Intelligence, 819-824, 2000.G. Dudek, K. Romanik, S. Whitesides, Localizing a robot with minimum travel, In Proceedings of the ACM-SIAMSymposium on Discrete Algorithms, 437-446, 1995.C. Lund, M. Yannakakis, On the hardness of approximating minimization problems, Journal of the ACM, 41:960-981, 1994.M. Genesereth and I. Nourbakhsh, Time-saving tips for problem solving with incomplete information, In Proceed-ings of the National Conference on Artificial Intlligence, 1993, 724-730.S. Koenig and R. Simmons, Solving robot navigation problems with initial pose uncertainty using real-time heuris-tic search, In Proceedings of the International Conference on Artificial Intelligence Planning Systems, 1998, 145-153.R. Simmons, S. Koenig, Probabilistic Robot Navigation in Partially Observable Environments, Proceedings of theInternational Joint Conference on Artificial Intelligence, 1993, 99-105.S. Koenig and R. Simmons, Xavier: A Robot Navigation Architecture Based on Partially Observable Markov Deci-sion Process Models, In: Artificial Intelligence Based Mobile Robots: Case Studies of Successful Robot Systems,D. Kortenkamp, R. Bonasso, R. Murphy (Eds.), MIT Press, 1998.S. Thrun, Probabilistic Algorithms in Robotics, Artificial Intelligence Magazine, 21(4), 2000, 93-109.W. Burgard, D. Fox, S. Thrun, Active Mobile Robot Localization, Proceedings of the International Joint Conferenceon Artificial Intelligence, 1997.R. Schapire, The Design and Analysis of Efficient Learning Algorithms, MIT Press, 1992.C. Papadimitriou and J. Tsitsiklis, The complexity of Markov decision processes, Mathematics of OperationsResearch 12(3), 1987, 441-450.
SA3 - 135 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
References(in order of their appearance)
S. Koenig, C. Tovey, W. Halliburton, Greedy Mapping of Terrain, Proceedings of the International Conference onRobotics and Automation, 2001, 3594-3599.S. Thrun, A. Buecken, W. Burgard, D. Fox, T. Froehlinghaus, D. Hennig, T. Hofmann, M. Krell, T. Schmidt, Maplearning and high-speed navigation in RHINO, In : Artificial Intelligence Based Mobile Robotics: Case Studies ofSuccessful Robot Systems, D. Kortenkamp, R. Bonasso, R. Murphy (Eds.), MIT Press, 1998, 21-52.L. Romero, E. Morales, E. Sucar, An exploration and navigation approach for indoor mobile robots considering sen-sor’s perceptual limitations, Proceedings of the International Conference on Robotics and Automation, 2001, 3092-3097.D. Mackenzie, R. Arkin, J. Cameron, Multiagent mission specification and execution, Autonomous Robots, 4(1),1997, 29-57.S. Koenig, C. Tovey, Y. Smirnov, Performance Bounds for Planning in Unknown Terrain, 2001.B. Brumitt, A. Stentz, GRAMMPS: a generalized mission planner for multiple mobile robots. In Proceedings of theInternational Conference on Robotics and Automation, 1998.M. Hebert, R. McLachlan, P. Chang, Experiments with driving modes for urban robots, Proceedings of the SPIEMobile Robots, 1999.L. Matthies, Y. Xiong, R. Hogg, D. Zhu, A. Rankin, B. Kennedy, M. Hebert, R. Maclachlan, C. Won, T. Frost, G.Sukhatme, M. McHenry, S. Goldberg, A portable, autonomous, urban reconnaissance robot. Proceedings of theInternational Conference on Intelligent Autonomous Systems, 2000.A. Stentz and M. Hebert, A complete navigation system for goal acquisition in unknown environments. Autono-mous Robots, 2(2), 1995, 127-145.S. Thayer, B. Digney, M. Diaz, A. Stentz, B. Nabbe, M. Hebert, Distributed robotic mapping of extreme environ-ments. In Proceedings of the SPIE: Mobile Robots XV and Telemanipulator and Telepresence Technologies VII,Volume 4195, 2000.P. Hart, N. Nilsson, B. Raphael, A Formal Basis for the Heuristic Determination of Minimum Cost Paths in Graphs,IEEE Transactions on Systems Science and Cybernetics, SSC-4(2), 1968, 100-107.G. Ramalingam, T. Reps, On the computational complexity of dynamic graph problems, Theoretical Computer Sci-ence 158 (1-2), 1996, 233-277.
SA3 - 136 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
References(in order of their appearance)
S. Koenig, M. Likhachev, Incremental A*, Advances in Neural Information Processing Systems, 2001.M. Likhachev, S. Koenig, Lifelong Planning A* and Dynamic A* Lite: The Proofs, 2001.B. Nebel and J. Koehler, Plan reuse versus plan generation: A theoretical and empirical analysis, Artificial Intelli-gence, 76(1-2), 1995, 427-454.A. Stentz, Optimal and Efficient Path Planning for Partially-Known Environments, Proceedings of the InternationalConference on Robotics and Automation, 1994, 3310-3317.S. Koenig, M. Likhachev, D* Lite, Proceedings of the National Conference on Artificial Intelligence, 2002.A. Stentz. The focussed D* algorithm for real-time replanning. In Proceedings of the International Joint Conferenceon Artificial Intelligence, 1652-1659, 1995.S. Koenig, D. Furcy, C. Bauer, Heuristic Search-Based Replanning, Proceedings of the International Conference onArtificial Intelligence Planning Systems, 2002.B. Bonet, H. Geffner, Heuristic Search Planner 2.0, Artificial Intelligence Magazine 22(3), 2001, 77-80.Y. Liu, S. Koenig, D. Furcy, Speeding up the calculation of the heuristics for heuristic search-based planning, Pro-ceedings of the National Conference on Artificial Intelligence, 2002.
SA3 - 137 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy On-Line Planning and Lifelong Planning
Related Work:
K. Hammond. Explaining and repairing plans that fail, Artificial Intelligence 45, 1990, 173-228.R. Simmons. A theory of debugging plans and interpretations, in: Proceedings of the National Conference on Artifi-cial Intelligence, 1988, 94-99.A. Gerevini, I. Serina, Fast plan adaptation through planning graphs: Local and systematic search techniques, in:proceedings of the International Conference on Artificial Intelligence Planning and Scheduling, 2000, 112-121.J. Koehler, Flexible plan reuse in a formal framework, in: C. Baeckstroem, E. Sandewall (Eds.), Current Trends inAI Planning, IOS Press, 1994, 171-184.M. Veloso, Planning and Learning by Analogical Reasoning, Springer, 1994.R. Alterman, Adaptive Planning, Cognitive Science 12(3), 1988, 393-421.S. Kambhampati, J. Hendler, A validation-structure-based theory of plan modification and reuse, Artificial Intelli-gence 55, 1992, 193-258.S. Edelkamp, Updating Shortest Paths, Proceedings of the European Conference on Artificial Intelligence, 1998,655-659.S. Hanks, D. Weld, A domain-independent algorithm for plan adaptation, Journal of Artificial Intelligence Research2, 1995, 319-360.
... and many more
Artificial Intelligence
SA3 - 138 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy On-Line Planning and Lifelong Planning
Related Work:
G. Ausiello, G. Italiano, A. Marchetti-Spaccamela, U. Nanni, Incremental algorithms for minimal length paths,Journal of Algorithms 12(4), 1991, 615-638.S. Even, H. Gazit, Updating distance in dynamic graphs, Methods of Operations Research 49, 1985, 371-387.E. Feuerstein, A. Marchetti-Spaccamela, Dynamic algorithms for shortest paths in planar graphs, Theoretical Com-puter Science 116(2), 1993, 359-371.P. Franciosa, D. Frigioni, R. Giaccio, Semi-dynamic breadth-first search in digraphs, Theoretical Computer Science250(1-2), 2001, 201-217.D. Friogioni, A. Marchetti-Spaccamela, U. Nanni, Fully dynamic output bounded single source shortest path prob-lem, in: Prodeedings of the Symposium on Discrete Algorithms, 1996, 212-221.S. Goto, A. Sangiovanni-Vincentelli, A new shortest path updating algorithm, Networks 8(4), 1978, 341-372.G. Italiano, Finding paths and deleting edges in directed acyclic graphs, Information Processing Letters 28(1), 1988,5-11.P. Klein, S. Subramanian, Fully dynamic approximation schemes for shortest path problems in planar graphs, in:Proceedings of the International Workshop on Algorithms and Data Structures, 1993, 443-451.C. Lin, R. Chang, On the dynamic shortest path problem, Journal of Information Processing 13(4), 1990, 470-476.H. Rohnert, A dynamization of the all pairs least cost path problem, in: Proceedings of the Symposium on Theoret-ical Aspects of Computer Science, 1985, 279-286.P. Spira, A. Pan, On finding and updating spanning trees and shortest paths, SIAM Journal on Computing 4, 1975,375-380.
... and many more
Algorithm Theory
SA3 - 139 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy On-Line Planning and Lifelong Planning
Related Work:
V. Lumelsky and A. Stepanov. Path planning strategies for point mobile automaton moving amidst unknown obsta-cles of arbitrary shape. Algorithmica, 2:403-430, 1987.M. Barbehenn and S. Hutchinson. Efficient search and hierarchical motion planning by dynamically maintainingsingle-source shortest paths trees. IEEE Transactions on Robotics and Automation, 11(2):198-214, 1995.T. Ersson and X. Hu. Path planning and navigation of mobile robots in unknown environments. In Proceedings ofthe International Conference on Intelligent Robots and Systems, 2001.Y. Huiming, C. Chia-Jung, S. Tong, and B. Qiang. Hybrid evolutionary motion planning using follow boundaryrepair for mobile robots. Journal of Systems Architecture, 47(7):635-647, 2001.L. Podsedkowski, J. Nowakowski, M. Idzikowski, and I. Vizvary. A new solution for path planning in partiallyknown or unknown environments for nonholonomic mobile robots. Robotics and Autonomous Systems, 34:145-152. 2001M. Tao, A. Elssamadisy, N. Flann, and B. Abbott. Optimal route re-planning for mobile robots: A massively parallelincremental A* algorithm. In International Conference on Robotics and Automation, pages 2727-2732, 1997.K. Trovato. Differential A*: An adaptive search method illustrated with robot path planning for moving obstaclesand goals, and an uncertain environment. Journal of Pattern Recognition and Artificial Intelligence, 4(2), 1990.
... and many more
Robotics
SA3 - 140 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
Greedy On-Line Planning and Lifelong PlanningTheoretical Results
Related Work:
S. Carlsson, H. Jonsson, Computing a shortest watchman path in a simple polygon in polynomial time, in: S. Akl, F.Dehne, J. Sack, N. Santoro (Eds.), Proceedings of the Workshop on Algorithms and Data Structures, Vol. 955 ofLecture Notes in Computer Science, Springer, 1995, 122-134.X. Tan, T. Hirata, Constructing shortest watchman routes by divide-and-conquer, in: K. Ng, P. Raghavan, N. Bala-subramanian, F. Chin (Eds.), Proceedings of the International Symposium on Algorithms and Computation, Vol.762 of Lecture Notes in Computer Science, Springer, 1993, 68-77.S. Ntafos, Watchman routes under limited visibility, in: Proceedings of the Canadian Conference on ComputationalGeometry, 1990, 89-92.X. Deng, T. Kameda, C. Papadimitriou, How to learn an unknown environment I: the rectilinear case, Journal of theACM 45(2), 1998, 215-245.F. Hoffman, C. Icking, R. Klein, K. Kriegel, A competitive strategy for learning a polygon, in: Proceedings of theSymposium on Discrete Algorithms, 1997, 166-174.V. Lumelsky, Algorithmic and complexity issues of robot motion in an uncertain environment, Journal of Complex-ity 3, 1987, 146-182.A. Blum, P. Raghavan, B. Schieber, Navigating in unfamiliar geometric terrain, SIAM Journal on Computing 26(1),1997, 110-137.C. Icking, R. Klein, E. Langetepe, An optimal competitive strategy for walking in streets, in: C. Meinel, S. Tison(Eds.), Proceedings of the Symposium on Theoretical Aspects of Computer Science, Vol. 1563 of Lecture notes inComputer Science, Springer, 1999, 110-120.X. Deng, C. Papadimitriou, Exploring an unknown graph, in: Proceedings of the Symposium on Foundations ofComputer Science, 1990, 355-361.S. Albers, M. Henzinger, Exploring unknown environments, in: Proceedings of the Symposium on Theory of Com-puting, 1997, 416-425.
... and many more
SA3 - 141 of 141Greedy On-Line Planning; (c) Sven Koenig; Georgia Tech; January 2002.
http://www.cc.gatech.edu/fac/Sven.Koenig/greedyonline
Lifelong Planning Techniques - Our Work
Please see
We gratefully acknowledge funding from NSF and IBM.The views and conclusions contained in this material are those of the authors and should not be interpreted as representing the official policies,
either expressed or implied, of the sponsoring organizations and agencies or the U.S. government.