a h ierarchical g oal -b ased f ormalism and a lgorithm for s ingle -a gent p lanning aamas ‘12...

A HİERARCHİCAL GOAL-BASED FORMALİSM AND ALGORİTHM FOR SİNGLE-AGENT PLANNİNGAAMAS ‘12

Utku Şirin

1560838

OUTLİNE

Planning and Domain Models Hierarchical Goal Network (HGN) Planner

formalism and proof of its capabilities An algorithm for HGN planning, Goal

Decomposition Planner (GDP) Experimental Results Comments and Conclusions

AUTOMATED PLANNİNG

What is automated planning? There is goal and current situation, aim is to

achieve the goal by executing possible actions Current situation is defined by states Repeatedly;

execute an executable action, apply the changes to the state and check whether the goal is satisfied

How to do these automatically, fast, and in less number of steps?

Important ability for computurized agents Robotic Agents Game-Playing Agents Web-service Agents etc...

DOMAİN MODELS Planner should have a domain model defining the states, actions

and the relation between the actions and states How to build domain models?

Hand-crafted planner module Huge development effort !

Domain-configurable planner Utilization of a domain-model file Most of the uses are Hierarchical Task Network (HTN) Planning

There are methods dividing tasks into subtasks (will be analyzed deeper) Does not focus on goals, but tasks Just apply the tasks until there is no remaining tasks

Easier with respect to hand-crafted planning module Problem of lacking of task and goal correspondence makes it hard to translate

classical planning domains into HTN domains, hence to prove soundness Can we do better ?

Hierarchical Goal Network (HGN) Planner Similar to HTN formalism, but easier to develop domain models More flexible Integrates domain-independent heuristics Decomposes goals rather than tasks Provably HGN has same expressivity power as HTN, is sound and complete

FORMALİSM Classical Planning

Domain D is a finite-state transition system S is a set of states, each state is a finite set of ground atoms

Ex: onTable(block1), on(block2,block1) G is the specification of the goal state comprised of a set of

ground atoms O is a set of operators which is a triple

(head(o), pre(o), eff(o)) Each action is a ground instance of an operator An action a is executable in a state s if s╞ pre(a) (s entails pre(a))

Meaning that s satisfies the preconditions of action a After execution an action a, the new state s’ is

s’ = (s - eff-(a)) ∪ eff+(a) A plan = <a1, …, an> is executable in s if each action ai is

executable in the state produced by ai-1. A solution to a classical planning problem P = (D, s0, g) is , if δ(s0, )

╞ g, where D is the domain, s0 is the initial state and g is a goal definition

HİERARCHİCAL GOAL NETWORK (HGN) PLANNİNG Similar to classical planning but have methods

additionally A HGN method m is a quadruple (head(m), pre(m),

sub(m), post(m)) head(m) and pre(m) same as the ones in operators for

classical planning sub(m) list of goals <g1, …, gk> where each gi is a goal

formula post(m) = gk ; if sub(m) is non-empty

post(m) = pre(m) ; otherwise Relevance: An action a or a method m is relevant for a

goal formula g if eff(a) or post(m) entails at least one literal in g. Provides smaller search space than a classical planner

A HGN domain is D’ = (D,M) where D is a classical planning domain and M is the set of methods

PROOFS HGN planning is sound and complete. These are

proved by mapping HGN planning problem to classical planning problem Soundness:

HGN planning domain is D = (D’,M), where D’ is a classical planning domain

Every action executable in D is also executable in D’ Hence, every solution to problem P = (D, s0, g) is also a solution to

P = (D’, s0, g) Hence, HGN planning is sound.

Completeness For a path x in classical domain D, there can be constructed a

method m that specifies each state in x as a sub-goal in its sub(m). Then a single action will achieve each subgoal completing the path Hence for each classical planning problem P = (D, s0, g), there is a

HGN planning problem P’ = ((D, M), s0, g) where P and P’ have same set of solution

PROOFS HGN formalism expressivity power is equal to

HTN formalism From HGN formalism construct HTN formalism

Map subgoals to subtasks with same preconditions <g1, … , gk> mapped directly to <tg1, … , tgk> In HTN, however, it is needed to define primitive tasks

as well. So, define a new primitive task for each tgi having same precondition as gi and no subtasks (that’s why it is primitive, indeed).

From HTN formalism construct HGN formalism Map subtasks to subgoals with same preconditions <t1, … , tk> mapped directly to <fint1, … , fintk>

A LİTTLE BİT HTN

Associate methods with networks

Critics for different types of network

ALGORİTHM, GOAL DECOMPOSİTİON PLANNİNG (GDP)

GDP İS SOUND AND COMPLETE

Soundness, if GDP returns a plan, it is a solution indeed. Induction on length of the solution n

For n = 0, it means s0╞ g If is a solution of length k < n returned by GDP Then ’ of length k+1 returned by GDP is also a solution

as line 11 appends a relevant action/method u to the plan

Completeness, if there is a solution, then GDP will return it Induction on length of the solution n

For n = 0, GDP will return it as s0╞ g Assume there is a solution of length k and GDP returns it Then GDP returns solutions of length k+1 as at line 11

GDP appends relevant action/method

DOMAİN-İNDEPENDENT HEURİSTİCS

One of the most important contribution of HGN planning formalism

Line 9-13 was choosing action/methods nondeterministically, however, it can be chosen based on a heruistic value

So, line 9-13 will be replaced as below:

DOMAİN-İNDEPENDENT HEURİSTİCS

How to calculate heruistic value for each action/method:

First propositional

level in which p

appears in Plannig Graph

States-Levels

Action-Levels

PLANNING GRAPH

EXPERİMENTS An HTN planner SHOP2, a classical planner FF and the HGN planner GDP are compared in five

different domains DOMAINs:

Logistics Transportation Domain: There several cities. At each city there are several post-offices Aim is to move specified number of packages to different cities Intracity transportation is done via trucks Intercity transportation is done via airplanes Trucks and airplanes are unlimited

Blocks-World: There are n-many blocks in a specified configuration Convert the initial configuration to goal configuration by obeying the following rules:

Move one block at a time A block may be put on another block or table

Depots: Combination of Logistics and Blocks-World domain Trucks have hoist just like the arms of robots in Blocks-World domain Stacking the crates becomes Blocks-World domain

Towers of Hanoi: There are three sticks in which several disks are places on it Disks are put in such a way that each disk is smaller than the disk that it is put on it Move disks from one stick to other by obeying the following rules

Move one disk at a time No disk may be put onto a smaller disk

3-City Routing The only newly written domain, hence it is a weak domain model There are 3 cities Each city has several locations and locations are connected with roads arbitrarily in the cities There is one random road connected city1 to city3 and one random road connected city2 to city3 Aim is to go from city1 or city3 to city2

RESULTS

Logistics Domain Results For n = 15, 20, …, 60 packages

GDP-h does not bring much overhead for heuristic function calculation

FF has strong heuristics

RESULTS

Blocks-World Domain Results For n = 10, 20, …, 100 blocks

FF has known problems with Blocks-World GDP-h has heuristic value calcuation time

overhead GDP-h results in a bit smaller plans

RESULTS

The Depots Domain Results For n = 10, 20, …, 80 crates

FF cannot solve more than 24 crates GDP-h heuristic overhead is significant, also

have almost same plans with the other planners

RESULTS Towers of Hanoi Domain Results For n = 3, …, 14 rings

SHOP2 could not solve problems for n > 12 and GDP and GDP-h cound not solve problems for n > 14

Both is due to stack overflow, hence thought as implementation issue, FF did not use a stack

FF has very bad planning results while the others have almost optimal path results

RESULTS 3-City Routing Domain Results

All previous domains are strong and very well defined domains

This one is constructed as a weak domain model having only one method for HGN and three corresponding methods for HTN

For n = 10, 20, …, 100 cities

GDP and SHOP2 could not solve except for n = 10

FF may solve the problems up to n = 60, after that point it even could not parse the problem file

GDP-h solved all problems quickly and nearly optimal

The reason for the success of the GDP-h is the guided search thanks to the heuristics

As the model is weak, the other planners do not have enough information to constraint the search space and do a lot of backtrackings, however, GDP-h uses heuristic to be able to guide its search and narrow its search space

As a conclusion, we can say that if there is a strong domain model, heuristic calculation most probably will result in a overhead and give not significantly better result; however, if the model is weak than contribution of heuristic function is crucial

RESULTS Domain Authoring

Subjective to developers Measures as number of lisp symbols and compared for GDP and SHOP2

planners GDP almost always have less number of symbols HTN specifies more than one task to achieve a goal formula. It defines a

decomposition task, several primitive tasks and deletion-check conditions, while, GDP only needs to speficify those as goals and let the planner choose the appropriate action to do with

respect to the goal. There is a need for different base cases for each method in HTN, however, GDP does not need such bases cases as the semantic of a goal provides to do nothing if a goal is true.

COMMENTS AND CONCLUSİONS No cross-domain explanations for the experiments. For

example, why FF is unsuccessfull is not answered. Just the results are shown and it is said that GDP is capable enough the others, even produces better results for weak domain models.

Almost everything is compared with HTN but HTN is not explained, at least in principle. Moreover, main difference is not shown algorithmically. What was doing HTN and now what is the thing that HGN is not doing, thereby resulting better. For example, can we use heuristics in SHOP2 planner. I guess we can, and if we can, it may also produce similar results.

HGN is more intutive when comparing both, hence, seems good contribution to the literature (since 1974). However, HTN is being used many many years, hence more comprehensive comparison is expected

So the only contribution of HGN is the easy development domain models, which is even a subjective criteria.

REFERENCES V. Shivashankar, U. Kuter, D. S. Nau, and R. Alford. A hierarchical goal-based

formalism and algorithm for single-agent planning. In Eleventh Internat. Conf. on Autonomous Agents and Multiagent Systems (AAMAS), 2012

Kutluhan Erol, James A. Hendler, and Dana S. Nau. UMCP: A sound and complete procedure for hierarchical task-network planning. In Proceedings of the International Conference on AI Planning & Scheduling (AIPS), pages 249–254, 1994.

J. Hoffmann and B. Nebel. The FF planning system. JAIR, 14:253–302, 2001. D. S. Nau, T.-C. Au, O. Ilghami, U. Kuter, J. W. Murdock, D. Wu, and F. Yaman.

SHOP2: An HTN planning system. JAIR, 20:379–404, Dec. 2003. M. M. Veloso. Learning by analogical reasoning in general problem solving.

PhD thesis CMU-CS-92-174, Carnegie Mellon University, 1992. F. Bacchus. The AIPS ’00 planning competition. AI Mag., 22(1):47–56, 2001. M. Fox and D. Long. International planning competition, 2002. http

://planning.cis.strath.ac.uk/competition. Hui Li. Technical Report: Relaxed Plan Graph Heuristic Cost Estimation. 2006.

http://planning.cis.strath.ac.uk/competition



ANY COMMENTS OR QUESTİONS ?

a h ierarchical g oal -b ased f ormalism and a lgorithm for s ingle -a gent p lanning aamas ‘12...

Documents