andreas herzig irit, cnrs, univ. toulouse, franceandreas.herzig/p/pfia17_invited.pdf ·...

KR vs. ICAPS What’s in a planning problem? States and goals: EL Actions and plans: DEL

Multiagent planning and epistemic logic

Andreas HerzigIRIT, CNRS, Univ. Toulouse, France

Journee pleniere, preGDR sur les Aspects Formels et Algorithmiques de l’IA

July 5, 2017

1 / 25


Outline

1 Planning: KR vs. ICAPS

2 What’s in a planning problem?

3 States and goals: Epistemic Logic

4 Actions and plans: Dynamic Epistemic Logic

2 / 25


KR vs. ICAPS: two diverging communities

ICAPS KR2002 — France —2004 — Canada —2006 — UK —2008 — Australia —2010 — Canada —2012 Brazil Italy2014 USA Austria2016 UK South Africa2018 NL USA

where should I publish my paper on action and planning?case

when (complex concepts∨complex models) then submit(KR);when (implemented∧fast) then submit(ICAPS)

esac

3 / 25


KR vs. ICAPS: reasons for the divergence

1 KR: focus on representationwhich language?

which concepts? (action, initial state, goal,. . . )which logical form? (arguments,. . . )has truth values? (facts do, actions don’t)

which models?theoretical properties of reasoning problems?

plan existence decidable?complexity of decision problems?

“logic is like a toothbrush”2 ICAPS: focus on fast reasoning methods

standardised representation languages (PDDL, PDDL+++,. . . )performance on benchmarks?

4 / 25


KR vs. ICAPS: soon back together?

since ∼2013: ‘planning goes multiagent’workshop ‘Distributed and Multi-Agent Planning’ @ ICAPS[Petrick, Geffner, Domshlak, Brafman, Kambhampati, Nebel,. . . ]

since ∼2011: ‘Dynamic Epistemic Logic goes planning’[Bolander, van der Hoek, Wooldridge, Aucher, Schwarzentruber,. . . ]

Dagstuhl workshops on multiagent epistemic planning(2008, 2014, 2017)⇒ many new representation problems⇒ in particular: reasoning about other agents’ mental states

fundamental in any kind of interactionpeople who are unable to attribute mental states to others havea lot of difficulties to interact with themcounts as a disorder, cf. autism, schizophrenia,. . .

5 / 25


The need for epistemic reasoning in planning1 single-agent planning

uncertainty about initial situationuncertainty about action effectssensing actions (alias knowledge producing actions)

⇒ contingent/conformant planning2 multiagent planning

initial situation:first-order: I don’t know whether p.second-order: I don’t know whether you know that p.second-order: I know that you don’t know whether p.. . .

goal:first-order: I want to know whether p.second-order: I want to know whether you know that p.second-order: I want you to believe that q.. . .

actions with epistemic effectssensing actionscommunication actionshow to model the agents’ perception of actions and events?

6 / 25


Challenge: robots with theory of mind [Milliez et al. 2014]

at step 3, agent Green’s beliefs become falsecolored arrows = beliefs about white book position (red = robot)colored spheres = reachability of an object for an agent

1. 2.

3. 4.

7 / 25


Problems, problems

representation problems:simple integrations of epistemic and spatial reasoning?model ‘expiry date’ for knowledge/belief?

light in room x is on at Tj is in room x (so j believes that the light is on at T )j leaves the room at T+1at T ′ > T , does j still believe that the light in x is on?

higher-order belief revision?

⇒ to be solved in any application!reasoning problems:

epistemic reasoning is difficultat least PSPACEEXPTIME complete if common knowledge/belief involved

what are good benchmarks?“what is epistemic planning’s blocksworld?”

centralised or decentralised planning?

8 / 25


Outline





9 / 25


What’s in a planning problem?

planning problem = 〈init, goal, actionLaws〉

1 logical form of init: propositionproposition = set of possible worlds (states)can be described in various logical languages:

propositional logicepistemic logic. . .

classical planning:initial state = complete proposition

= a single possible world= a valuation of propositional logic

2 logical form of goal: proposition3 logical form of actionLaws: action type

action type: arm-raisingaction token: Bruno’s raising of his right arm in room 7 ofbuilding 007 of Caen IUT on July 5, 2017 at 10:55:55

10 / 25


What’s in an action?

“event brought about by an agent” [Davidson]

“proposition that can be phrased agent i sees to it thatproposition ϕ is true” [Belnap’s stit thesis]

“something that has precondition and effects” [AI folklore]

action = 〈precond, effect〉

precond = propositioneffect = ?

11 / 25


What’s in an action effect?STRIPS actions: effect = conjunction of literalshowever: an action type is instantiated in differentcircumstances⇒ effects typically depend on thesecircumstancesconditional effects:

effect ={〈condition1, L1,1 ∧ · · · ∧ L1,m1〉,

. . . ,

〈conditionn, Ln,1 ∧ · · · ∧ Ln,m1〉}

example: agent i’s action of flipping a switch

precond(flipi) = AtSwitchi

effect(flipi) = {〈¬On,On〉,

〈On, ¬On〉}

12 / 25


Outline





13 / 25


Epistemic logic: language

knowledge explained in terms of possible worlds [Hintikka]:Kiϕ = “agent i knows that ϕ”

= ϕ true in every world that is possible for iBNF:

ϕ ::= p | ¬ϕ | ϕ ∧ ϕ | Kiϕ

where p ranges over Prp and i over Agt

3 possible epistemic attitudes w.r.t. a formula ϕ:KiϕKi¬ϕ¬Kiϕ ∧ ¬Ki¬ϕ

14 / 25


Epistemic logic: possible worlds semantics

model M = (W , {Ri}i∈Agt,V) withW non-empty set of possible worldsRi ⊆ W ×W accessibility relationsV : W −→ 2Prp valuation

Ri is an equivalence relation (indistinguishability)Ri(w) = “set of worlds i cannot distinguish from w”

= “set of worlds compatible with i’s knowledge”truth conditions:

M,w p iff . . .M,w ¬ϕ iff . . .M,w ϕ ∧ ψ iff . . .M,w Kiϕ iff M,w′ ϕ for all w′ ∈ Ri(w)

15 / 25


Epistemic logic: possible worlds semantics

muddy children puzzle, initial situation

12R2

R1

12

R1

12R2

12

(reflexive arrows omitted)

M, 12 m1 ∧m2 ∧ K1m2 ∧ ¬K1m1 ∧ ¬K1¬m1

16 / 25


Epistemic logic for epistemic planning?

can be modeled:init = formula of epistemic logicgoal = formula of epistemic logic

cannot be expressed:actionLaws

17 / 25


Outline





18 / 25


Muddy children: Episode 1

1 initially, common knowledge that nobody is muddy2 1 gets muddy but isn’t sure; 2 watches3 2 gets muddy but isn’t sure; 1 watches

12

R1

12

R1

R212

R1

121 gets muddy

=⇒ 122 gets muddy

=⇒ 12R2

12

19 / 25


Dynamic epistemic logic DEL

idea: model uncertainty about current event by introducingpossible events

uncertainty about world uncertainty about eventpossible worlds possible eventsindistinguishability of worlds indistinguishability of events

⇒ ‘possible event models’distinguish agents who observe from agents who don’t

N.B.: an agent typically observes only very few events

muddy children:event model where 1 plays, 2 watches

skip1 R1getsMuddy1

(reflexive arrows omitted)

20 / 25


DEL: event models

EM = (E, {Si}i∈Agt, precond, effect) event model, whereE is a nonempty set of eventsSi ⊆ E × E

every Si is an equivalence relationeSi f = “i perceives occurrence of e as occurrence of f ”

precond : E −→ Fmlseffect : E −→ Fmls s.t. effect(e) conjunction of literals

(just as in STRIPS)

21 / 25


DEL: product construction

update world model WM = (W ,R ,V) by event model EM

WM ⊗ EM = WM′

where

W ′ = {(w, e) ∈ W × E : M,w precond(e)}(w, e)R ′i (v , f) iff wRiv and eSi f

V ′((w, e)) = (V(w) \ {p : p negative in effect(e)})∪ {p : p positive in effect(e)}

22 / 25


DEL for epistemic planning?

explored since >5 years [Bolander & Anderson 2011];[Lowe, Pacuit & Witzel 2011]; [Aucher, Maubert & Pinchinat 2014];[Yu, Li & Wang 2015],. . .

init = formula of multiagent epistemic logicgoal = formula of multiagent epistemic logicaction type = agent + event model

reasoning: not so easyplan existence undecidable in general[Bolander & Anderson 2011]; [Aucher & Bolander 2013];[Charrier, Maubert & Schwarzentruber 2016]decidable fragments: heavily restricted [Yu, Wen & Liu 2013];[Bolander et al. 2015],. . .

representation: some problems that seemingly wentunnoticed. . .

23 / 25


DEL for epistemic planning: problemsevent models rather describe action tokensactionLaws describe types, not tokenshow to describe conditional effects?

list all possible cases of perception of the actual eventinfinitely many conditional effects needed

conditional effects of getMuddy(i):

(inGardeni , mi)

(inGardenj , Kjmi)

(Ki inGardenj , Ki(Kjmi ∨ Kj¬mi))

(KjKi inGardenj , . . . )

...

(CKi,j inGardenj , CKi,j(Kjmi ∨ Kj¬mi))

⇒ event models with an infinite number of points!even when finite, event models have to be bigworld models typically grow exponentially when updated

24 / 25


Conclusion

knowledge representation with DEL event models:art rather than craftpractical problemsconceptual problems (type vs. token)

the other agents’ observation should be based on informationfrom the possible worlds model, not from the event model

edge-conditioned event models [Bolander, 2015]special propositional variable “agent i is watching”[Bolander et al., JoLLI 2016]

part of the state, not part of the action!special propositional variable “agent j is watching agent i”[Bolander et al., ongoing]

j changes her beliefs when j watches i’s actionj does not change her beliefs when j does not watch i’s action

25 / 25

andreas herzig irit, cnrs, univ. toulouse, franceandreas.herzig/p/pfia17_invited.pdf ·...

Documents