aquaint aquaint: two-way bridge a two-way bridge between language and logic recent accomplishments...

20
AQUAINT: Two-Way Bridge AQUAINT A Two-way Bridge between Language and Logic Recent Accomplishments and Challenges Danny Bobrow and Ron Kaplan With Cleo Condoravdi, Dick Crouch, Valeria de Paiva, Lauri Karttunen, Tracy King, John Maxwell, Annie Zaenen

Upload: gavin-mcdowell

Post on 18-Jan-2016

213 views

Category:

Documents


0 download

TRANSCRIPT

AQUAINT: Two-Way Bridge

AQUAINT

A Two-way Bridge between Language and LogicRecent Accomplishments and Challenges

Danny Bobrow and Ron KaplanWith

Cleo Condoravdi, Dick Crouch, Valeria de Paiva, Lauri Karttunen, Tracy King, John Maxwell, Annie Zaenen

AQUAINT: Two-Way Bridge

AQUAINT

Outline

• Recent accomplishments• Some technical details• Collaborative challenges

Assertions

F-SC-S KR

XLE/LFG Parsing KR Mapping Target LogicText

M

Text

Sources

Question Query

Text to user

ComposedF-Structure Templates

MM

AnswersExplanationsSubqueries

XLE/LFGGeneration

AQUAINT: Two-Way Bridge

AQUAINT

Accomplishments: Incorporation of external resources

• Cyc extractions– Generalization hierarchy – Language to KR mappings

• VerbNet extractions– Syntactic lexicon, subcategorization frames– Thematic roles and semantics– Linkage between syntax and semantics

AQUAINT: Two-Way Bridge

AQUAINT

Accomplishments:Test suites

• XLE syntactic coverage with gold standards• Extracted examples: WordNet, VerbNet, NIST• PropBank

mapping to full sentence, dependency triples

(including stemmed verbs & argument heads)

• Project specific– Key phenomena

context inducing constructions, questions, time expressions, etc

– Domainsterrorism (bombings and money laundering),

travel, commercial transactions

AQUAINT: Two-Way Bridge

AQUAINT

Accomplishments: System Work

• PARC advances– Semantics-KR mapping in term-rewriting system – XLE, Glue, term-rewriting in one process– Uniform use of XLE packed ambiguity management

• Exploration and exploitation– ResearchCyc– ANSProlog– WordNet

AQUAINT: Two-Way Bridge

AQUAINT

PARC ArchitectureText

C-structure

F-structure

Linguisticsemantics

Conceptual&

ContextedKR

(Bridge KR)

XLE/LFGparsing

Gluederivation

Semanticlexicon

LFG GrammarLexicon

TermRewriting

Word structureEntity recognition Morphology

Target KRR

Mapping Rules

Cyc

VerbNetWordNet

CycANSProlog

AQUAINT: Two-Way Bridge

AQUAINT

Layered Mapping from F-Structures to KR

• Glue derivation: F-structure to linguistic semantics– Captures intensionality, scope variation, etc

– Shows logical structure by nesting of formulas

• Linguistic semantics to flattened clausal form– Use skolems as terms to encode structure

– Bridges between linguistic structure and knowledge structure

• Map semantic clauses to Bridge KR clauses– Use resourced term-rewriting with ambiguity management

– Canonicalizes to more uniform (easier to match) representations

• Map from Bridge KR to target KR

Natural decomposition through intermediate representations

with different formal characteristicsthat capture different generalizations

AQUAINT: Two-Way Bridge

AQUAINT

Glue DerivationF-Structure to Linguistic Semantics

• Two parallel, linked logics– Resourced linear logic for guiding semantic construction

• Derivation driven solely by F-structures as types– A meaning logic for semantic representation

• Composition driven by linear logic derivation

• Alternative derivations introduce semantic ambiguities– Quantifier scope ambiguity

• Every cable is attached to a base-plate x cable(x) y plate(y) & attached(x,y) y plate(y) & x cable(x) attached(x,y)

– Internal/external modification • John put up bookshelves for two days

– Working on it for two days, or stayed up for two days– Comparison sets in information structure

• Pets must be carried on escalator• Clothes must be worn in public

AQUAINT: Two-Way Bridge

AQUAINT

Linguistic semantics and flattened clauses

Two terrorists plan to attack a factory

A1,B1: exists(f. factory(f) & exists2(t. terrorist(t) & plan(t, exists(a. attack(a, t ,f))))One real factory for both terrorists

A1,B2: exists2(t. terrorist(t) & exists(f. factory(f) & plan(t, exists(a. attack(a, t ,f)))))A different real factory for each terrorist

A2: exists2(t. terrorist(t) & plan(t, exists(f. factory(f) & exists(a. attack(a, t, f)))))The terrorists haven’t chosen factories

Result of glue derivation

ist(c0, terrorist(t)) ist(c0, cardinality(t, 2))

ist(c0, plan(t, c1)) ist(c1, attack(a, t, f))

skfn(t, c0) skfn(c1, t) skfn(a, c1)

A1: ist(c0, factory(f))B1: skfn(f,c0)B2: skfn(f, t)A2: skfn(f, c1)A2: ist(c1, factory(f))

c0 is the top level contextc1 is the context of the planist(C,X): X is true in context Cskfn(sk, arg): skolem sk depends on arg

Flat, packed semantics

AQUAINT: Two-Way Bridge

AQUAINT

Skolems refer to subconcepts• terrorist(t)

– t does not refer to an individual that is a terrorist– rather t refers to a subconcept of the concept

“terrorist”

• Permits a simpler treatment of intensional constructions– Negotiations prevented a war n. subconcept(n, Negotiation) &

w. subconcept(w, War) & prevent(n,w)

– prevent(n,w) means that • concept n has instances (in the world of the prevention)• concept w has no instances in that world

AQUAINT: Two-Way Bridge

AQUAINT

After Glue and Flattening:Mapping to Bridge KR

• Map to concepts & roles in ontology (Cyc)

• Apply meaning postulates / lexical entailments

• Make conceptual & contextual structure explicit

• Eliminate ontologically ill-formed analyses

AQUAINT: Two-Way Bridge

AQUAINT

Example Abstract KR (I)Words Cyc Terms & Roles

• Ed got to Baghdad(objectMoving get_ev5 Ed7)

(toLocation get_ev5 Baghdad6)

(sub_concept get_ev5 Conveying-Generic)

(sub_concept Baghdad6 CityOfBaghdadIraq)

(sub_concept Ed7 Ed-default-name)

(instantiated t Ed7) (instantiated t Baghdad6) (instantiated t get_ev5)

• Ed got the package to Baghdad(objectMoving get_ev8 package9)

(toLocation get_ev8 Baghdad10)

(doneBy get_ev8 Ed11) …

• Ed got the package(objectMoving get_ev12 package14)

(toLocation get_ev12 Ed13) …

AQUAINT: Two-Way Bridge

AQUAINT

• Ed cooled the room(sub_concept cool_ev18 scalar-state-change)(decreasesCausally cool_ev18 room20 temperatureOfObject)(doneBy cool_ev18 Ed21)(sub_concept room20 RoomInAConstruction) …

• Ed lowered the temperature of the room(sub_concept lower_ev22 scalar-state-change)(decreasesCausally lower_ev22 room24 temperatureOfObject)(doneBy lower_ev22 Ed23)(sub_concept room24 RoomInAConstruction) …

• The room cooled(sub_concept cool_ev25 scalar-state-change)(decreasesCausally cool_ev25 room26 temperatureOfObject))(sub_concept room26 RoomInAConstruction) …

Example Abstract KR (II)Canonicalization

AQUAINT: Two-Way Bridge

AQUAINT

Example Abstract KR (III)Contextual Structure

• The senator prevented a warpreventRelation (action1 c0 c2)doneBy (action1 senator3)

sub_concept (action1 Eventuality)sub_concept (senator3 USSenator)sub_concept (war4 WagingWar)

instantiated (c0 senator3)instantiated (c0 action1)uninstantiated (c0 war2)

uninstantiated (c2 action1)instantiated (c2 war2)

AQUAINT: Two-Way Bridge

AQUAINT

Term-Rewriting for KR Mapping

Example Rule

• Rules of form– <Input terms> ==> <Output terms> (obligatory)– <Input terms> ?=> <Output terms> (optional)

(optional rules introduces new choices)

• Input patterns allow– Consume term if matched: Term– Test on term without consumption: +Term– Test that term is missing: -Term– Procedural attachment: {ProcedureCall}

hire(E), subj(E, X), obj(E, Y),

+sub_concept(X, CX), +sub_concept(Y, CY),

{genls(CX, Organization), genls(CY, Person)}

==> sub_concept(E, EmployingEvent),

performedBy(E,X), personEmployed(E,Y).

• Ordered rule application– Rule1 applied in all possible ways to Input to produce Output1– Rule2 applied in all possible ways to Output1 to produce Output2

AQUAINT: Two-Way Bridge

AQUAINT

• Alternative lexical mappings /- cyc_concept_map(bank, FinancialInstitution).

/- cyc_concept_map(bank, SideOfRiver).

ist(C, P(Arg)), cyc_concept_map(P,Concept) ==> sub_concept(Arg,Concept).

• Input term– ist(c0, bank(b1))

• Alternative rule applications produce different outputs Rewrite system represents this ambiguity by a new choice

• Output– C1: sub_concept(b1, FinancialInstitution)

C2: sub_concept(b1, SideOfRiver)– (C1 xor C2) 1

Term-rewriting can introduce ambiguity

PermanentBackgroundFacts

Mapping fromPredicate to Cyc concept

AQUAINT: Two-Way Bridge

AQUAINT

• The bank hired Ed hire(E), subj(E,X), obj(E,Y),

+sub_concept(X,CX), +sub_concept(Y,CY),

{genls(CX, Organization), genls(CY,Person)}

==> sub_concept(E, EmployingEvent),

performedBy(E,X), personEmployed(E,Y).

• From Cyc: genls(FinancialInstitution, Organization) true genls(SideOfRiver, Organization) false

• If bank is mapped to SideOfRiver, the rule will not fire.This leads to a failure to consume the subject. subj(A,B) ==> stop.

prunes this analysis from the choice space.

• In general, later rewrites prune analyses that don’t consume grammatical roles.

Term-rewriting can prune ill-formed mappings

Rule for mappinghire

AQUAINT: Two-Way Bridge

AQUAINT

Mapping to Target KR: Collaborative Challenges

• Target ontology– Cyc – extending the ontology– ANSProlog -- mapping from Cyc ontology

• Packed ambiguity– Pass on packed ambiguities? – Unpack possible interpretations?– Choose statistically most likely?

• Contexts– Cyc: will microtheories work?– ANSProlog: incorporating a logic of contexts?

AQUAINT: Two-Way Bridge

AQUAINT

Two forms of future evaluation

• Project– Regression test suites to ensure progress towards

benchmark representations– Matching algorithm to test utility of KR canonicalization

• AQUAINT Program: – Local textual inference?– Incorporation by a back-end system?

AQUAINT: Two-Way Bridge

AQUAINT

Thank you