© daniel s. weld 1 take-home handed out friday 12/12 2:30pm due monday 12/15 2:30pm points worth 2...

50
© Daniel S. Weld 1 Take-Home • Handed out Friday 12/12 2:30pm • Due Monday 12/15 2:30pm • Points Worth 2 problem sets

Upload: beatrix-higgins

Post on 06-Jan-2018

215 views

Category:

Documents


2 download

DESCRIPTION

© Daniel S. Weld 3 Report 8 page limit; 12 pt font; 1” margins Model on conference paper Include abstract, conclusions, but no introduction Online and offline aspect of your agent Describe your use of search (if you did) Section on lessons learned / experiments Short section explaining who did what Due 12/18 1pm Points Worth 40-60% of project score

TRANSCRIPT

Page 1: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 1

Take-Home • Handed out

  Friday 12/12 2:30pm• Due

  Monday 12/15 2:30pm• Points

  Worth 2 problem sets

Page 2: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 2

Tournament• Wednesday

  3 rounds each (unless requested or finals)• Before each battle, teams describe

project  6 min per group  Both people to talk  Focus on specific approach, surprises, lessons  2 slides max (print on transparencies)

Page 3: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 3

Report• 8 page limit; 12 pt font; 1” margins• Model on conference paper

  Include abstract, conclusions, but no introduction  Online and offline aspect of your agent  Describe your use of search (if you did)  Section on lessons learned / experiments  Short section explaining who did what

• Due 12/18 1pm• Points

  Worth 40-60% of project score

Page 4: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 4

Outline• Review of topics• Hot applications

  Internet “Search”  Ubiquitous computation  Crosswords

Page 5: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 5

573 Topics

Agency

Problem Spaces

Search Knowledge Representati

on

Planning

Perception NLP Multi-

agentRobotic

s

Uncertainty MDPs Supervised

Learning

Reinforcement

Learning

Page 6: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 6

State Space Search•Input:

  Set of states  Operators [and costs]  Start state  Goal state test

•Output:  Path Start End  May require shortest path

Page 7: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 7

•GUESSING (“Tree Search”)  Guess how to extend a partial solution to a

problem.  Generates a tree of (partial) solutions.  The leafs of the tree are either “failures” or

represent complete solutions•SIMPLIFYING (“Inference”)

  Infer new, stronger constraints by combining one or more constraints (without any “guessing”)Example: X+2Y = 3 X+Y =1 therefore Y = 2

•WANDERING (“Markov chain”)  Perform a (biased) random walk through the

space of (partial or total) solutions

Page 8: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 8

Some Methods• Guessing – State Space Search

1. BFS, DFS2. Iterative deepening, limited discrep3. Bidirectional4. Best-first search, beam5. A*, IDA*, SMA*6. Game tree7. Davis-Putnam (logic)

• Simplification – Constraint Propagation1. Forward Checking2. Path Consistency 3. Resolution

• Wandering – Randomized Search1. Hillclimbing2. Simulated annealing3. Walksat4. Monte-Carlo Methods

Constraint Satisfaction+

Page 9: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 9

Admissable Heuristics• f(x) = g(x) + h(x)• g: cost so far• h: underestimate of remaining costs

Where do heuristics come from?

Page 10: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 10

Relaxed Problems• Derive admissible heuristic from exact cost of a solution to a

relaxed version of problem  For transportation planning, relax requirement that car has to stay

on road Euclidean dist  For blocks world, distance = # move operations heuristic = number

of misplaced blocks  What is relaxed problem?

# out of place = 2, true distance to goal = 3• Cost of optimal soln to relaxed problem

cost of optimal soln for real problem

Page 11: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 11

CSP Analysis: Nodes ExploredBT=BM

BJ=BMJ=BMJ2

CBJ=BM-CBJ

FC-CBJ

FC

More

Fewer=BM-CBJ2

Page 12: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 12

Semantics• Syntax: a description of the legal

arrangements of symbols   (Def “sentences”)

• Semantics: what the arrangement of symbols means in the world

Sentences

FactsFacts

Sentences

Representation

World

Semantics

Semantics

Inference

Page 13: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 13

Propositional. Logic vs. First Order

Ontology

Syntax

Semantics

Inference Algorithm

Complexity

Objects, Properties, Relations

Atomic sentencesConnectives

Variables & quantificationSentences have structure: termsfather-of(mother-of(X)))

UnificationForward, Backward chaining Prolog, theorem proving

DPLL, GSATFast in practice

Semi-decidableNP-Complete

Facts (P, Q)

Interpretations (Much more complicated)Truth Tables

Page 14: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 14

Computational Cliff• Description logics• Knowledge representation systems

Page 15: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 15

Machine Learning Overview• Inductive Learning

  Defn, need for bias, …• One method: Decision Tree Induction

  Hill climbing thru space of DTs  Missing attributes  Multivariate attributes

• Overfitting• Ensembles• Naïve Bayes Classifier• Co-learning

Page 16: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 16

Learning as search thru hypothesis space

Yes

Outlook Temp

Humid Wind

On which attribute should we split?When stop growing tree?

Page 17: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 17

Ensembles of Classifiers• Assume

  Errors are independent  Majority vote

• Probability that majority is wrong…

• If individual area is 0.3• Area under curve for 11 wrong is 0.026• Order of magnitude improvement!

Ensemble of 21

classifiers

Prob 0.2

0.1

Number of classifiers in error

  = area under binomial distribution

Page 18: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 18

PS 3 Feedback• Independence in Ensembles

Classifier AClassifier BClassifier C

Example 1

Example 2

Example 3

X X

Ensemble

X X

XX X XX

66% error66% error66% error100% error

Page 19: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 19

Planning• The planning problem

  Simplifying assumptions• Searching world states

  Forward chaining (heuristics)  Regression

• Compilation to  SAT, CSP, ILP, BDD

• Graphplan  Expansion (mutex)  Solution extraction (relation to CSPs)

Page 20: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 20

Simplifying AssumptionsEnvironment

Percepts Actions

What action next?

Static vs.

Dynamic

Fully Observable vs.

Partially Observable

Deterministic vs.

Stochastic

Instantaneous vs.

Durative

Full vs. Partial satisfaction

Perfectvs.

Noisy

Page 21: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 21

How Represent Actions?• Simplifying assumptions

  Atomic time  Agent is omniscient (no sensing necessary).   Agent is sole cause of change  Actions have deterministic effects

• STRIPS representation  World = set of true propositions  Actions:

• Precondition: (conjunction of literals)• Effects (conjunction of literals)

aa

anorth11 north12

W0 W2W1

Page 22: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 22

STRIPS Planning actions

(:operator walk :parameters (?X ?Y) :precondition (at ?X)

:effect (and (at ?y) (not (at ?x))))

(:operator walk :parameters (?X ?Y) :precondition (and (at ?X)

(neq ?Y ?Z)) :effect (and (at ?y)

(not (at ?x))))

Frame problemQualification problemRamification problem

Page 23: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 23

Markov Decision ProcessesS = set of states set (|S| = n)A = set of actions (|A| = m)Pr = transition function Pr(s,a,s’)

represented by set of m n x n stochastic matriceseach defines a distribution over SxS

R(s) = bounded, real-valued reward functionrepresented by an n-vector

Page 24: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 24

Value Iteration (Bellman 1957)Markov property allows dynamic programming

Value iteration Policy iteration

)'(' )',,Pr(max)()( 1 ss VsassRsV kk

a

ssRsV ),()(0

)'(' )',,Pr(maxarg),(* 1 ss Vsasks k

a

Vk is optimal k-stage-to-go value function

Bellman backup

Page 25: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 25

Dimensions of Abstraction

A B C

A B C

A B C

A B C

A B C

A B C

A B C

A B C

A

A B C

A B

A B C

A

B

C=

5.3

5.3

5.3

5.3

2.9

2.9 9.3

9.3

5.3

5.2

5.5

5.3

2.9

2.79.3

9.0

Uniform

Nonuniform

Exact

Approximate

Adaptive

Fixed

Page 26: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 26

Partial observability• Belief states

  POMDP MDP

Page 27: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 27

Reinforcement Learning• Adaptive dynamic programming

  Learns a utility function on states• Temporal-difference learning

  Don’t update value at every state• Exploration functions

  Balance exploration / exploitation• Function approximation

  Compress a large state space into a small one

  Linear function approximation, neural nets, …

Page 28: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 28

PROVERB

Page 29: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 29

PROVERB• Weaknesses

Page 30: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 30

PROVERB• Future Work

Page 31: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 31

Grid Filling and CSPs

Page 32: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 32

CSPs and IRDomain from ranked candidate list?Tortellini topping:TRATORIA, COUSCOUS, SEMOLINA, PARMESAN,RIGATONI, PLATEFUL, FORDLTDS, SCOTTIES,ASPIRINS, MACARONI, FROSTING, RYEBREAD,STREUSEL, LASAGNAS, GRIFTERS, BAKERIES,…MARINARA, REDMEATS, VESUVIUS, …Standard recall/precision tradeoff.

Page 33: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 33

Probabilities to the Rescue?Annotate domain with the bias.

Page 34: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 34

Solution ProbabilityProportional to the product of the

probability of the individual choices.

Can pick sol’n with maximum probability.Maximizes prob. of whole puzzle correct. Won’t maximize number of words correct.

Page 35: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 35

Trivial Pursuit™Race around board, answer questions.Categories: Geography, Entertainment,

History, Literature, Science, Sports

Page 36: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 36

WigwamQA via AQUA (Abney et al. 00)

• back off: word match in order helps score.• “When was Amelia Earhart's last flight?”

• 1937, 1897 (birth), 1997 (reenactment)• Named entities only, 100G of web pages

Move selection via MDP (Littman 00)• Estimate category accuracy.• Minimize expected turns to finish.

Page 37: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 37

Mulder• Question Answering System

  User asks Natural Language question: “Who killed Lincoln?”

  Mulder answers: “John Wilkes Booth”• KB = Web/Search Engines• Domain-independent• Fully automated

Page 38: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 38

Page 39: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 39

User

Architecture

?QueryFormulation

??

?

QuestionParsing

SearchEngine

AnswerExtraction

AnswerSelection

Final Answers

QuestionClassification

Page 40: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 40

Experimental Methodology• Idea: In order to answer n questions,

how much user effort has to be exerted• Implementation:

  A question is answered if • the answer phrases are found in the result

pages returned by the service, or• they are found in the web pages pointed to by

the results.  Bias in favor of Mulder’s opponents

Page 41: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 41

Experimental Methodology• User Effort = Word Distance

  # of words read before answers are encountered

• Google/AskJeeves   query with the original question

Page 42: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 42

Comparison Results

% Questions Answered

70

60

50

40

30

20

10

0

User Effort (1000 Word Distance)

0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0

Mulder

Google

AskJeeves

Page 43: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 43

Know It All• Research project started June 2003• Large scale information extraction

  Domain-independent extraction  PMI-IR  Completeness of web

Page 44: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 44

Domain-independent extraction• Cities such as X, Y, Z

• Movies such as X, Y, Z

Proper nouns

Page 45: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 45

TOEFL SynonymsUsed in college applications.fish

(a) scale(b) angle(c) swim(d) dredge

Turney: PMI-IR

Page 46: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 46

Comprehensive CoverageSearch on: boston seattle paris chicago london

Page 47: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 47

Ubiquitous Computing

Page 48: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 48

Mode Prediction

Page 49: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

© Daniel S. Weld 49

Placelab• Location-aware computing

Page 50: © Daniel S. Weld 1 Take-Home Handed out Friday 12/12 2:30pm Due Monday 12/15 2:30pm Points Worth 2 problem sets

Adapting

UIs to

Device (&

User)

Characteristics