collaborative manipulation in natural language

70
Spatial References and Perspective in Natural Language Instructions for Collaborative Manipulation Rosario Scalise, Shen Li Henny Admoni, Stephanie Rosenthal, Siddhartha S. Srinivasa 1

Upload: others

Post on 24-Mar-2022

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Collaborative Manipulation in Natural Language

Spatial References and Perspective in Natural Language Instructions for

Collaborative Manipulation

Rosario Scalise, Shen LiHenny Admoni, Stephanie Rosenthal, Siddhartha S. Srinivasa

1

Page 2: Collaborative Manipulation in Natural Language

2

● Background, why tabletop is important● Problem: object uniqueness

○ Solution 1: spatial reference○ Solution 2: perspective

● Study 1○ Image generation○ Study design○ Result

■ Human vs robot■ Visual search + word frequencies■ Difficulty

● Study 2○ Data coding○ Study design○ Result

■ Block ambiguity■ Perspective

● Discussion○ 3 approaches to give instructions○ Block ambiguity and perspective ambiguity○ Neither perspective is the best○ Future work - interactivity

Page 3: Collaborative Manipulation in Natural Language

3

Herb image courtesy of Pittsburgh Post-Gazette

Page 4: Collaborative Manipulation in Natural Language

4

Page 5: Collaborative Manipulation in Natural Language

5

“I am going to pick up the cup on the right!”

Page 6: Collaborative Manipulation in Natural Language

6

Key Issue: Ambiguity

Question by Jessica Lock from the Noun Project

Page 7: Collaborative Manipulation in Natural Language

7

Key Issue: Ambiguity

As scene complexity increases, so does the difficulty in specifying an object.

Page 8: Collaborative Manipulation in Natural Language

8

Key Issue: Ambiguity

As scene complexity increases, so does the difficulty in specifying an object.

Natural language is inherently ambiguous.

Page 9: Collaborative Manipulation in Natural Language

Forms of Ambiguity

9

Visual Appearance

“Pick up the coffee cup.”

Page 10: Collaborative Manipulation in Natural Language

Forms of Ambiguity

10

Visual Appearance

“Pick up the coffee cup.”

Which one?

Page 11: Collaborative Manipulation in Natural Language

Forms of Ambiguity

11

Perspective

“Pick up the coffee cup on the right.”

Page 12: Collaborative Manipulation in Natural Language

Forms of Ambiguity

12

Perspective

“Pick up the coffee cup on the right.”

Whose right?

Page 13: Collaborative Manipulation in Natural Language

Forms of Ambiguity

13

Proximity

“Pick up the coffee cup next to the donuts.”

Page 14: Collaborative Manipulation in Natural Language

Forms of Ambiguity

14

Proximity

“Pick up the coffee cup next to the donuts.”

How close is ‘next to’?

Page 15: Collaborative Manipulation in Natural Language

15

Page 16: Collaborative Manipulation in Natural Language

16

Can youuniquely

describethis block?

Page 17: Collaborative Manipulation in Natural Language

How can we best overcome ambiguity when grounding our references while keeping communication natural?

17

Page 18: Collaborative Manipulation in Natural Language

Approach

Learn by observing what humans do and extract best-practices from the examples that are most successful.

18

Page 19: Collaborative Manipulation in Natural Language

19bender by Jordan Díaz Andrés from the Noun Project

Page 20: Collaborative Manipulation in Natural Language

20bender by Jordan Díaz Andrés from the Noun Project

Collect Corpus

Page 21: Collaborative Manipulation in Natural Language

21bender by Jordan Díaz Andrés from the Noun Project

Collect CorpusGain Insights

Page 22: Collaborative Manipulation in Natural Language

22bender by Jordan Díaz Andrés from the Noun Project

Collect CorpusGain Insights

Evaluate Corpus

Page 23: Collaborative Manipulation in Natural Language

23bender by Jordan Díaz Andrés from the Noun Project

Collect CorpusGain Insights

Evaluate CorpusExtract Guidelines

Page 24: Collaborative Manipulation in Natural Language

24bender by Jordan Díaz Andrés from the Noun Project

Collect CorpusGain Insights

Evaluate CorpusExtract Guidelines

+ Analysis Tools

Page 25: Collaborative Manipulation in Natural Language

25

Page 26: Collaborative Manipulation in Natural Language

26

Page 27: Collaborative Manipulation in Natural Language

Study 1 : Collecting Instructions for Corpus

27

Page 28: Collaborative Manipulation in Natural Language

Study 1 : Collecting Instructions for Corpus

28

person

person

Page 29: Collaborative Manipulation in Natural Language

Study 1 : Collecting Instructions for Corpus

29

robot

Page 30: Collaborative Manipulation in Natural Language

Study 1 : Collecting Instructions for Corpus

30

robot

1400 Total

Page 31: Collaborative Manipulation in Natural Language

Evaluating

31

How do we tell how good any specific instruction is?

“Pick up the blue block”

Page 32: Collaborative Manipulation in Natural Language

Evaluating

32

Given an instruction and the stimulus it corresponds to, can people infer the correct block?

“Pick up the blue block”

Page 33: Collaborative Manipulation in Natural Language

Evaluating

33

Given an instruction and the stimulus it corresponds to, can people infer the correct block?

“Pick up the blue block”

Page 34: Collaborative Manipulation in Natural Language

Study 2 : Corpus Evaluation

34

Page 35: Collaborative Manipulation in Natural Language

Metrics

35

For each instruction, we calculate:

Page 36: Collaborative Manipulation in Natural Language

Metrics

36

Accuracy: # of successful block selections

For each instruction, we calculate:

total # of times instruction is shown

Page 37: Collaborative Manipulation in Natural Language

Metrics

37

Accuracy: # of successful block selections

For each instruction, we calculate:

total # of times instruction is shown

Avg. Completion time: How long it takes to select the indicated block on average

Page 38: Collaborative Manipulation in Natural Language

38

Full investigation and results TBR in:

“Spatial References and Perspective in Natural Language Instructions for Collaborative Manipulation”

at IEEE Ro-Man 2016 (Late August)

Page 39: Collaborative Manipulation in Natural Language

Perspectives

39

Page 40: Collaborative Manipulation in Natural Language

40

Partner

Participant (Speaker)

Types of Perspective:

Page 41: Collaborative Manipulation in Natural Language

41

Partner

Participant (Speaker)

Partner:“Pick up the blue block on your left”

Types of Perspective:

Page 42: Collaborative Manipulation in Natural Language

42

Partner

Participant (Speaker)

Participant:“Pick up the blue block on my right”

Partner:“Pick up the blue block on your left”

Types of Perspective:

Page 43: Collaborative Manipulation in Natural Language

43

Partner

Participant (Speaker)

Participant:“Pick up the blue block on my right”

Partner:“Pick up the blue block on your left”

Neither:“Pick up the blue block closest to the orange block.”

Types of Perspective:

Page 44: Collaborative Manipulation in Natural Language

44

Partner

Participant (Speaker)

Participant:“Pick up the blue block on my right”

Partner:“Pick up the blue block on your left”

Neither:“Pick up the blue block closest to the orange block.”

Unknown:“Pick up the blue block to the right of the orange block.”

Types of Perspective:

Page 45: Collaborative Manipulation in Natural Language

Perspective vs

Accuracy and Completion Time

45

Page 46: Collaborative Manipulation in Natural Language

46

Pick up the box furthest to your left.

Partner perspective

Partner

Participant

Page 47: Collaborative Manipulation in Natural Language

47

Pick up the box furthest to your left.

Partner

Participant

Page 48: Collaborative Manipulation in Natural Language

48

Pick up the box furthest to your left.

Partner

Participant

Page 49: Collaborative Manipulation in Natural Language

49

Pick up the orange block closest to my right hand side.

Participant perspective

Partner

Participant

Page 50: Collaborative Manipulation in Natural Language

50

Pick up the orange block closest to my right hand side.

Partner

Participant

Page 51: Collaborative Manipulation in Natural Language

51

Pick up the orange block closest to my right hand side.

Partner

Participant

Page 52: Collaborative Manipulation in Natural Language

52

Please pick up the orange block that is closest to me.

Partner

Participant

Neither perspective

Page 53: Collaborative Manipulation in Natural Language

53

Please pick up the orange block that is closest to me.

Partner

Participant

Page 54: Collaborative Manipulation in Natural Language

54

Pick up the rightmost orange block

Partner

Participant

Right to ???

Page 55: Collaborative Manipulation in Natural Language

55

Pick up the rightmost orange block

Partner

Participant

Unknown perspective

Page 56: Collaborative Manipulation in Natural Language

Hypothesis:

Neither Perspective is better

56

Page 57: Collaborative Manipulation in Natural Language

57

Page 58: Collaborative Manipulation in Natural Language

58

Page 59: Collaborative Manipulation in Natural Language

Result:

Prefer Neither Perspective

59

Page 60: Collaborative Manipulation in Natural Language

Other Factors

60

Page 61: Collaborative Manipulation in Natural Language

61

Pick the blue block that is closer to you and right next to the yellow block

Partner

Participant

Neither perspective

Page 62: Collaborative Manipulation in Natural Language

62

Pick the blue block that is closer to you and right next to the yellow block

Partner

Participant

Neither perspective

Page 63: Collaborative Manipulation in Natural Language

63

Pick the blue block that is closer to you and right next to the yellow block

Partner

Participant

Neither perspective

Page 64: Collaborative Manipulation in Natural Language

64

Pick up the blue block on your far right.

Partner

Participant

Partner perspective

Page 65: Collaborative Manipulation in Natural Language

65

Pick up the blue block on your far right.

Partner

Participant

Partner perspective

Page 66: Collaborative Manipulation in Natural Language

Tradeoff

66

Page 67: Collaborative Manipulation in Natural Language

Robot Partner vs Human Partner

67

Page 68: Collaborative Manipulation in Natural Language

68

Robot Partner

Human Partner

Pick up the third blue block from your left

Page 69: Collaborative Manipulation in Natural Language

Spatial References and Perspective in Natural Language Instructions for

Collaborative Manipulation

Rosario Scalise, Shen [email protected], [email protected]

69

Page 70: Collaborative Manipulation in Natural Language

70

Thank You!

Learn More @ Poster Session

Investigated

Visual features

Perspectives

Dataset will be made available soon!

Perspectives