a chatbot for privacy policies - swisstext...7 rely on secondary devices with screen? what can we do...

50
PriBot A Chatbot for Privacy Policies Hamza Harkous , Kassem Fawaz, Rémi Lebret, Florian Schaub, Kang G. Shin, Karl Aberer

Upload: others

Post on 21-May-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

PriBot

A Chatbot for Privacy Policies

Hamza Harkous, Kassem Fawaz, Rémi Lebret, Florian Schaub, Kang G. Shin, Karl Aberer

Page 2: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

2

Problem?

Page 3: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

3

Page 4: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

4

Solution?Let’s turn them to a QA conversation?

Page 5: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

UI-Limited Interfaces: Voice-activated Devices

5

Page 6: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

6

Read the whole policy?

What can we do with the current machinery?

Page 7: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

7

What can we do with the current machinery?

Page 8: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

7

Rely on secondary devices with screen?What can we do with the current machinery?

Page 9: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

UI-Limited Interfaces: Voice-activated Devices

8

Page 10: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

9

Usability Privacy

Page 11: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Customer Support

10

Page 12: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Customer Support

10

Automated

Page 13: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Policy

Question

Automated QA Approach

Page 14: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Policy

Question

6A1

A2…

Segmenter

Automated QA Approach

Page 15: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Policy

QuestionPrivacy QA

Ranking Algorithm

A1A2

Q

6A1

A2…

Segmenter

Automated QA Approach

Page 16: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Policy

QuestionPrivacy QA

Ranking Algorithm

A1A2

Q

A4

A11

A28

QA Interface:

Chatbot, Voice Assistant,

Twitter Bot

6A1

A2…

Segmenter

Automated QA Approach

Page 17: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Policy

Question A4

A11

A28

QA Interface:

Chatbot, Voice Assistant,

Twitter Bot

6A1

A2…

Segmenter Privacy QA Ranking

Algorithm

A1A2

Q

Page 18: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Policy

Question A4

A11

A28

QA Interface:

Chatbot, Voice Assistant,

Twitter Bot

6A1

A2…

Segmenter Privacy QA Ranking

Algorithm

A1A2

Q

Page 19: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

15

To whom do you expose my content?

Ranking Challenges

Page 20: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

15

To whom do you expose my content?

Ranking Challenges

1. User wording is different from policies wording.

Page 21: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

15

To whom do you expose my content?

Ranking Challenges

1. User wording is different from policies wording.

2. Difficulty of accounting for the general topic: • Is "content" about the third parties or the first party?

Page 22: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Advantage of Word Embeddings

16

Page 23: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Advantage of Word Embeddings

16

Using a general embeddings, such as GloVe embeddings (Wikipedia14 + Gigaword 5), allows matching words in the

policies to words used by users.

Page 24: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Neural Networks feed on labelled data..

17

Page 25: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Neural Networks feed on labelled data..

17

How to get such data?

Page 26: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Neural Networks feed on labelled data..

17

How to get such data?

We don't have QA data.

Page 27: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Neural Networks feed on labelled data..

17

How to get such data?

We don't have QA data.

Can we survive with classification data?

Page 28: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

19

You can modify information you have given us. To correct or delete information or update account settings, log into your account and follow the instructions. We make changes as soon as we can. This information may stay in our backup files. If we cannot make the changes you want, we will let you know and explain why. If you contact us requesting access to your information, we will respond within 30 days.

You can control cookies and tracking tools. To learn how to manage how we - and our vendors - use cookies and other tracking tools, please click here.

*Wilson et al., ACL 2016; usableprivacy.org/data

User Access, Edit & Deletion

Access Type: Edit Information

Expert Annotations

Online Privacy Policies Dataset (OPP)*

Page 29: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Online Privacy Policies Dataset• 115 annotated policies • 23K annotations

1st Party Collection

Collection Mode

Information Type

Purpose

3rd Party Collection

Action

Information Type

Purpose

Choice, Control

Choice Type

Choice Scope

Access, Edit, Delete

Access Rights

Data Retention

Retention Period

Retention Purpose

Information Type

Data Security

Security Measure

Specific Audiences

Audience group

Do Not Track

Do Not Track Policy

Policy Change

Notification Type

Other

Introductory

Contact Information

Practice not covered

20

Page 30: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

22

To whom do you expose my content?

1. User wording is different from policies wording.

2. Difficulty of accounting for the general topic: • Is "content" about the third parties or the first party?

Ranking Challenges

Page 31: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

22

To whom do you expose my content?

1. User wording is different from policies wording.

2. Difficulty of accounting for the general topic: • Is "content" about the third parties or the first party?

Ranking Challenges

✓✓

Page 32: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Twitter Evaluation Dataset

• Search for unbiased keywords in replies:

• e.g.,: "check our privacy policy"

23

• Backtrack company replies to questions

Page 33: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Evaluation

24

• Predictive Accuracy

• User-perceived Utility

Page 34: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Predictive Accuracy

25

Page 35: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Predictive Accuracy

25

A1A26

A1

A2…

Segmenter

Page 36: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Predictive Accuracy

25

A1A26

A1

A2…

Segmenter

Privacy QA Ranking

Algorithm

A4

A11

A28

top-3

Page 37: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Predictive Accuracy

25

Two Experts

A5, A11

A1A26

A1

A2…

Segmenter

Privacy QA Ranking

Algorithm

A4

A11

A28

top-3

Page 38: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Predictive Accuracy

25

Two Experts

A5, A11

A1A26

A1

A2…

Segmenter

How many questions

have an expert answer

in top-k?

Privacy QA Ranking

Algorithm

A4

A11

A28

top-3

Page 39: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

User-Perceived Utility•Methodology• Between subject study with 4 groups

27

Page 40: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

User-Perceived Utility•Methodology• Between subject study with 4 groups• 1186 participants from MTurk (15 QA pairs per user)

27

Page 41: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

UX: A Key to Chatbots' Success

29

Page 42: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

UX: A Key to Chatbots' Success

• User experience is key: • animations, time to answer, readability, failsafe,

29

Page 43: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

UX: A Key to Chatbots' Success

• User experience is key: • animations, time to answer, readability, failsafe,

• Balance between accuracy and usability

29

Page 44: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

UX: A Key to Chatbots' Success

• User experience is key: • animations, time to answer, readability, failsafe,

• Balance between accuracy and usability

29

Page 45: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

UX: A Key to Chatbots' Success

• User experience is key: • animations, time to answer, readability, failsafe,

• Balance between accuracy and usability

• Not everything has to be DL-based: • DL for the core functionality • External framework for managing interactions

29

Page 46: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Take-aways

30

Page 47: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Take-aways

• Limited-UI devices and hands-free interactions • Traditional privacy notice delivery methods do not apply

30

Page 48: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Take-aways

• Limited-UI devices and hands-free interactions • Traditional privacy notice delivery methods do not apply

• Solution: PriBot • Answers, automatically, user free-form question from policies • Provides answers that have high accuracy and relevance in real-time

30

Page 49: A Chatbot for Privacy Policies - SwissText...7 Rely on secondary devices with screen? What can we do with the current machinery?

Take-aways

• Limited-UI devices and hands-free interactions • Traditional privacy notice delivery methods do not apply

• Solution: PriBot • Answers, automatically, user free-form question from policies • Provides answers that have high accuracy and relevance in real-time

• Applications: • Compare privacy practices of different companies • Use for privacy-related customer service

30