a framework for computing the privacy scores of users in ... · a framework for computing the...

33
A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM Research) September 11, 2009

Upload: others

Post on 21-May-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

A Framework for Computing the Privacy Scores of Users in Online Social Networks

Kun Liu

Yahoo! Labs (previously IBM Research)September 11, 2009

Page 2: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

2 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

The Team

IBM Almaden Research Center

� Tyrone Grandison

� Kun Liu

� Michael (Max) Maximilien

� Evimaria Terzi

IBM Silicon Valley Lab

� Sherry Guo

� Dwayne Richardson

� Tony Sun

Page 3: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM
Page 4: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

4 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Wait a minute, how come the talk is related

Credit Score?

Page 5: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

5 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

From humble beginnings in 1956, Fair Isaac

Corp.'s credit score has come to loom over

consumer finance like no other statistical

measure ever has.

Page 6: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

6 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

So, how about a Privacy Score that indicates the

privacy risk of online social-networking users?

Page 7: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

7 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Do you want to design a privacy score and

other advanced privacy and risk management

mechanisms so that 50 years (or much less)

later, people will appreciate your effort?

Page 8: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

8 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Roadmap

�Motivation and Goal

�Privacy Score and Its Applications

�Privacy Score and Facebook/OpenSocial

�Proof of Concept: The Privacy-Aware Market

Place

�Conclusions

Page 9: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

9 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Motivation

Courtesy to: http://i.ehow.com/images/GlobalPhoto/Articles/4873454/identity-theft-protect-yourself-main_Full.jpg

Millions of users share details of their personal lives with vast networks of friends, and often, strangers

Disclosure of personal info expose the users to identity theft, digital stalking, etc.

Courtesy to: http://www.contrib.andrew.cmu.edu/%7Egct/mygroup.html

Page 10: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

10 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Motivation (Cont.)

How to prevent my ex from seeing my

status updates?

How to hide my friend list

in the search results?

How to prevent the

applications my friends installed

from accessing my information?

All my friends have shared their

hometown and phone number,

maybe I should also do this?

I enjoyed sharing

my daily activities with the

World! But any adverse effects?

My God! What information I have

shared all these years and who can

view these information?

Page 11: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

11 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Goal

� Our goal is to develop a mechanism that is able to

� Measure/monitor the privacy risk of social-networking users

� Boost public awareness of privacy

� Help users to easily manage their information sharing

� Our goal is NOT to prevent people from sharing information

� How to achieve this goal?

� Privacy-score calculation

� Comprehensive information-sharing report

� Privacy-settings recommendation

� and more …

Page 12: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

12 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Life Cycle of Privacy Score

Privacy Risk Monitoring

privacy score of the user

Privacy Settings Recommendation

Privacy Score

Calculation

Utilize Privacy Scores

Comprehensive Privacy Report

Privacy Settings

Page 13: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

13 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Roadmap

�Motivation and Goal

�Privacy Score and Its Applications

�Privacy Score and Facebook/OpenSocial

�Proof of Concept: The Privacy-Aware Market

Place

�Conclusions

Page 14: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

14 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Basic Premises of Privacy Score

� Sensitivity: The more sensitive the information revealed by

a user, the higher his privacy risk.

� Visibility: The wider the information about a user spreads, the higher his privacy risk.

mother’s maiden name is more sensitive than mobile-phone numbermother’s maiden name is more sensitive than mobile-phone number

home address known by everyone poses higher risks than by friends onlyhome address known by everyone poses higher risks than by friends only

Page 15: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

15 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

The Framework

Privacy Score of User j due to Profile Item i

PR( , )= ( , ).i

i j V i jβ ×

sensitivity of profile item i visibility of profile item i

name, or gender, birthday, address,

phone number, degree, job, etc.

Page 16: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

16 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

The Framework

Privacy Score of User j due to Profile Item i

PR( , )= ( , ).i

i j V i jβ ×

sensitivity of profile item i visibility of profile item i

Overall Privacy Score of User j

PR( ) PR( , ) ( , ).i

i i

j i j V i jβ= = ×∑ ∑

name, or gender, birthday, address,

phone number, degree, job, etc.

Page 17: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

17 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

The Naïve Approach

R(n, N)R(n, 1)

R(i, j)

R(1, N)R(1, 2)R(1, 1)

User_1 User_j User_N

Profile Item_1

(birthday)

Profile Item_i

(cell phone #)

Profile Item_n

share, R(i, j) = 1

not share, R(i, j) = 0

Page 18: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

18 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

The Naïve Approach

R(n, N)R(n, 1)

R(i, j)

R(1, N)R(1, 2)R(1, 1)

User_1 User_j User_N

Profile Item_1

(birthday)

Profile Item_i

(cell phone #)

Profile Item_n

share, R(i, j) = 1

not share, R(i, j) = 0

| | ( , )i

j

R R i j=∑

| |i

i

N R

−=Sensitivity:

Page 19: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

19 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

The Naïve Approach

R(n, N)R(n, 1)

R(i, j)

R(1, N)R(1, 2)R(1, 1)

User_1 User_j User_N

Profile Item_1

(birthday)

Profile Item_i

(cell phone #)

Profile Item_n

share, R(i, j) = 1

not share, R(i, j) = 0

| | ( , )i

j

R R i j=∑

| | ( , )j

i

R R i j=∑

| |i

i

N R

−=Sensitivity:

( , ) Pr{ ( , ) 1} 1 Pr{ ( , ) 0} 0V i j R i j R i j= = × + = ×Visibility:

Page 20: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

20 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

The Naïve Approach

R(n, N)R(n, 1)

R(i, j)

R(1, N)R(1, 2)R(1, 1)

User_1 User_j User_N

Profile Item_1

(birthday)

Profile Item_i

(cell phone #)

Profile Item_n

share, R(i, j) = 1

not share, R(i, j) = 0

| | ( , )i

j

R R i j=∑

| | ( , )j

i

R R i j=∑

| |i

i

N R

−=Sensitivity:

( , ) Pr{ ( , ) 1} 1 Pr{ ( , ) 0} 0V i j R i j R i j= = × + = ×Visibility:

| | | |Pr{ ( , ) 1}

j

iR R

R i jN n

= = ×

Page 21: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

21 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Item Response Theory (IRT)

� IRT (Lawley,1943 and Lord,1952) has its origin in psychometrics.

� It is used to analyze data from questionnaires and tests.

� It is the foundation of Computerized Adaptive Test like GRE, GMAT

Ability

Item Characteristic Curve (ICC)

Pro

bab

ilit

y o

f C

orr

ect

Resp

on

se

Page 22: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

22 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Item Characteristic Curve (ICC)

( )

1Pr{ ( , ) 1}

1 i j i

R i je

α θ β− −= =

+

Student j’s ability

Question i’s difficulty

Question i’s discrimination

Pro

bab

ilit

y o

f C

orr

ect

Resp

on

se

Question 1

Question 2

Page 23: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

23 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

The Item Response Theory (IRT) Approach

R(n, N)R(n, 1)

R(i, j)

R(1, N)R(1, 2)R(1, 1)

User_1 User_j User_N

Profile Item_1

(birthday)

Profile Item_i

(cell phone #)

Profile Item_n

share, R(i, j) = 1

not share, R(i, j) = 0

( )

1Pr{ ( , ) 1}

1 i j iij

P R i je

α θ β− −= = =

+

Profile item’s discrimination

User’s attitude, e.g., conservative or extrovert

Profile item’s sensitivity

Profile item i’s true visibility

Page 24: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

24 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Calculating Privacy Score using IRT

iβSensitivity: Visibility:

Overall Privacy Score of User j

PR( ) ( , ).i

i

j V i jβ= ×∑

( , ) Pr{ ( , ) 1}V i j R i j= =

( )

1Pr{ ( , ) 1}

1 i j iij

P R i je

α θ β− −= = =

+

byproducts: profile item’s discrimination and user’s attitude

All parameters can be estimated using Maximum Likelihood Estimation and Expectation-Maximization.

Page 25: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

25 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Advantages of the IRT Model

� The mathematical model fits the observed data well

� The quantities IRT computes (i.e., sensitivity, attitude and

visibility) have intuitive interpretations

� Computation is parallelizable using e.g. MapReduce

� Privacy scores calculated within different social networks are comparable

Page 26: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

26 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Interesting Results from User Study

Sensitivity of The Profile Items Computed by IRT Model

•49 profile items

•153 users from 18 countries/regions

•53.3% are male and 46.7% are female

•75.4% are in the age of 23 to 39

•91.6% hold a college degree or higher

•76.0% spend 4+ hours online per day

Statistics

We collected the information-sharing

preferences of 153 users on 49 profile items such as name, gender, birthday,

political views, address, phone number,

degree, job, etc.

Survey

Average Privacy Scores Grouped by Geographical Regions

Page 27: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

27 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Utilize Privacy Scores

� Privacy Risk Monitoring

� Privacy (Information Sharing) Report

� Privacy Settings Recommendation

Score: 100 ~ 150 Score: 100 ~ 150

Score: 100 ~ 150

Score: 100 ~ 150

Score: 100 ~ 150

Page 28: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

28 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Roadmap

� Motivation and Goal

� Privacy Score and Its Applications

� Privacy Score and Facebook/OpenSocial

� Proof of Concept: The Privacy-Aware Market Place

� Conclusions

Page 29: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

29 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Privacy Score and Facebook/OpenSocial

Enable application developers to

implement their own Privacy Scores.Enable application developers to

implement their own Privacy Scores.

Provide native implementation of

Privacy Score computation.Provide native implementation of

Privacy Score computation.

Enable application developers to build

Information Sharing Report modules and

Privacy Settings Recommendation modules.

Enable application developers to build

Information Sharing Report modules and

Privacy Settings Recommendation modules.

Page 30: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

30 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Roadmap

� Motivation and Goal

� Privacy Score and Its Applications

� Privacy Score and Facebook/OpenSocial

� Proof of Concept: The Privacy-Aware Market Place

� Conclusions

Page 31: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

31 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Privacy-aware Marketplace (PaMP)

http://apps.facebook.com/p_a_m_p

Page 32: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

32 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Conclusions

� In this talk, we have discussed

� the importance of privacy score and privacy management

� two ways to compute privacy score using privacy settings

� Our goal is to develop a mechanism and a platform that will

� Measure/monitor the privacy risk of social-networking users

� Boost public awareness of privacy

� Help users to easily manage their information sharing

� We believe that

� simple and effective privacy management makes one feels safe

and comfortable about sharing information online, which will eventually facilitate the information sharing and integration.

Page 33: A Framework for Computing the Privacy Scores of Users in ... · A Framework for Computing the Privacy Scores of Users in Online Social Networks Kun Liu Yahoo! Labs (previously IBM

33 References: http://www.almaden.ibm.com/cs/projects/iis/ppn/

Thank you and Questions?