privacy, ethics, and future uses of the social web

35
1 Privacy, Ethics, and Future Uses of the Social Web Prepared for Owen Graduate School of Management (Vanderbilt University) April 3, 2014 Matthew A. Russell (Chief Technology Officer @ Digital Reasoning) Twitter: @ptwobrussell & @dreasoning

Upload: matthew-russell

Post on 10-May-2015

365 views

Category:

Internet


0 download

DESCRIPTION

A presentation to the Owen Graduate School of Management (Vanderbilt University) about social media and some of the technology behind the future uses of social media that are likely to shape the future of the Web as we know it.

TRANSCRIPT

Page 1: Privacy, Ethics, and Future Uses of the Social Web

1!

Privacy, Ethics, and Future Uses of the Social Web "Prepared for Owen Graduate School of Management (Vanderbilt University)!April 3, 2014!Matthew A. Russell (Chief Technology Officer @ Digital Reasoning)!Twitter: @ptwobrussell & @dreasoning!! !

Page 2: Privacy, Ethics, and Future Uses of the Social Web

Overview!

•  Intro (5 mins)

•  Mining the Social Web (5 mins)

•  "Know thy data..." (10 mins)

•  "...and know thyself" (15 mins)

•  Wrap Up/Final Q&A (15 mins)

2!

Page 3: Privacy, Ethics, and Future Uses of the Social Web

INTRO!

3!

Page 4: Privacy, Ethics, and Future Uses of the Social Web

Hello, My Name Is ... Matthew!

•  Background in Computer Science •  Data mining, AI, machine learning, etc.

•  CTO @ Digital Reasoning Systems •  Moving toward cognitive computing

•  Author •  5 published books on technology (just for fun)

•  CrossFit, triathlon, Bikram hot yoga •  Stress management

4!

Page 5: Privacy, Ethics, and Future Uses of the Social Web

The only easy day was yesterday.

-- Motto of the U.S. Navy SEALs

5!

Page 6: Privacy, Ethics, and Future Uses of the Social Web

It pays to be a winner.

-- Motto of the U.S. Navy SEALs

6!

Page 7: Privacy, Ethics, and Future Uses of the Social Web

Mining the Social Web!

7!

Page 8: Privacy, Ethics, and Future Uses of the Social Web

Data Exhaust => Digital Fingerprints!

•  World population: ~7B people •  Facebook: 1.15B users •  Twitter: 500M users •  Google+ 343M users •  LinkedIn: 238M users •  ~200M+ blogs (conservative estimate)

8!

Page 9: Privacy, Ethics, and Future Uses of the Social Web

•  An open source software (OSS) project •  http://bit.ly/MiningTheSocialWeb2E

•  A (rewritten) book •  http://bit.ly/135dHfs

•  Accessible to (virtually) everyone •  Virtual machine with turn-key coding templates for

data science experiments •  Think of the book as "premium" support for the OSS

project

Transforming Curiosity Into Insight!

9!

Page 10: Privacy, Ethics, and Future Uses of the Social Web

Table of Contents (1/2)!•  Chapter 1 - Mining Twitter: Exploring Trending Topics,

Discovering What People Are Talking About, and More •  Chapter 2 - Mining Facebook: Analyzing Fan Pages,

Examining Friendships, and More •  Chapter 3 - Mining LinkedIn: Faceting Job Titles,

Clustering Colleagues, and More •  Chapter 4 - Mining Google+: Computing Document

Similarity, Extracting Collocations, and More •  Chapter 5 - Mining Web Pages: Using Natural Language

Processing to Understand Human Language, Summarize Blog Posts, and More

•  Chapter 6 - Mining Mailboxes: Analyzing Who's Talking to Whom About What, How Often, and More

10!

Page 11: Privacy, Ethics, and Future Uses of the Social Web

Table of Contents (2/2)!•  Chapter 7 - Mining GitHub: Inspecting Software

Collaboration Habits, Building Interest Graphs, and More •  Chapter 8 - Mining the Semantically Marked-Up Web:

Extracting Microformats, Inferencing over RDF, and More •  Chapter 9 - Twitter Cookbook •  Appendix A - Information About This Machine's Virtual

Machine Experience •  Appendix B - OAuth Primer •  Appendix C - Python and IPython Notebook Tips & Tricks

11!

Page 12: Privacy, Ethics, and Future Uses of the Social Web

Anatomy of Each Chapter!

•  Brief Intro •  Objectives •  API Primer •  Analysis Technique(s) •  Data Visualization •  Recap •  Suggested Exercises •  Recommended Resources

12!

Page 13: Privacy, Ethics, and Future Uses of the Social Web

Why You Should Use IPython Notebook!

•  Because it's great for hacking •  And hacking is usually the first step

•  Because it's great for collaboration •  Sharing/publishing results is trivial

•  Because the UX is as easy as working in a notepad •  Think of it as "executable paper"

•  In short, it's a terrific learning platform for novices and experts alike

13!

Page 14: Privacy, Ethics, and Future Uses of the Social Web

14!

Page 15: Privacy, Ethics, and Future Uses of the Social Web

15!

Page 16: Privacy, Ethics, and Future Uses of the Social Web

"Know thy data..."!

16!

Page 17: Privacy, Ethics, and Future Uses of the Social Web

If we have data, let’s look at data. If we have opinions, let’s go with mine.

--Jim Barksdale

17!

Page 18: Privacy, Ethics, and Future Uses of the Social Web

In God we trust. All others must bring data.

--W. Edwards Deming

18!

Page 19: Privacy, Ethics, and Future Uses of the Social Web

Communication => Data!

•  Communication •  Senders

•  humans & machines

•  Messages •  natural language, images, videos, etc.

•  Recipients •  humans & machines

19!

Page 20: Privacy, Ethics, and Future Uses of the Social Web

Data Alchemy!

•  Data: Documents & document fragments (text

messages, etc.)

•  Information: "Assertions", summaries, tags, etc.

•  Knowledge: Aggregated, query-able information

•  Wisdom: “Compressed” knowledge

•  Gold: Money

20!

Page 21: Privacy, Ethics, and Future Uses of the Social Web

Data Mining = Curiosity + Stats!

•  Curiosity •  Interests, desires, and intuitions

•  Statistics •  Counting •  Comparing •  Filtering •  Ranking

•  Hypothesis testing; knowledge discovery

21!

Page 22: Privacy, Ethics, and Future Uses of the Social Web

Machine Learning!•  A program that learns (improves)

from experience (data) according to some objective •  Supervised learning •  Unsupervised learning •  Reinforcement learning

•  How to do it •  Program mathematical

models and hope for the best...

•  How to do it well •  Program state-of-the-art

mathematical models with sufficient representative data

22!

Page 23: Privacy, Ethics, and Future Uses of the Social Web

Knowledge is a process of piling up facts; wisdom lies in their simplification.

--Martin Fischer

23!

Page 24: Privacy, Ethics, and Future Uses of the Social Web

Any sufficiently advanced technology is indistinguishable from magic.

--Arthur C. Clarke

24!

Page 25: Privacy, Ethics, and Future Uses of the Social Web

"...and know thyself"!

25!

Page 26: Privacy, Ethics, and Future Uses of the Social Web

Is Privacy Already an Illusion?!

•  Digital happenings circa 2014 •  The Cloud •  Social Media •  Deep Learning •  The Internet of Things •  Internet.org

26!

Page 27: Privacy, Ethics, and Future Uses of the Social Web

Civilization is the progress toward a society of privacy.

--Ayn Rand

27!

Page 28: Privacy, Ethics, and Future Uses of the Social Web

If you have something that you don’t want anyone to know,

maybe you shouldn’t be doing it in the first place.

-- Eric Schmidt, (former) CEO of Google

28!

Page 29: Privacy, Ethics, and Future Uses of the Social Web

Influences on Ethics!•  Capitalism, economics, & marketing

•  A for-profit corporation's fiduciary duty: To maximize the common stock's value

•  How to do it? By transacting commerce •  How do it well? By advertising more effectively

than competitors •  How to do it really well? With highly relevant

personalized ads (recommenders) •  Terms of Service (ToS) - The legal extent of

ethical obligations?

29!

Page 30: Privacy, Ethics, and Future Uses of the Social Web

If you're not paying for the product, you are the product.

-- Savvy consumers everywhere one day (?)

30!

Page 31: Privacy, Ethics, and Future Uses of the Social Web

For the wisdom of this world is foolishness...

-- Saint Paul

31!

Page 32: Privacy, Ethics, and Future Uses of the Social Web

The Future of the Web...!•  The Blue Pill: All of your precious data housed remotely

and controlled by a few of the world's most powerful international corporations

•  The Red Pill: A distributed cloud controlled by no one with decentralized data and anonymity online as the status quo

•  The Purple Pill: Meet somewhere in the middle (?) •  Significant legislative reforms concerning consumer

data (?) •  Consumer education with more transparency (?) •  Resurgence of local/offline storage and anonymity

online (?)

32!

Page 33: Privacy, Ethics, and Future Uses of the Social Web

The real danger is the gradual erosion of individual liberties through automation,

integration, and interconnection of many small, separate record-keeping systems,

each of which alone may seem innocuous, even benevolent, and wholly justifiable.

-- Anonymous (U. S. Privacy Study Commission, 1977)

33!

Page 34: Privacy, Ethics, and Future Uses of the Social Web

.  

There are two primary choices in life: to accept conditions as they exist,

or accept the responsibility for changing them.

-- Dennis Waitley

34!

Page 35: Privacy, Ethics, and Future Uses of the Social Web

WRAP-UP / Q&A!

35!