Big Data – What is it?

Post on 23-Feb-2016

What’s the Big Deal about Big Data?

52nd Annual ACMSE Conference

Jennifer Lewis Priestley, Ph.D., Professor of Statistics and Data Science

Big Data – What is it?

Center for Statistics and Analytical Services at Kennesaw State University

VOLUME

VELOCITY

VARIETY

Big Data (noun): a condition present when the volume, variety, and velocity of data exceed an organization’s storage or computing capacity for accurate and timely decision making.

It is NOT just about size.

…but size (volume) is certainly part of the issue…

Number of emails sent every second? 2.9 million

Video uploaded to YouTube every minute? 20 hours

Amount of data processed every day by Google? 24 petabytes

Tweets per day? 50 million

Orders processed by Amazon every second? 73
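The figures above are quoted over different time windows (per second, per minute, per day), which makes them hard to compare. A minimal sketch of putting the per-second and per-minute figures on a per-day basis follows. It assumes each quoted rate holds constant around the clock, which the slide does not state; the function and variable names are illustrative only.

```python
# Figures are copied from the slide. Scaling them to a full day ASSUMES
# a constant rate over 24 hours, which the slide does not claim.
SECONDS_PER_DAY = 24 * 60 * 60   # 86,400
MINUTES_PER_DAY = 24 * 60        # 1,440


def per_day(rate, periods_per_day):
    """Scale a per-period rate to a per-day total, assuming a constant rate."""
    return rate * periods_per_day


emails_per_day = per_day(2_900_000, SECONDS_PER_DAY)    # emails sent
youtube_hours_per_day = per_day(20, MINUTES_PER_DAY)    # hours of video uploaded
amazon_orders_per_day = per_day(73, SECONDS_PER_DAY)    # orders processed

print(f"Emails per day:        {emails_per_day:,}")
print(f"YouTube hours per day: {youtube_hours_per_day:,}")
print(f"Amazon orders per day: {amazon_orders_per_day:,}")
```

The tweet and Google figures are already quoted per day, so they are left as they are.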

…and the costs of storage are dropping…

The total amount of digital data will reach 2.7 zettabytes by the end of this year. Approximately 80 percent of this data will be unstructured…

Unstructured Data = Data

[Figure: Structured Versus Unstructured Data Generated by Year. Series: structured, unstructured. X-axis: year (2010 to 2020). Y-axis: zettabytes of data (0 to 35).]

Big Data – Does it really matter?

Jennifer’s Grandmother

We can’t do things the way we always have…

Big Data Company 1: Coca-Cola

~ 1500 machines around the world

Can dispense about 95 drinks an hour

Can dispense about 125 different drinks

Submits real-time data on:
- Syrup consumed/drink configuration
- Outlet
- Time
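To get a feel for the velocity these numbers imply, here is a rough upper-bound calculation. It assumes one data record per drink dispensed and every machine running at its stated capacity around the clock; neither assumption comes from the slide, and the names are hypothetical.

```python
# Slide figures: about 1500 machines, about 95 drinks per machine per hour.
# ASSUMPTIONS (not from the slide): one record per drink, and every
# machine dispensing at full capacity 24 hours a day.
MACHINES = 1500
DRINKS_PER_MACHINE_PER_HOUR = 95
HOURS_PER_DAY = 24


def max_records_per_hour(machines, drinks_per_hour):
    """Upper bound on records per hour across the whole fleet."""
    return machines * drinks_per_hour


hourly = max_records_per_hour(MACHINES, DRINKS_PER_MACHINE_PER_HOUR)
daily = hourly * HOURS_PER_DAY

print(f"Upper bound per hour: {hourly:,} records")
print(f"Upper bound per day:  {daily:,} records")
```

Real traffic would sit below this ceiling, since machines are idle for much of the day.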

Big Data Company 2: The Home Depot

~ 2300 stores globally

About 40,000 products in each store

Product pricing has to be dynamic

Thousands of vendors

Big Data Company 3: The Southern Company

~ 4.6 million customers

~27,000 power distribution lines

Real time data, every customer

Advanced Metering Infrastructure

What do these companies have in common?

They have all recognized the value of data to their operations.

They have all invested heavily in new hardware and software to capture and store their new data.

They have new hiring needs: Computer Scientists, Statisticians, Mathematicians

This shift has huge implications for universities.

We can’t teach the way we have always taught.

The 1950s called…they want their curriculum back…

So, what does a 21st Century Curriculum look like?

Math, Stat, Computer Science…

Real Big, Real World Datasets…

Better Integration with Practitioners…

More Interdisciplinary Degrees…
