big data overview

38
BIG DATA Arnon Rotem-Gal-Oz Director of Technology Research, Amdocs The blind men and the elephant. Poem by John Godfrey Saxe (Cartoon originally copyrighted by the authors; G. Renee Guzlas, artists http://www.nature.com/ki/journal/v62/n5/fig_tab/4493262f1.html

Upload: arnon-rotem-gal-oz

Post on 10-Sep-2014

1.613 views

Category:

Software


2 download

DESCRIPTION

An introduction to big data. What's big data, why we'd want it , how is it applicable to CSPs, short intro to Hadoop (some of the info is in the slide notes)

TRANSCRIPT

Page 1: Big data Overview

BIG DATAArnon Rotem-Gal-Oz

Director of Technology Research, AmdocsThe blind men and the elephant. Poem by John Godfrey Saxe (Cartoon originally copyrighted by the authors; G. Renee Guzlas, artists http://www.nature.com/ki/journal/v62/n5/fig_tab/4493262f1.html

Page 2: Big data Overview

1880 US Census

Page 3: Big data Overview
Page 4: Big data Overview

HollerithTabulating Machine

Hollerith photos by Martin Wichary : http://www.flickr.com/photos/mwichary/4358926764/in/photostream/

Page 5: Big data Overview
Page 6: Big data Overview

Source: Silicon Angle http://siliconangle.com/blog/2013/11/13/how-big-is-big-data-really/

Big data happens when the data you have to process is bigger than what you can process in the given time with current

technologies

Page 7: Big data Overview

Myth: Big data = keep all data

Source: Big Data Public Private Forum : http://www.big-project.eu/sites/default/files/D2.2.1_First%20draft%20of%20Technical%20white%20papers_FINAL_v1.01_0.pdf

Page 8: Big data Overview

Source: Big Data Public Private Forum : http://www.big-project.eu/sites/default/files/D2.2.1_First%20draft%20of%20Technical%20white%20papers_FINAL_v1.01_0.pdf

Page 9: Big data Overview

Some Telco Numbers

Source: Wikipediahttp://upload.wikimedia.org/wikipedia/commons/5/50/Telephone_operators,_1952.jpg

Page 10: Big data Overview

So, what do we do with all this data?

Source: Wikipedia http://upload.wikimedia.org/wikipedia/commons/0/06/UPS_Truck.jpg

Page 11: Big data Overview

It’s the insights, stupid*

* With apologies to Bill Clinton

Page 12: Big data Overview

Source: Silicon Angle http://siliconangle.com/blog/2013/11/13/how-big-is-big-data-really/

Big data analytics is when sample = N

• Big data happens when the data you have to process is bigger than what you can process in the given time with current technologies

Page 13: Big data Overview
Page 15: Big data Overview

We need to watch out thatAnalytics won’t get too creepy

Page 16: Big data Overview

When people hear big data they think

fast data

Source: Steve Jones Cap Geminihttp://www.no.capgemini.com/node/778541

Page 17: Big data Overview

Subscribers

Collect& Filter Correlate

(simplified) Network proactive care flow

Account

Event Store

Identify & Predict NetworkFailures

ReimburseVIPs

Prioritize technicians

Identify impact on

high valued Accounts

Page 18: Big data Overview

Source: Silicon Angle http://siliconangle.com/blog/2013/11/13/how-big-is-big-data-really/

Big data is when we can handle data fast enough to make a difference

• Big data happens when the data you have to process is bigger than what you can process in the given time with current technologies

• Big data analytics is when sample = N

Page 19: Big data Overview

Technology space

Page 20: Big data Overview

The Elephant in the room

Page 21: Big data Overview

Hadoop Stack

Map/Reduce

HDFS

HBase

PigHive

ZooKeeper

Oozie MahoutGiraph

Page 22: Big data Overview

Schema on read

Page 23: Big data Overview

Move data to computation

Page 24: Big data Overview

Maybe we should rethink moving data to computation…

Source : http://my-inner-voice.blogspot.co.il/2012/06/haddop-101-paper-by-miha-ahronovitz-and.html

Page 25: Big data Overview
Page 26: Big data Overview

Map/reduce

Source: http://www.bodhtree.com/blog/2012/10/18/ever-wondered-what-happens-between-map-and-reduce/

Page 27: Big data Overview

Customer Segmentation

First name

Last name

ARPU Age Device Country …

Mr. Smith 100 22 iPhone 5s,White USA

John Doe 87 42 Samsung Galaxy S5,Gold France

Lady In Red 105 21 Samsung Note 3, White UK

Uluru, Australia by Stuart Edwards (cc) http://en.wikipedia.org/wiki/Uluru#mediaviewer/File:Uluru_Panorama.jpg

Page 28: Big data Overview

K-Means

ARPU

Age

Source : http://pypr.sourceforge.net/kmeans.html

Page 29: Big data Overview

K=3AR

PU

Age

ARPU

Age

Source : http://pypr.sourceforge.net/kmeans.html

Page 30: Big data Overview

New paradigms

Map/Reduce

HDFS

HBase

PigHive

ZooKeeper

Oozie MahoutGiraph

Page 31: Big data Overview

New Paradigms

Map/Reduce

HDFS

HBase

Pig HiveZoo

Keeper

Oozie Mahout

YARN

Giraph

Page 32: Big data Overview

New Paradigms

Map/Reduce

HDFS

HBase

Pig HiveZoo

Keeper

Oozie Mahout

YARN

Giraph SparkStorm

Slider

Flink

Impala

Tez

Presto

Page 33: Big data Overview

Amdocs Analytics & Data Management Heritage

2013

• Proactive Care• TerraScale• Network optimization

• Real time analytics platform

• Single product catalog

• BSS–OSS Integration

• CRM-Billing Integration

OSSAnalytics Platform,

16 Analytics Patents

• aLDM logical data model

• Policy control

Network AnalyticsCRM

2000 2008

Acqu

isiti

ons

Portf

olio

Page 34: Big data Overview

34Information Security Level 2 – Sensitive© 2014 – Proprietary and Confidential Information of Amdocs

Touchpoints & Applications

CRM Self Service E-MailPCRF SMS OtherWi-Fi OffloadCampaign Mng. • • • • • • •

Operational Envelope & Platform Administration

• Security Management

• Configuration Management

• Services Inventory

• Performance Management

• Fault Management

• LoggerCollect & Ingest

Transform & Enrich

Aggregate & Correlate

Drive Insight

Close the Loop

Machine Learn &

ScoreApplication-Ready Data and Analytics/ML Insights

Entities and Profiles

Detailed Data

OSSProbes Social RAN Inventory Usage &

ChargingCRM

Real-Time & Batch Connectors

Insight Platform

Marketing

AnalyticalApplication Framework:

Dashboards & Visualisation

Decisioning Engine

Dynamic Micro Segmentation

Network Care Operations

Page 35: Big data Overview

Source: Silicon Angle http://siliconangle.com/blog/2013/11/13/how-big-is-big-data-really/

• Big data happens when the data you have to process is bigger than what you can process in the given time with current technologies

• Big data analytics is when sample = N

• Big data is when we can handle data fast enough to make a difference

Page 36: Big data Overview

Additional takeaways

• CSPs have always been in the big data business – they just didn’t know it

• Big data is not a panacea • Hadoop is shaping up as the big data OS– Though there are alternatives arriving from the

cloud arena (mesos, kubernetes)

Page 37: Big data Overview

What we covered here is not even

the tip of the iceberg

Source: wikimedia http://commons.wikimedia.org/wiki/File:Iceberg.jpg

Page 38: Big data Overview

Arnon Rotem-Gal-Oz Director of Technology Research, [email protected] / [email protected]