01 bigdata overview

17
© 2013 IBM Corporation August 18, 2014 BigData Overview

Upload: rofaelemil

Post on 08-Dec-2015

6 views

Category:

Documents


1 download

DESCRIPTION

Bigdata Overview

TRANSCRIPT

Page 1: 01 Bigdata Overview

© 2013 IBM CorporationAugust 18, 2014

BigData Overview

Page 2: 01 Bigdata Overview

Agenda

What is BigData

Use Cases

The IBM Big Data Platform

Page 3: 01 Bigdata Overview

1 in 2business leaders don’t have access to data they need

83%of CIO’s cited BI and analytics as part of their visionary plan

5.4Xmore likely that top performers use business analytics

80% of the world’s data today is unstructured

90% of the world’s data was created in the

last two years

20%of available data can

be processed by traditional systems

Source: GigaOM, Software Group, IBM Institute for Business Value"

Intrinsic Property of Data … it grows

Page 4: 01 Bigdata Overview

A growing Interconnected and Instrumented World

2+ billion

people on the

Web by end 2011

30 billion RFID tags today

(1.3B in 2005)

4.6 billion camera phones

world wide

100s of millions of GPS

enabled devices

sold annually

76 million smart meters in 2009… 200M by 2014

500+ Million users posting 55 Million

tweets every day

1+ Billion active users spending 700 Million minutes per

month

1.2

Trill

ion

sear

ches

Page 5: 01 Bigdata Overview

Characteristics of Big Data

V4 = Volume Velocity Variety Veracity

Collectively analyzing the broadening Variety

Responding to the increasing Velocity

Cost efficiently processing the growing Volume

Establishing the Veracity of big data sources

1 in 3 business leaders don’t trust the information they use to make decisions

50x 35 ZB

20202010

30 Billion RFID sensors and counting

80% of the worlds data is unstructured

Page 6: 01 Bigdata Overview

Commoditization of Hardware Enabling New Analytics

Low cost compute platform– 1 petabyte Hadoop cluster for approx $1 million– Hadoop architecture

• Optimized for high data volumes Clusters of affordable machines running a Distributed File System (HDFS) and MapReduce processing

– Hardware failure is expected and managed

Hardware Appliance– Up and Running with new cluster in hours

Cloud– Up and Running with new cluster in minutes– Pay what you use

© 2013 IBM Corporation6 Source: Forbes: “The Big Cost of Big Data”

Page 7: 01 Bigdata Overview

Big Data ExplorationFind, visualize, understand

all big data to improve decision making

Enhanced 360o Viewof the Customer

Extend existing customer views by incorporating additional internal and external data sources

Data Warehouse AugmentationIntegrate big data and data warehouse

capabilities to increase operational efficiency

Security/Intelligence Extension

Lower risk, detect fraud and monitor cyber security

in real-time

The 5 Key Big Data Use Cases

© 2013 IBM Corporation7

Operations AnalysisAnalyze a variety of machine

data for improved business results

Page 8: 01 Bigdata Overview

More Ways - Wide Ranging Analytics & Techniques

© 2013 IBM Corporation8

Machine Learning

Video Analysis

Audio Analysis

Statistics Text Analysis

Image Analysis

Spatial Analysis

Temporal Analysis

Page 9: 01 Bigdata Overview

Medical information is doubling every 5 years, much of which is unstructured

81% of physicians report spending 5 hours or less per month reading medical journals

Source: International Journal of Circumpolar Health, DoctorDirectory.com, Institute for Medicine"

“Medicine has become too complex (and only) about 20 percent of the knowledge clinicians use today is evidence-based”

- Steven Shapiro, Chief Medical and Scientific Officer, UPMC…to keep up with the state of the art, a doctor would have to devote 160 hours a week to perusing papers…”

The Economist Feb 14th 2013

Big Data and Complexity in Health Care

Page 10: 01 Bigdata Overview

Cloud | Mobile | Security

Gather, extract and explore data

using best of breed visualization

Speed time to value with analytic

and application accelerators

IBM Big Data Platform

Systems Management

Applications & Development

Visualization & Discovery

Analyze streaming data and large data bursts for real-time

insights

Govern data quality and

manage information

lifecycle

Cost-effectively analyze

Petabytes of structured and unstructured information

Deliver deep insight with advanced

in-database analytics and operational

analytics

Accelerators

Information Integration & Governance

HadoopSystem

Stream Computing

Data Warehouse

Contextual Discovery

Index and federated discovery

for contextual collaborative

insights

Solutions

Analytics and Decision Management

Big Data Infrastructure

Big Data Platform and Application Frameworks

Page 11: 01 Bigdata Overview

ETL, MDM, Data Governance

Metadata and Governance Zone

11

Warehousing Zone

Enterprise Warehouse

Data Marts

An example of the big data platform in practiceIngestion and Real-time Analytic Zone

Streams

Connector s

BI & Reporting

PredictiveAnalytics

Analytics and Reporting Zone

Visualization & Discovery

Landing and Analytics Sandbox Zone

Hive/HBaseCol Stores

Documentsin variety of formats

MapReduce

Hadoop

Page 12: 01 Bigdata Overview

Integrate and Govern all Data Sources

Integration, Data Quality, Security, ILM, MDM

A Big Data Platform Manifesto

12

Manage Streaming Data Stream Computing

Understand and Navigate Federated Big Data Sources

Federated Discovery and Navigation

Data WarehousingStructure and Control Data

Manage and Store Huge Volume of any Data

Hadoop File SystemMapReduce

Analyze Unstructured Data Text Analytics Engine

Page 13: 01 Bigdata Overview

© 2013 IBM Corporation13

Use Cases for a Big Data Platform

Financial services– Problem:

• Manage the several Petabytes of data which is growing at 40-100% per year under increasing pressure to prevent frauds and complain to regulations.

– How big data analytics can help:• Fraud detection• Risk management• 360°View of the Customer

Page 14: 01 Bigdata Overview

© 2013 IBM Corporation14

Use Cases for a Big Data Platform

Telecommunication services– Problem:

• Legacy systems are used to gain insights from internally generated data facing issues of high storage costs, long data loading time, and long administration process.

– How big data analytics can help:• CDR processing• Churn prediction• Geomapping / marketing• Network monitoring

Page 15: 01 Bigdata Overview

© 2013 IBM Corporation15

Use Cases for a Big Data Platform

Transportation services– Problem:

• Traffic congestion has been increasing worldwide as a result of increased urbanization and population growth reducing the efficiency of transportation infrastructure and increasing travel time and fuel consumption.

– How big data analytics can help:• Real time analysis to weather and traffic congestion data streams to identify traffic

patterns reducing transportation costs.

Page 16: 01 Bigdata Overview

© 2013 IBM Corporation16

Use Cases for a Big Data Platform

Healthcare and Life Sciences– Problem:

• Vast quantities of real-time information are starting to come from wireless monitoring devices that postoperative patients and those with chronic diseases are wearing at home and in their daily lives.

– How big data analytics can help:• Epidemic early warning• Intensive Care Unit and remote monitoring

Page 17: 01 Bigdata Overview

© 2013 IBM Corporation

Questions?

17