big data on public cloud

Post on 14-Jul-2015

Big Data on Public Cloud

Assoc. Prof. Dr. Thanachart Numnonda

Executive Director

IMC Institute

13 March 2015

“By 2015, 20% of Global 1000 organizations will have established a strategic focus on information infrastructure”

Gartner

Big Data Landscape

Source: Big Data in the Enterprise. When to Use What?

Big Data Landscape

Source : http://www.vitria.com/

NoSQL

What is Hadoop?

A scalable, fault-tolerant distributed system for data storage and processing

Completely written in Java

Open source & distributed under Apache license
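
The fault tolerance of Hadoop's storage layer can be illustrated with a minimal sketch: a file is split into fixed-size blocks and each block is copied to several nodes, so losing one node does not lose data. The 128 MB block size and replication factor of 3 are the Hadoop 2.x defaults; the node names, the `place_blocks` and `readable_after_failure` helpers, and the round-robin placement are assumptions made for illustration only (real HDFS uses a rack-aware placement policy).

```python
import math

BLOCK_SIZE_MB = 128  # default HDFS block size in Hadoop 2.x
REPLICATION = 3      # default HDFS replication factor


def place_blocks(file_size_mb, nodes,
                 block_size_mb=BLOCK_SIZE_MB, replication=REPLICATION):
    """Split a file into blocks and assign each block to distinct nodes.

    Round-robin placement is a simplification for illustration,
    not the rack-aware policy that HDFS actually applies.
    """
    if replication > len(nodes):
        raise ValueError("need at least as many nodes as replicas")
    n_blocks = math.ceil(file_size_mb / block_size_mb)
    placement = {}
    for block_id in range(n_blocks):
        placement[block_id] = [
            nodes[(block_id + r) % len(nodes)] for r in range(replication)
        ]
    return placement


def readable_after_failure(placement, failed_node):
    """True if every block keeps at least one replica on a surviving node."""
    return all(
        any(node != failed_node for node in replicas)
        for replicas in placement.values()
    )


nodes = ["node1", "node2", "node3", "node4"]  # hypothetical node names
placement = place_blocks(500, nodes)          # a 500 MB file needs 4 blocks
print(placement)
print(readable_after_failure(placement, "node2"))
```

With three replicas per block, any single node can fail and every block is still readable from another node.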

Hadoop Environment

Source: Hadoop in Practice; Alex Holmes

Major Hadoop Components

Hadoop Distributed File System (HDFS)

Map/Reduce System
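
The Map/Reduce programming model can be sketched in plain Python without a cluster. The example below is an in-memory word count, assuming three hypothetical helpers (`map_phase`, `shuffle`, `reduce_phase`) that mirror the map, shuffle and reduce stages; it does not use the Hadoop API and only shows the data flow.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for each word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reducer: sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle and reduce over a list of input lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


lines = ["big data on cloud", "big data as a service"]
counts = word_count(lines)
print(counts)
```

On a real cluster the mappers and reducers run in parallel on different nodes, and the shuffle step moves data between them over the network.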

Hadoop Distribution

Microsoft Azure

Big Data Future Architecture

Social Media, Images, E-mails, Crawlers, ERP, CRM, LOB Apps

Unstructured and Structured Data

Parallel Data Warehouse

Hadoop on Cloud

Hadoop on Private Server

Connectors

SSRS

BI Platform

Familiar End User Tools

Spreadsheet

Predictive Analytics

Data Market Place

NoSQL

Petabytes of Data (Unstructured)

Hundreds of TB of Data (Structured)

Issue with Big Data Infrastructure

Large investment

Scalability

ROI

Business Cases

Source: http://acloudyplace.com/

Big Data on Cloud

Using IaaS to leverage cloud VMs

Using Big Data as a Service

Big Data Services on Cloud

Amazon Elastic MapReduce

Microsoft Azure Hadoop

Big Data as a Service

Database as a Service

Amazon RDS

IBM SQL Database for Bluemix

Microsoft SQL Database

Google CloudSQL

NoSQL as a Service

Amazon DynamoDB

Google Cloud Datastore

Microsoft Azure DocumentDB

Cloudant on IBM Bluemix

MongoDB on Heroku

Hadoop as a Service

Amazon Elastic MapReduce

Rackspace Cloud Big Data Platform

Qubole

Google Cloud Platform

IBM Bluemix: Analytics on Hadoop

Microsoft Azure HDInsight

Big Data on Amazon EMR

Big Data on Cloud Roadmap

Step 1: Build the business case

Step 2: Assess your Big Data application workloads

Step 3: Develop a technical approach for deploying and managing Big Data in the cloud

Step 4: Address governance, security, privacy, risk,

Step 5: Deploy, integrate, and operationalize your cloud-based Big Data infrastructure

Source : Deploying Big Data Analytics Applications to the Cloud: Roadmap for Success: CSCS

Assess your application workloads

Big-data storage

Big-data processing

Big-data development

Source : Deploying Big Data Analytics Applications to the Cloud: Roadmap for Success: CSCS

Sample applications

Enterprise applications already hosted in the cloud

High-volume external data sources that require considerable preprocessing

Tactical applications beyond your on-premises Big Data capabilities

Elastic provisioning of very large but short-lived analytic sandboxes

Source : Deploying Big Data Analytics Applications to the Cloud: Roadmap for Success: CSCS
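
The case for short-lived analytic sandboxes can be made concrete with a rough cost comparison. The sketch below is only an illustration: the function names are hypothetical, and every figure (capital cost, monthly operating cost, node count, hourly rate) is an invented assumption, not a real price from any provider.

```python
def on_premises_cost(capex, monthly_opex, months):
    """Total cost of owning a cluster: upfront investment plus running costs."""
    return capex + monthly_opex * months


def cloud_cost(nodes, hourly_rate, hours_per_month, months):
    """Total pay-per-use cost of renting nodes only while they are needed."""
    return nodes * hourly_rate * hours_per_month * months


def break_even_hours(capex, monthly_opex, months, nodes, hourly_rate):
    """Monthly usage hours at which cloud cost equals on-premises cost."""
    total_on_prem = on_premises_cost(capex, monthly_opex, months)
    return total_on_prem / (nodes * hourly_rate * months)


# All numbers below are hypothetical and chosen only to show the arithmetic.
MONTHS = 36
on_prem = on_premises_cost(capex=200_000, monthly_opex=2_000, months=MONTHS)
sandbox = cloud_cost(nodes=20, hourly_rate=1.0, hours_per_month=40, months=MONTHS)
threshold = break_even_hours(200_000, 2_000, MONTHS, nodes=20, hourly_rate=1.0)

print(f"on-premises: {on_prem}, cloud sandbox: {sandbox}")
print(f"break-even usage: {threshold:.1f} hours per month")
```

Under these assumed figures, a cluster used for only a few dozen hours a month is far cheaper to rent than to own; the comparison shifts toward ownership as monthly usage approaches the break-even point.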

Demo

Amazon DynamoDB

Google BigQuery

Hadoop on Google

Amazon EMR

www.facebook.com/imcinstitute

Thank you

thanachart@imcinstitute.com

www.facebook.com/imcinstitute

www.slideshare.net/imcinstitute

www.thanachart.org
