big data paris : hadoop and nosql

21

Upload: tugdual-grall

Post on 01-Nov-2014

21 views

Category:

Technology


6 download

Tags:

DESCRIPTION

Quick overview of the benefits and some use case of BigD

TRANSCRIPT

Page 1: Big Data Paris : Hadoop and NoSQL
Page 2: Big Data Paris : Hadoop and NoSQL

BigData - NoSQLHadoop - Couchbase

Tugdual “Tug” Grall

Technical Evangelist

Page 3: Big Data Paris : Hadoop and NoSQL

About Me

• Tugdual “Tug” Grall Couchbase

• Technical Evangelist

eXo

• CTO

Oracle

• Developer/Product Manager

• Mainly Java/SOA

Developer in consulting firms

• Web

• @tgrall

• http://blog.grallandco.com

• tgrallNantesJUG co-founderPet Project :http://www.resultri.com

Page 4: Big Data Paris : Hadoop and NoSQL

<50%?

2027

95%Relational

Technology

$30B Database Market Being Disrupted

2012

All new database growth will be NoSQLAll new database growth will be NoSQLAll new database growth will be NoSQLAll new database growth will be NoSQL

Relational Technology

Relational Technology

Relational Technology

NoSQLTechnology

Other

Page 5: Big Data Paris : Hadoop and NoSQL

ClouderaHortonworks

Mapr

Operational vs. Analytic Databases

CouchbaseMongoDB

CassandraHbase

AnalyticAnalyticDatabasesDatabases

Get insights Get insights from datafrom data

Real-time, Real-time, Interactive Interactive DatabasesDatabases

Fast access Fast access to datato data

NoSQL

Page 6: Big Data Paris : Hadoop and NoSQL

Lack of flexibility/rigid schemas

Inability to scale out data

Performance challenges

Cost All of these Other

49%

35%

29%

16%12% 11%

Source: Couchbase Survey, December 2011, n = 1351.

Page 7: Big Data Paris : Hadoop and NoSQL

Hadoop and NoSQL

Page 8: Big Data Paris : Hadoop and NoSQL

Sqoop is a tool designed to transfer data between Hadoop and relational databases.

You can use Sqoop to import data from a relational database management system (RDBMS) such as MySQL or Oracle into the Hadoop Distributed File System (HDFS), transform the data in Hadoop MapReduce, and then export the data back into an RDBMS.

sqoop.apache.org

What is Sqoop?

Page 9: Big Data Paris : Hadoop and NoSQL

• Traditional ETL

Application DataData

T

What is Sqoop?

Page 10: Big Data Paris : Hadoop and NoSQL

• A different paradigm

Data

ApplicationData

What is Sqoop?

Page 11: Big Data Paris : Hadoop and NoSQL

• A very scalable different paradigm

Data

Application

Data

Application

Data

Application

Data

Page 12: Big Data Paris : Hadoop and NoSQL

• Where did the Transform go?

Application

Data

TTT TTT TTT TTT

What is Sqoop?

Page 13: Big Data Paris : Hadoop and NoSQL

• Sqoop

• Default connection is via JDBCLots of custom connectorsCouchbase, VoltDB, VerticaTeradata, NetezzaOracle, MySQL, Postgres

What is Sqoop?

Page 14: Big Data Paris : Hadoop and NoSQL

events

profiles, campaigns

profiles, real time campaign statistics

40 milliseconds to respond with the decision.

2

3

1

Ad and offer targeting

Page 15: Big Data Paris : Hadoop and NoSQL

Moving Parts

Page 16: Big Data Paris : Hadoop and NoSQL

Content & Recommendation Targeting

Page 17: Big Data Paris : Hadoop and NoSQL

Moving Parts

Page 18: Big Data Paris : Hadoop and NoSQL

Couchbase

Page 19: Big Data Paris : Hadoop and NoSQL

Easy Scalability

Consistent High Performance

Always On 24x365

Grow cluster without application changes, without downtime with a

single click

Consistent sub-millisecond read and write response times

with consistent high throughput

No downtime for software upgrades, hardware maintenance,

etc.

Flexible Data Model

JSON document model with no fixed schema.

Couchbase Server Core Principles

Page 20: Big Data Paris : Hadoop and NoSQL

Couchbase Handles Real World Scale

Page 21: Big Data Paris : Hadoop and NoSQL

Q&A