Transcript
Page 1: Apache Hadoop Architecture (2016-17)

Structured Data Unstructured Data Semi Structured Data

Documents Social MediaVideo|

Enterprise Data CRM Machine Sensor

EDI XML/JSON| Transaction

Inte

gra

tio

n

To

ols

Apache FlumeApache KafaApache Sqoop Apache NiFi Apache ManifoldCF

File

S

yste

m

ERPCRM

Da

ta S

ou

rce

s

Hadoop Distributed File System

Quantcast File System Ceph File System XtreemFS

YARNClu

ste

r R

eso

urc

e

Ma

na

ge

me

nt

Execution Engine

DirectJava.NET

Script

Slides

Batch

Script CascadingSQL

PigHive, Apache

Drill, Cloudera IMPALA

ScalaJava Other

ISV

StreamNoSQL

Strom Other ISV

In-memory

Data Flow

Engines

Machine Learning & Search Other

ISV

ZooKeeper

VisualizationM

an

ag

em

en

t &

Co

ord

ina

tion

Top Related