Real time analytics with Netty, Storm, Kafka

Real-time Analytics with Netty, Apache Kafka and Storm Case study with “lambda architecture” http://nguyentantrieu.info Update: 07/06/2013

Upload: trieu-nguyen

Post on 08-Sep-2014


Page 1: Real time analytics with Netty, Storm, Kafka

Real-time Analytics with Netty, Apache Kafka and Storm

Case study with “lambda architecture”

http://nguyentantrieu.info

Update: 07/06/2013

Page 2: Real time analytics with Netty, Storm, Kafka

Agenda

1. Overview Architecture
2. Log HTTP-Handler and producer: Netty 4
3. Kafka 0.8 (Stream Data Log Storage)
4. Storm Analytics Cluster

Page 3: Real time analytics with Netty, Storm, Kafka

Overview System Architecture

Page 4: Real time analytics with Netty, Storm, Kafka

Concept Flow

Page 5: Real time analytics with Netty, Storm, Kafka

Concept Flow

JavaScript Tracking

Mobile SDK

Http Log Server

Kafka
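
The flow on this slide (tracking clients call the HTTP log server, which forwards log records to Kafka) might look roughly like the sketch below. It is only an illustration: the query parameter names (`uid`, `event`, `url`), the tab-separated record layout, and the in-memory `queue.Queue` standing in for a Kafka producer are all assumptions, not the format used in the deck.

```python
import queue
from urllib.parse import urlsplit, parse_qs

# Stand-in for a Kafka producer; a real log server would publish to a topic.
outbox = queue.Queue()


def handle_tracking_request(request_uri, timestamp):
    """Turn one tracking request URI into a log record and enqueue it.

    The parameter names (uid, event, url) are hypothetical.
    Missing parameters are written as "-".
    """
    params = parse_qs(urlsplit(request_uri).query)
    record = "\t".join([
        str(timestamp),
        params.get("uid", ["-"])[0],
        params.get("event", ["-"])[0],
        params.get("url", ["-"])[0],
    ])
    outbox.put(record)
    return record
```

Keeping the handler limited to parsing and enqueueing means the HTTP thread never waits on the broker; a separate worker can drain the queue and publish in batches.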

Page 6: Real time analytics with Netty, Storm, Kafka

S2 HTTP Log Server

Netty framework 4

Page 7: Real time analytics with Netty, Storm, Kafka

Netty.io

Netty is a non-blocking I/O (NIO) client-server framework for the development of Java network applications such as protocol servers and clients. The asynchronous event-driven network application framework and tools are used to simplify network programming such as TCP and UDP socket servers. Netty includes an implementation of the reactor pattern of programming.

http://en.wikipedia.org/wiki/Netty_(software)

http://nguyentantrieu.info/blog/backend-system-with-netty-io
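
To make the reactor pattern mentioned above concrete, here is a minimal sketch using Python's standard `selectors` module. It is not Netty code and does not mirror Netty's API; the `Reactor` class and the `echo_upper` handler are hypothetical names chosen for illustration. The idea is the same, though: one loop waits for readiness events on many sockets and dispatches each event to a registered handler instead of blocking a thread per connection.

```python
import selectors
import socket


class Reactor:
    """Minimal reactor: wait for readiness events, dispatch to handlers."""

    def __init__(self):
        self.selector = selectors.DefaultSelector()

    def register(self, sock, handler):
        """Watch a socket for readability and remember its handler."""
        sock.setblocking(False)
        self.selector.register(sock, selectors.EVENT_READ, handler)

    def unregister(self, sock):
        self.selector.unregister(sock)

    def run_once(self, timeout=1.0):
        """Dispatch every handler whose socket is readable; return the count."""
        events = self.selector.select(timeout)
        for key, _mask in events:
            key.data(key.fileobj)  # key.data is the handler passed to register()
        return len(events)

    def close(self):
        self.selector.close()


def echo_upper(sock):
    """Example handler: read available bytes and echo them back upper-cased."""
    data = sock.recv(4096)
    if data:
        sock.sendall(data.upper())
```

A server would call `run_once()` in a loop; handlers must return quickly, because a slow handler stalls every other connection served by the same loop.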

Page 8: Real time analytics with Netty, Storm, Kafka

Apache Kafka (version 0.8)

https://cwiki.apache.org/confluence/display/KAFKA/Index

Page 9: Real time analytics with Netty, Storm, Kafka

In Production

● Clustering (4 nodes)
● Partitions
  ○ user-activity: 24 partitions
● Producer and consumer are replication aware
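
As a rough illustration of what 24 partitions on a 4-node cluster means in practice, the sketch below maps message keys to partitions and spreads partition leadership across brokers. The CRC32 hash and the round-robin leader layout are assumptions made for the example; Kafka's own partitioner and replica assignment may behave differently.

```python
import zlib

NUM_PARTITIONS = 24  # "user-activity" partition count from the slide
NUM_BROKERS = 4      # cluster size from the slide


def partition_for(key, num_partitions=NUM_PARTITIONS):
    """Map a message key to a partition with a stable hash (CRC32 here)."""
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def round_robin_leaders(num_partitions=NUM_PARTITIONS, num_brokers=NUM_BROKERS):
    """Hypothetical even layout: partition p is led by broker p % num_brokers."""
    return {p: p % num_brokers for p in range(num_partitions)}
```

With these numbers each broker leads 6 partitions, and all events that share a key (for example a user ID) land in the same partition, which preserves their relative order for consumers.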

Page 10: Real time analytics with Netty, Storm, Kafka

Core Producer Class (S2 HTTP Log Server)

Page 11: Real time analytics with Netty, Storm, Kafka

Storm Analytics Cluster

Page 12: Real time analytics with Netty, Storm, Kafka

The Storm Topology

Kafka Cluster

topic: user-activity

Tokenizer Bolt

Parser Bolt

Aggregate Bolt

Redis Statistics Bolt

Save DWH Bolt

Raw Data

Kafka Consumer Spout
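
The components named on this slide can be read as a pipeline. The sketch below models one plausible wiring in plain Python; it does not use Storm's API. The original diagram did not survive in this transcript, so the stage order, the tab-separated `timestamp, user id, event` record format, and the use of a dict and a list as stand-ins for Redis and the data warehouse are all assumptions.

```python
from collections import Counter


def kafka_consumer_spout(raw_lines):
    """Spout stand-in: emit raw log lines one at a time."""
    for line in raw_lines:
        yield line


def tokenizer_bolt(lines):
    """Split each tab-separated line into fields (format is hypothetical)."""
    for line in lines:
        yield line.rstrip("\n").split("\t")


def parser_bolt(token_lists):
    """Turn field lists into event dicts, skipping malformed records."""
    for tokens in token_lists:
        if len(tokens) != 3:
            continue
        timestamp, user_id, event = tokens
        if not timestamp.isdigit():
            continue
        yield {"ts": int(timestamp), "uid": user_id, "event": event}


def aggregate_bolt(events):
    """Count events per (minute, event type)."""
    counts = Counter()
    for e in events:
        counts[(e["ts"] // 60, e["event"])] += 1
    return counts


def run_topology(raw_lines, stats_store, warehouse):
    """Wire the stages; a dict and a list stand in for Redis and the DWH."""
    raw_lines = list(raw_lines)
    warehouse.extend(raw_lines)  # Save DWH Bolt: keep the raw data
    events = parser_bolt(tokenizer_bolt(kafka_consumer_spout(raw_lines)))
    for key, n in aggregate_bolt(events).items():  # Redis Statistics Bolt
        stats_store[key] = stats_store.get(key, 0) + n
    return stats_store
```

A real Storm topology processes tuples continuously rather than in one batch as here. In lambda-architecture terms, the raw copy feeds the batch layer while the per-minute counters serve the real-time view.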