kafka audit - kafka meetup - january 27th, 2015
KAFKA AUDIT - January 27th, 2015 - LinkedIn Meetup
[Diagram build-up, one tier added per slide:
1. A producer sends plain Kafka messages (Mp) to a local Kafka cluster.
2. The local cluster is mirrored into an aggregate Kafka cluster.
3. Datacenter A and Datacenter B each run a local Kafka cluster and an aggregate Kafka cluster.
4. Each aggregate cluster receives Mp mirrored from the local clusters in both datacenters.
5. Offline processing consumes Mp from an aggregate cluster.
Mp = Plain old Kafka message]
[Diagram: the producer wraps each plain message with audit data (Ma) before sending it to the Kafka cluster, and periodically emits a monitoring message (Mm).]

Ma = { Plain old Kafka message,
       Producer creation timestamp,
       Producer identification string }

Mm = { Count of messages,
       The topic this count is for,
       Tier identification string,
       Time bucket interval }
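As a sketch of the two message shapes above, assuming hypothetical field names (the slides give only the conceptual contents):

```python
import time
from dataclasses import dataclass

@dataclass
class Ma:
    """Message with audit data, as described on the slide."""
    payload: bytes              # plain old Kafka message
    producer_timestamp: float   # producer creation timestamp (epoch seconds)
    producer_id: str            # producer identification string

@dataclass
class Mm:
    """Monitoring message: a count for one topic, tier, and time bucket."""
    count: int          # count of messages
    topic: str          # the topic this count is for
    tier: str           # tier identification string, e.g. "local"
    bucket_start: int   # start of the time bucket interval (epoch seconds)

ma = Ma(b"click-event", time.time(), "frontend-producer-7")
mm = Mm(123, "clicks", "local", 36600)
```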
[Diagram build-up, one piece added per slide:
1. Ma flows from the producer through the local and aggregate Kafka clusters to offline processing; the producer emits Mm alongside it.
2. An audit consumer on the local cluster counts the Ma it sees and emits its own Mm.
3. Each tier (local, aggregate, aggregate) gets its own audit consumer, each emitting Mm.
4. An audit app consumes the Mm stream from every tier.
Ma = Message with audit data
Mm = Monitoring message]
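A minimal sketch of what one tier's audit consumer does, assuming 10-minute buckets (the interval used in the late-message tables later) and dict-shaped Mm output; the real consumers produce Kafka messages, not dicts:

```python
from collections import defaultdict

BUCKET_SECONDS = 600  # assumed 10-minute time buckets

def bucket_start(producer_ts: float) -> int:
    # The producer timestamp, not arrival time, picks the bucket.
    return int(producer_ts // BUCKET_SECONDS) * BUCKET_SECONDS

class AuditConsumer:
    """Counts the Ma messages one tier sees, keyed by (topic, bucket)."""

    def __init__(self, tier: str):
        self.tier = tier
        self.counts = defaultdict(int)

    def on_message(self, topic: str, producer_ts: float) -> None:
        self.counts[(topic, bucket_start(producer_ts))] += 1

    def flush(self):
        # Emit one Mm per (topic, bucket); here we just return dicts.
        return [
            {"count": c, "topic": t, "tier": self.tier, "bucket": b}
            for (t, b), c in self.counts.items()
        ]

local = AuditConsumer("local")
for ts in (100, 200, 700):
    local.on_message("pageviews", ts)
# Two messages fall in bucket 0, one in bucket 600.
```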
[Diagram: full pipeline. Audit consumers on each tier emit Mm into an aggregate Kafka cluster; the audit app consumes Mm, stores the counts in an Audit MySQL database, and serves them through a REST API and an Audit UI.
Ma = Message with audit data
Mm = Monitoring message]
AUDIT UI

Tier       Count
Producer   123
Local      123
Aggregate  123
Aggregate  123
Offline    123
(for each topic and time window)
AUDIT UI

Tier       Count
Producer   123
Local      123
Aggregate  119
Aggregate  119
Offline    119
We lost 4 messages between local and aggregate!
(for each topic and time window)
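The UI comparison above can be sketched as a simple check against the producer's count; tier names and the dict shape are assumptions for illustration:

```python
def audit_rows(counts_by_tier: dict):
    """counts_by_tier maps tier name -> count for one topic and time window.
    Flags any tier that saw fewer messages than the producer sent."""
    expected = counts_by_tier["Producer"]
    rows = []
    for tier, count in counts_by_tier.items():
        status = "ok" if count == expected else f"lost {expected - count}"
        rows.append((tier, count, status))
    return rows

rows = audit_rows({"Producer": 123, "Local": 123, "Aggregate": 119})
# The Aggregate row is flagged as having lost 4 messages.
```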
CAVEATS
• Audit consumers need to consume
everything.
• Intermediate tiers are tough to drill down into.
QUESTIONS?
users@kafka.apache.org
https://kafka.apache.org/
irc://irc.freenode.net/#apache-kafka
Many folks on the mailing list know the details
of how Kafka Audit works.
LATE MESSAGE RESOLUTION
Tier        10:00  10:10  10:20  10:30  10:40
Producer     341    352    337    326     -
Local        341    299    337    326     -
Aggregate    341    299    337    326     -
Aggregate    341    299    337    326     -
Hadoop       341    299    337    326     -
(10:40 is the current, still-open time bucket)

From the 10:10 to 10:20 time bucket, 53 messages were lost from the producer to the Kafka local cluster. Unhealthy!
LATE MESSAGE RESOLUTION

Tier        10:00   10:10   10:20  10:30  10:40
Producer     341     352     337    326     -
Local        341   299+53    337    326     -
Aggregate    341     299     337    326     -
Aggregate    341     299     337    326     -
Hadoop       341     299     337    326     -
(10:40 is the current, still-open time bucket)

Another message Mm arrives later with the missing count of 53!
LATE MESSAGE RESOLUTION

Tier        10:00  10:10  10:20  10:30  10:40
Producer     341    352    337    326     -
Local        341    352    337    326     -
Aggregate    341    352    337    326     -
Aggregate    341    352    337    326     -
Hadoop       341    352    337    326     -
(10:40 is the current, still-open time bucket)

All time periods match after arrival of the late Mm message. Healthy state now.
The producer timestamp determines the time bucket the message is placed into, so bucket assignment is deterministic.
Mm = { Count of messages,
       The topic this count is for,
       Tier identification string,
       Time bucket interval }
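Because bucketing is deterministic, a late Mm needs no special handling: it simply adds into the bucket its producer timestamps put it in. A sketch, with the bucket encoded as seconds since midnight (a hypothetical encoding):

```python
# Running totals keyed the way Mm is keyed, so a late Mm lands in its
# original bucket no matter when it arrives.
totals = {}

def apply_mm(mm: dict) -> None:
    key = (mm["tier"], mm["topic"], mm["bucket"])
    totals[key] = totals.get(key, 0) + mm["count"]

BUCKET_1010 = 10 * 3600 + 10 * 60  # the 10:10 bucket from the table

apply_mm({"tier": "Local", "topic": "clicks", "bucket": BUCKET_1010, "count": 299})
# The late Mm carrying the missing 53 adds into the same bucket:
apply_mm({"tier": "Local", "topic": "clicks", "bucket": BUCKET_1010, "count": 53})
# The Local tier's total for 10:10 now matches the Producer tier's 352.
```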
TRANSPORT TIME

[Diagram: audit consumers on the local and both aggregate clusters record when each Ma is seen and emit Tt to a metrics system (e.g. RRDs).]

Tt = { Time Ma seen by audit consumer,
       Topic name }

Tt can be sampled; there is no need to emit it for all messages.

Tt[time] = <Audit Consumer NTPd Time> - Ma[time]
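The formula above, with sampling, can be sketched as follows; the function names and the 1% sample rate are assumptions for illustration:

```python
import random
import time

def transport_time(producer_ts: float, consumer_ts: float) -> float:
    # Tt[time] = <Audit Consumer NTPd Time> - Ma[time]
    return consumer_ts - producer_ts

def maybe_emit_tt(topic: str, producer_ts: float, sample_rate: float = 0.01):
    # Tt can be sampled; no need to emit it for every Ma.
    if random.random() < sample_rate:
        return {"topic": topic, "tt": transport_time(producer_ts, time.time())}
    return None
```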
CAVEATS
• Depends on the Audit Consumer lag.
• Producer batching can skew timestamps.
SCHEMA RESOLUTION

WHAT IS A SCHEMA?

{
  "type": "record",
  "name": "User",
  "fields": [
    { "name": "name", "type": "string" },
    { "name": "favorite_number", "type": ["int", "null"] }
  ]
}
Every message should be formatted to a schema!
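As a toy illustration of what "formatted to a schema" means, here is a check that a record carries exactly the fields the Avro-style "User" schema above declares; a real deployment would use an Avro library, and this sketch checks field names only, not types:

```python
USER_SCHEMA = {
    "type": "record",
    "name": "User",
    "fields": [
        {"name": "name", "type": "string"},
        {"name": "favorite_number", "type": ["int", "null"]},
    ],
}

def conforms(record: dict, schema: dict) -> bool:
    # True when the record has exactly the declared field names.
    declared = {f["name"] for f in schema["fields"]}
    return set(record) == declared

ok = conforms({"name": "jay", "favorite_number": 7}, USER_SCHEMA)
```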
SCHEMA REGISTRY

A REST API to go from schema to ID, and ID to schema.
Schema ID = hash(Raw Schema)

Schema Registry Database columns: Schema ID | Raw Schema | Registration Timestamp
History of registrations is maintained.
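A toy in-memory version of the registry, assuming MD5 as the hash (the deck says only "hash"); the real registry is a REST service backed by the database above:

```python
import hashlib
import time

class SchemaRegistry:
    """Toy registry: schema -> ID and ID -> schema, with a registration history."""

    def __init__(self):
        self.by_id = {}
        self.history = []  # (registration timestamp, schema id, raw schema)

    def register(self, raw_schema: str) -> str:
        # Schema ID = hash(Raw Schema), so re-registration yields the same ID.
        schema_id = hashlib.md5(raw_schema.encode()).hexdigest()
        self.by_id[schema_id] = raw_schema
        self.history.append((time.time(), schema_id, raw_schema))
        return schema_id

    def lookup(self, schema_id: str) -> str:
        return self.by_id[schema_id]

reg = SchemaRegistry()
sid = reg.register('{"type":"string"}')
```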
[Diagram: Producer, Schema Registry, and Kafka, with the three numbered steps below.]
1. Producer registers schema.
2. Registry returns schema ID (hash of schema).
3. Schema ID prepended to all Kafka messages.
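Step 3 can be sketched as simple byte framing; the 16-byte MD5 digest as the on-wire ID is an assumption, not LinkedIn's documented format:

```python
import hashlib

ID_BYTES = 16  # length of an MD5 digest; assumed wire format

def frame(schema_id: bytes, payload: bytes) -> bytes:
    # Ms = schema ID prepended to the plain message.
    assert len(schema_id) == ID_BYTES
    return schema_id + payload

def unframe(ms: bytes):
    # Split a framed message back into (schema ID, payload).
    return ms[:ID_BYTES], ms[ID_BYTES:]

sid = hashlib.md5(b'{"type":"record"}').digest()
ms = frame(sid, b"payload")
```

A consumer reverses the framing, then asks the registry for the schema behind the ID before decoding the payload.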
Ms = { <Schema ID> + M } for all messages M