Apache NiFi - better analytics demands better data flow


Upload: abhishek-solanki

Post on 11-Apr-2017


Category: Technology



TRANSCRIPT

• Built to automate data flow between systems.

• Design concepts closely relate to the main ideas of Flow Based Programming

• Visual creation and management of directed graphs of processors.

• Asynchronous, which allows for very high throughput and natural buffering even as processing and flow rates fluctuate.

• Web-based user interface

• Seamless experience between design, control, feedback, and monitoring

• Highly configurable

• Loss tolerant vs guaranteed delivery

• Low latency vs high throughput

• Dynamic prioritization

• Flow can be modified at runtime

• Back pressure

• Data Provenance

• Track dataflow from beginning to end

• Designed for extension

• Build your own processors and more

• Enables rapid development and effective testing

• Secure

• SSL, SSH, HTTPS, encrypted content, etc.

• Pluggable role-based authentication/authorization

• Enterprise application integration, ETL

• Data Ingestion & Streaming

Easily and efficiently ingest data into Hadoop

• IoAT Optimization

Secure, prioritize, enrich, and trace data at the edge

• Data Security

Acquire and prioritize data into data lake for analysis

• Compliance

Gain full transparency into provenance and flow of data
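The buffering, back pressure, and dynamic prioritization bullets above can be pictured with a small sketch. This is not NiFi code or the NiFi API: the `Connection` class, `run_flow` function, and the `back_pressure_threshold` and `priority` parameters are hypothetical names chosen for illustration. It assumes a bounded queue sits between an upstream and a downstream processor, that the upstream side stops enqueuing once the queue reaches its threshold, and that the downstream side always takes the highest-priority item first (lower number means higher priority).

```python
import heapq
import itertools


class Connection:
    """Hypothetical bounded, prioritized buffer between two processors."""

    def __init__(self, back_pressure_threshold, priority=lambda item: 0):
        self.back_pressure_threshold = back_pressure_threshold
        self.priority = priority
        self._heap = []
        # Monotonic counter breaks ties so equal priorities stay FIFO
        # and the items themselves are never compared.
        self._counter = itertools.count()

    def __len__(self):
        return len(self._heap)

    def offer(self, item):
        """Enqueue the item, or return False when back pressure applies."""
        if len(self._heap) >= self.back_pressure_threshold:
            return False
        heapq.heappush(
            self._heap, (self.priority(item), next(self._counter), item)
        )
        return True

    def poll(self):
        """Return the highest-priority item, or None if the queue is empty."""
        if not self._heap:
            return None
        return heapq.heappop(self._heap)[2]


def run_flow(records, threshold):
    """Push records through one connection; the consumer handles one per cycle."""
    conn = Connection(threshold, priority=lambda r: r["priority"])
    pending = list(records)
    delivered = []
    deferred_cycles = 0  # cycles in which the producer was held back

    while pending or len(conn) > 0:
        # Upstream processor: enqueue until the connection pushes back.
        while pending and conn.offer(pending[0]):
            pending.pop(0)
        if pending:
            deferred_cycles += 1

        # Downstream processor: consume a single item this cycle.
        item = conn.poll()
        if item is not None:
            delivered.append(item["name"])

    return delivered, deferred_cycles


if __name__ == "__main__":
    sample = [
        {"name": "a", "priority": 2},
        {"name": "b", "priority": 1},
        {"name": "c", "priority": 0},
    ]
    print(run_flow(sample, threshold=2))
```

With a threshold of 2, the third record has to wait one cycle before it can be enqueued, yet it is delivered before the lower-priority record that was already buffered. Nothing is dropped; the producer is simply slowed to the pace of the consumer.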

Introduction Architecture

Features Use case
