apache nifi - better analytics demands better data flow
TRANSCRIPT
• Built to automate data flow between system.
• Design concepts closely relate to the main ideas of Flow Based Programming
• Visual creation and management of directed graphs of processors.
• Asynchronous which allows for very high throughput and natural buffering even as
processing and flow rates fluctuate.
• Web-based user interface
• Seamless experience between design, control, feedback, and monitoring
• Highly configurable
• Loss tolerant vs guaranteed delivery
• Low latency vs high throughput
• Dynamic prioritization
• Flow can be modified at runtime
• Back pressure
• Data Provenance
• Track dataflow from beginning to end
• Designed for extension
• Build your own processors and more
• Enables rapid development and effective testing
• Secure
• SSL, SSH, HTTPS, encrypted content, etc...
• Pluggable role-based authentication/authorization
• Enterprise application integration, ETL
• Data Ingestion & Streaming
Easily and efficiently inject data into hadoop
• IoAT Optimization
Secure, Prioritize, Enrich and Trace data at edge
• Data Security
Acquire and prioritize data into data lake for analysis
• Compliance
Gain full transparency into provenance and flow of data
Introduction Architecture
Features Use case
Better analytics demands better dataflow