big data infrastructure todo-tasks rfx framework

Overview of Rfx Framework / Platform https://docs.google.com/document/d/1wutns90tuW1PGR03tXhDE_DkrdWZtfvh9R_cJRtrXk/edit?usp=sharing Big Data Infrastructure - TODO Tasks Update March 12, 2014 by Triều (@tantrieuf31) ● Module HTTP Log Server: ○ Hot deployment/restart/shutdown Http Log Server ○ Reactive streaming for Kafka Producer (RxJava) ■ https://github.com/Netflix/RxJava/wiki/TransformingObservables ● Module Messaging (Kafka): https://bitbucket.org/trieunt/kafka ○ Tìm 1 cơ chế quản lý configs và rotate kafka logs 1 cách an toàn hơn (hiện đang bị 1 issue Kafka Consumer chưa đọc xong mà Kafka log đã move đi => kg tìm thấy offset để đọc tiếp => thiếu data) ○ Dự đoán tốc độ tăng file Kafka log để chọn 1 configs tối ưu cho từng loại sản phẩm (machine learning (linear regression) for system performance) ○ Tạo mapping (thời gian, offset và binary offset files) (lúc cần parse lại thì dễ tìm files) ○ Quản lý + index lại offset của Kafka theo thời gian (giờ, ngày, ...), lúc cần thì set vào là chạy reparse lại (hiện chưa implement) ● Module Stream Data Processing: https://bitbucket.org/trieunt/rfx/wiki/Home ○ Quản lý memory của worker node (nếu set HeapSize quá thấp => Worker sẽ die/restart liên tục do kg đủ memory để chạy vì log nhiều) ○ Cơ chế extensions/plugins/hooking vào hệ thống (phân chia core và applications) ○ Refactoring (tổ chức lại code cho rõ ràng) giữa logic code công việc giữa: ■ parse => ghi vào Redis (chỉ parse, counting và check rules) ■ parse => ghi ra raw log files trong 1 worker (chỉ parse và write raw logs) ○ Unit Test Tools (Kafka Producer) + Test Tools (integration test) cho Reactive Topologies ○ Cải thiện chức năng debug log của Worker (ElasticSearch+Kibana) ○ Monitor Front End cho tất cả các critical metrics: ■ worker nodes (logs, memory, restart time, running, died, uptime, downtime ) ■ alert/notification ■ số lượng log đọc từ Kafka, parsed OK, check OK, save OK ■ Disk Free, memory cho worker ■ Backup Redis Data ■ Simple Analytics Dashboard cho logs (analytics) ○ New Job Server (dùng Groovy script để dễ deploy và control qua Pub/Sub Redis) ■ Synchronized Data job ● Module Active Intelligence (tính năng mới ) ● social data crawler Facebook/Twitter/Google+ (Rfx Social Data Crawler) ● Clustering Stream Data (test case: tin tức về các vụ tai nạn xe cột / cướp giật / thảm họa thiên nhiên) dùng Apache Spark http://spark.apache.org ● Realtime Visualization Engine with HTML5 Web Socket (D3.js + Netty + Akka Actor)

Upload: trieu-nguyen

Post on 26-Jan-2015

103 views

Category:

Technology

1 download

Report

Download

Embed Size (px):

DESCRIPTION

Big data infrastructure todo-tasks Rfx Framework

TRANSCRIPT

Overview of Rfx Framework / Platformhttps://docs.google.com/document/d/1wutns90tuW1PGR03tXhDE_DkrdWZtfvh9R_cJRtrXk/edit?usp=sharing

Big Data Infrastructure - TODO Tasks Update March 12, 2014 by Triều (@tantrieuf31)

● Module HTTP Log Server:○ Hot deployment/restart/shutdown Http Log Server○ Reactive streaming for Kafka Producer (RxJava)

■ https://github.com/Netflix/RxJava/wiki/TransformingObservables● Module Messaging (Kafka): https://bitbucket.org/trieunt/kafka

○ Tìm 1 cơ chế quản lý configs và rotate kafka logs 1 cách an toàn hơn (hiện đang bị 1 issue Kafka Consumer chưa đọc xong mà Kafka log đã move đi => kg tìm thấy offset để đọc tiếp => thiếu data)

○ Dự đoán tốc độ tăng file Kafka log để chọn 1 configs tối ưu cho từng loại sản phẩm (machine learning (linear regression) for system performance)

○ Tạo mapping (thời gian, offset và binary offset files) (lúc cần parse lại thì dễ tìm files)○ Quản lý + index lại offset của Kafka theo thời gian (giờ, ngày, ...), lúc cần thì set vào là chạy

reparse lại (hiện chưa implement)● Module Stream Data Processing: https://bitbucket.org/trieunt/rfx/wiki/Home

○ Quản lý memory của worker node (nếu set HeapSize quá thấp => Worker sẽ die/restart liên tục do kg đủ memory để chạy vì log nhiều)

○ Cơ chế extensions/plugins/hooking vào hệ thống (phân chia core và applications)○ Refactoring (tổ chức lại code cho rõ ràng) giữa logic code công việc giữa:

■ parse => ghi vào Redis (chỉ parse, counting và check rules)■ parse => ghi ra raw log files trong 1 worker (chỉ parse và write raw logs)

○ Unit Test Tools (Kafka Producer) + Test Tools (integration test) cho Reactive Topologies ○ Cải thiện chức năng debug log của Worker (ElasticSearch+Kibana)○ Monitor Front End cho tất cả các critical metrics:

■ worker nodes (logs, memory, restart time, running, died, uptime, downtime )■ alert/notification■ số lượng log đọc từ Kafka, parsed OK, check OK, save OK■ Disk Free, memory cho worker■ Backup Redis Data■ Simple Analytics Dashboard cho logs (analytics)

○ New Job Server (dùng Groovy script để dễ deploy và control qua Pub/Sub Redis)■ Synchronized Data job

● Module Active Intelligence (tính năng mới )● social data crawler Facebook/Twitter/Google+ (Rfx Social Data Crawler)● Clustering Stream Data (test case: tin tức về các vụ tai nạn xe cột / cướp giật / thảm họa thiên

nhiên) dùng Apache Spark http://spark.apache.org● Realtime Visualization Engine with HTML5 Web Socket (D3.js + Netty + Akka Actor)

https://docs.google.com/document/d/1wutns90tuW1PGR03tXhDE_-DkrdWZtfvh9R_cJRtrXk/edit?usp=sharing

https://www.google.com/url?q=https%3A%2F%2Fgithub.com%2FNetflix%2FRxJava%2Fwiki%2FTransforming-Observables&sa=D&sntz=1&usg=AFQjCNGlAbkO_uAKEksnBiybF7CBbA_VOw

https://www.google.com/url?q=https%3A%2F%2Fbitbucket.org%2Ftrieunt%2Fkafka&sa=D&sntz=1&usg=AFQjCNFOySVpN74jJK-BEd-jdMiOQeeRJQ

https://www.google.com/url?q=https%3A%2F%2Fbitbucket.org%2Ftrieunt%2Frfx%2Fwiki%2FHome&sa=D&sntz=1&usg=AFQjCNFHEnG_GnUWzukc-23RBKLeSEVm3g

http://www.google.com/url?q=http%3A%2F%2Fspark.apache.org&sa=D&sntz=1&usg=AFQjCNHcBkcejNSXwp_dwtxrQ6CFe6AmoA

OPERATION MANUAL Introduction - Samson … you for selecting the ZOOM RFX-2000 (hereafter simply called the "RFX-2000"). The RFX-2000 is a sophisticated digital reverb and multi-effect

CONECTORES COAXIALES RF Amphenol - INTERSAC - … · 82-202-RFX (RG-8) 82-202-1006-RFX (9913) Conector Macho, Clamp. Tipo N, 50 Ohm. Cable: RG-8, TWB4001(B9913) 82-63-RFX Conector

RFX-mod Workshop – Padova, 20-22 January 2009 1 Experimental QSH confinement and transport Fulvio Auriemma on behalf of RFX-mod team Consorzio RFX, Euratom-ENEA

RFx Response Document - ProcurePoint | One place … · Web viewTENDER RESPONSE 15 TENDER RESPONSE TENDER RESPONSE 15 15 RFx RESPONSE DOCUMENT 16 RFx RESPONSE DOCUMENT RFx RESPONSE

RFX-mod Programme Workshop, 20-22/01/09, Padova - T. Bolzonella1 Tommaso Bolzonella on behalf of RFX-mod team Consorzio RFX- Associazione Euratom-ENEA

RFx Supplier - KBR RFx Supplier Guide ©KBR Draft Version 1.0 Page 2 Introduction Welcome to RFx, from this site, current and potential suppliers can register and maintain information