cloumon product introduction

10
CLOUMON The powerful Hadoop open-source software stack requires careful integration, calibration, and monitoring, which is why Gruter has developed its own in-house cloud management solution, Cloumon. With a user-friendly interface and management console, Cloumon enables system administrators to optimize the Hadoop ecosystem and take control of the cloud across the entire data lifecycle. ENJOY HADOOP Hadoop, Hive and Hadoop Ecosystem Monitoring and Management System CLOUMON CH (Core Hadoop) Hadoop (HDFS, MapReduce) and Hive CLOUMON PA (Power Analytics) Advanced Analysis Rule Manager, Streaming Data Processing Manager, and Interactive Analysis Query Manager CLOUMON EPs (Extension Packs) Oozie, HBase, ZooKeeper, and Flume R R R R

Upload: gruter-corp

Post on 26-Jan-2015

110 views

Category:

Technology


6 download

DESCRIPTION

Cloumon - Big data platform monitoring and management Solution http://www.gruter.com

TRANSCRIPT

Page 1: Cloumon Product Introduction

CLOUMON

The powerful Hadoop open-source software stack requires careful integration,

calibration, and monitoring, which is why Gruter has developed its own in-house cloud

management solution, Cloumon. With a user-friendly interface and management

console, Cloumon enables system administrators to optimize the Hadoop ecosystem

and take control of the cloud across the entire data lifecycle.

ENJOY HADOOP

Hadoop, Hive and Hadoop Ecosystem

Monitoring and Management System

CLOUMON CH (Core Hadoop)

Hadoop (HDFS, MapReduce) and Hive

CLOUMON PA (Power Analytics)

Advanced Analysis Rule Manager, Streaming Data Processing

Manager, and Interactive Analysis Query Manager

CLOUMON EPs (Extension Packs)

Oozie, HBase, ZooKeeper, and Flume

R

R

R

R

Page 2: Cloumon Product Introduction

CLOUMON KEY FEATURES

DATA STORAGE

HDFS File

Manager

Hive Table

Manager

HBase Data

Manager

MapReduce Job

Manager

Job Workflow Manager

Hive Query

Manager

MANAGEMENT

ZooKeeper Node

Manager

Real-time Analysis Manager

Flume Data Flow Manager

ZooKeeper

Esper Query

Manager Esper

DATA COLLECTION

HDFS File

Hive Table

HBase

Collector Data Agent

DATA ANALYSIS

MapReduce

Hive Query

Pig Query

Flume Data Flow

Manager

STREAMING DATA PROCESSING

Collect and graph metrics from target daemon servers including NameNode, DataNode, JobTracker and TaskTracker

Create alerts by setting thresholds on target metrics and servers

Construct highly visible log data management views

Monitor system resource usage

MONITORING

Manage integrated configurations for server groups

Conveniently access optimized Hive and Oozie functionality

Remotely control servers to perform stop-start maintenance routines

Run various Hadoop distributions including Apache Hadoop 1.0.x, Apache Hadoop 2.0.x and CDH Hadoop 4.2.x

Control multiple Hadoop clusters

CLUSTER MANAGEMENT

Manage entire data lifecycle from data collection to storage, batch analysis and real-time analysis

Browse files with Hadoop File Browser; create and execute queries with Hive Query Workbench

Design and schedule workflows with Oozie Workflow Designer; manage ZNode with ZooKeeper Manager

DATA MANAGEMENT

1 Enjoy Connecting GRUTER

* Key Cloumon management zones in orange

R

R

Page 3: Cloumon Product Introduction

Cloumon CH provides a streamlined environment for the operation of Hadoop and Hive, the core components of

advanced Big Data platforms. Through enhanced component visibility and task management features, Cloumon CH

gives unprecedented access to and control over Big Data systems.

2 Enjoy Connecting GRUTER

CLOUMON CH (Core Hadoop)

HDFS Manager

HDFS Cluster Manager

HDFS daemon status monitoring

Remote server control

Server group configuration

Comprehensive metric monitoring

Integrated log view creation and

management

Multiple cluster commissioning and

management

User-configured server threshold

alerts

· Monitor status and failures on NameNode, JournalNode, SecondaryNameNode,

DataNode and DFSZKFailoverController

· Use simple pre-configured wizards to add new servers to running clusters

(coming release — Q2 2013)

· Start and stop servers remotely

KEY FEATURES

· Manage configurations in server groups

· Detect servers with asymmetric configurations automatically

· Apply configurations to all clusters or specific target servers

· Collect HDFS metrics at single minute intervals

· Track performance history and graph server metrics for thorough system analysis

· Set disk usage thresholds by server and partition

· Set alert thresholds on all HDFS metrics

· Set SMS alerts for critical metrics via Alert Plugin

Major HDFS distribution compatibility · Compatible with major distributions including Apache Hadoop 0.20.x, Apache

Hadoop 1.0.x, Apache Hadoop 2.0.x, CHD 4.1.x, CDH 4.2.x

HDFS commands

List sorting

· Execute commands including list, mkdir, delete, chown and chmod

· Sort lists by name, size, date and owner to improve search speed

And more: Directory tree views; file block information; file data views; file download/upload capabilities

HDFS File Browser

KEY FEATURES

· Create one-stop views of logs from across the distributed system

· Commission and manage multiple clusters as system scales out

R

Page 4: Cloumon Product Introduction

MapReduce Manager

MapReduce daemon status monitoring

Remote server control

Server group configuration

Comprehensive metric monitoring and

configurable server threshold alerts

Integrated log view creation and

management

Multiple cluster commissioning and

management

· Monitor status and failures on JobTracker and TaskTracker

· Use simple pre-configured wizards to add new servers to running clusters

(coming release — Q2 2013)

· Start and stop servers remotely

· Manage configurations in server groups

· Detect servers with asymmetric configurations automatically

· Apply configurations to all clusters or specific target servers

· Set disk usage thresholds by server and partition

· Set alert thresholds on all HDFS metrics

· Set SMS alerts for critical metrics via Alert Plugin

MapReduce Cluster Manager

KEY FEATURES

MapReduce Job Manager

KEY FEATURES

Job management

Job status monitoring

· Manage current job information and track job history

· Filter job lists by status and period

· Monitor task status and job counter

· Track full execution history

Task profiling · Profile task execution progress and elapsed execution time

Task control · Abort processes through stop task functionality

Scheduler monitoring · Monitor fair scheduler mode queue status

· Manage queues

Hive and Oozie integration · Monitor Hive query executions

3 Enjoy Connecting GRUTER

· Create one-stop views of logs from across the distributed system

· Commission and manage multiple clusters as system scales out

Page 5: Cloumon Product Introduction

Hive Query and Hive Configuration

KEY FEATURES

Hive connection management

HDFS

Apache Hadoop 0.20.x

Apache Hadoop 1.0.x

Apache Hadoop 2.0.x

CDH 4.1.x

CDH 4.2.x

MapReduce

Apache Hadoop 0.20.x

Apache Hadoop 1.0.x

CDH 4.x-mr1

Hive

Apache Hive 0.8.x

Apache Hive 0.9.x

Apache Hive 0.10.x

CDH 4.1.x Hive

CDH 4.2.x Hive

· Support multiple connections with built-in Hive delegator (Hive installation not

required)

Hive session management · Manage driver sessions and track query execution status

Table meta viewer and table viewer · Generate detailed table description views and data tables

Multiple query executor · Execute multiple simultaneous queries

User-defined jar and script

management · Upload/delete/apply UDF and Custom M/R

Progress viewer and query status

inquiry · Check query execution progress and track execution history

Query management · Generate saved query and Hive function description views

Table and query wizard · Use simple pre-configured wizards to create tables and queries

Configuration management · Edit and dynamically deploy Hive and Hadoop client configurations

· Access comprehensive storage usage, partitioning and bucket information.

OS

WebServer

DataBase

Java Virtual Machine

Linux, Windows

Tomcat 6.x

MySQL 5.x

JDK6

4 Enjoy Connecting GRUTER

Hive Manager

Versions Supported System Requirements

Web-based support

8x5

24x7

Phone support

2-24 hour initial response time

Service SLA

Page 6: Cloumon Product Introduction

5 Enjoy Connecting GRUTER

CLOUMON PACKAGE CLOUMON PA (Power Analytics)

Cloumon PA is a high-performance Big Data system which brings together a powerful set of cutting-edge

technologies and tools to help you perform advanced analytics on Hadoop and Hive.

Smart query building processes and intuitive execution flows generate sophisticated outputs in just a few clicks

without the need for complex query syntax.

Stream Processing Rule Manager

Console for streaming data

processing

KEY FEATURES

Interactive Analytics (Impala, Tajo)

Impala

Tajo

KEY FEATURES

· Manage metadata such as table schemas for integration with Hive

· Use Impala query workbench

· Monitor status of Impala clusters

· Manage metadata such as table schemas

· Use Tajo query workbench

· Monitor status of Tajo clusters

· Manage entire lifecycle of streaming data processing by registering data type,

configuring parser, managing EPL queries for analysis, and storing and querying

results, among other functionalities

Data type management and

configuration · Define type name, column, record parser and result table

· Use built-in storage interfaces such as HBase and MySQL

· Extend interface to add and select user-defined storage Analysis result storage management

· Manage EPL queries

· Add and delete queries dynamically in a running environment

· Have results stored automatically in selected storage

Analysis query manager

· Visualize results using various charts and graphs according to data type Analysis output visualization

· Manage Hive/Tajo/Impala queries in an integrated fashion and choose optimal

execution platform

· Manage queries in concert with Advanced Analysis Rule Manager

R

Page 7: Cloumon Product Introduction

Advanced Analysis Rule Manager

Hive Query Based Analysis Rule Management

Analysis target object management

KEY FEATURES

· Select analysis targets such as Hive table and existing queries, among others

· Manage aliases down to fields and rules via user-friendly UI

· Build complex queries with multiple “join”, “group by”, and “order by”

functions/clauses simply and quickly

· Employ variables for high re-usability and productivity

· Define materialized views as analysis targets without burden of generating

actual views

· Visualize usage of individual rules and their interrelationships through charting

and graphing tools

· Manage execution start time

· Track execution history

· Create fresh results at each execution or conveniently reuse previous results at point of execution

6 Enjoy Connecting GRUTER

KEY FEATURES

Hive query builder

Analysis target querying

Rule charting

Execution and Result Management

Scheduling

Query optimization

Dynamic variable binding at execution

Multiple storage options · Use Hive Table, HDFS Directory and HBase Table

Powerful viewer and APIs to access analysis outputs

· Bind actual values to variables dynamically at point of execution

· Create fresh results at each execution or conveniently reuse previous results at point of execution

Page 8: Cloumon Product Introduction

Cloumon EPs provide additional monitoring and management capabilities for other key components of the Hadoop

ecosystem including Oozie, HBase, Flume and Zookeeper, granting comprehensive control of the entire data lifecycle

from data collection, storage and workflow design to task scheduling and distributed system role management.

7 Enjoy Connecting GRUTER

CLOUMON PACKAGE CLOUMON EPs (Extension Packs)

HBase Manager

HBase Cluster Management

Table data scanning

KEY FEATURES

Oozie Workflow Manager

Wysiwyg job designer

Library file management

Job execution management

· Upload jar files

· Manage mapper, reducer, and writable classes

· Manage job libraries (distributed cache)

KEY FEATURES

· Schedule job execution

· Track job execution history

· Monitor jobs via integrated Cloumon MapReduce Job Manager

· Collect information at single minute intervals

· Track history of metric changes over time and chart in time-series

· Fetch lists of tables and look up table schemas

· Manage table region lists and region lists on RegionServers

· Monitor detailed region metrics

· Create and drop table (Q3 2013)

· Perform Region compaction, split, and merge

· Execute and schedule jobs according to user-configured rules

HMaster and RegionServer alerts

Single server metric monitoring

Table and Region status monitoring

Manage Region (Q3 2013)

HBase Data Management

Column data fetching by row query

Long type transformation

Web-based HBase shell (Q3 2013)

· Automatically convert byte array long to numeric long for readability

R

Page 9: Cloumon Product Introduction

8 Enjoy Connecting GRUTER

ZooKeeper Manager

ZooKeeper Cluster Management

Monitor server status and set alerts

KEY FEATURES

Flume Manager for Flume-OG (v0.9.4)

Data flow management

Powerful configuration tool

KEY FEATURES

· Inspect data flow between agent and collector

· Monitor workloads of each node using workload indicators

· Design data processing flows via powerful tool which allocates source, deco

and sink

· Easily set parameters with pre-configured forms and help tips

· Reuse and edit existing configurations

Physical/logical node status

monitoring

· Check overview of node status and drill down to analyze specific details

· List logical nodes on specific physical nodes

Map/unmap/decommission/purgeAll · Control the entire lifecycle of logical nodes with minimal clicks

· Use smart proxies to complete complex jobs in a single click

Multiple cluster management · Manage multiple clusters

· Collect metrics at single minute intervals

· Track metric change history

View detailed ZooKeeper server

metrics

Monitor ZooKeeper connections · Monitor and inspect all connections to ZooKeeper servers

Manage multiple clusters simultaneously

Easily manage zNodes by accessing detailed information and manipulating data through convenient file browser

interface

Manage ACLs for each zNode

Manage zNode watcher registration

ZooKeeper Node Management

· Create integrated views by combining data from Flume masters and ZooKeeper

Page 10: Cloumon Product Introduction

Phone: +82-2-508-5911

Fax: +82-2-508-5912

E-mail: [email protected]

Web: www.gruter.com

For demo videos, please visit: www.gruter.com/products/cloumon#video

GRUTER, INC.

5F Sehwa Office Building 889-70 Daechi-dong, Gangnam-gu, Seoul, South Korea 135-839

Gruter: Your Partner in the Big Data Revolution