Upload: lamhanh

Post on 08-Mar-2018


HADOOP ADMINISTRATION

PROSPECTUS


UNIVERSITY OF SKILLS


ABOUT HADOOP ADMINISTRATION

Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. It is part of the Apache project sponsored by the Apache Software Foundation. Hadoop makes it possible to run applications on systems with thousands of nodes involving thousands of terabytes. Its distributed file system facilitates rapid data transfer rates among nodes and allows the system to continue operating uninterrupted in case of a node failure. This approach lowers the risk of catastrophic system failure, even if a significant number of nodes become inoperative.

ABOUT ISM UNIV

ISM UNIV was established in 1994. Over the past 21 years, this premier institution has trained more than 7000 engineers in Embedded Systems and other Software Engineering courses, and has helped its students build their careers. The institution was founded and is headed by Mr. LOGANATHAN V, its CEO. Over the last 21 years it has earned goodwill and become one of the most sought-after embedded and software training institutions in India. Today we are proud to say we are ranked the #1 embedded systems training institute in India.

Our training methods and quality of service are far ahead of our competitors, which makes ISM a unique place to fine-tune skills.


WHY IS HADOOP IMPORTANT?

1. Ability to store and process huge amounts of any kind of data, quickly. With data volumes and varieties constantly increasing, especially from social media and the Internet of Things (IoT), that’s a key consideration.

2. Computing power. Hadoop’s distributed computing model processes big data fast. The more computing nodes you use, the more processing power you have.

3. Fault tolerance. Data and application processing are protected against hardware failure. If a node goes down, jobs are automatically redirected to other nodes to make sure the distributed computation does not fail. Multiple copies of all data are stored automatically.

4. Flexibility. Unlike with traditional relational databases, you don’t have to preprocess data before storing it. You can store as much data as you want and decide how to use it later. That includes unstructured data like text, images and videos.

5. Low cost. The open-source framework is free and uses commodity hardware to store large quantities of data.

6. Scalability. You can easily grow your system to handle more data simply by adding nodes. Little administration is required.
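
Points 3 and 6 above can be illustrated with a small sketch. The Python below is not Hadoop code. It is a toy model that assumes each block of data is copied to a fixed number of distinct nodes (a replication factor of 3 is assumed here for illustration) and then checks which blocks stay readable after some nodes fail. The function names `place_blocks` and `readable_blocks` are hypothetical.

```python
from itertools import cycle, islice


def place_blocks(num_blocks, nodes, replication=3):
    """Assign each block to `replication` distinct nodes, round-robin."""
    if replication > len(nodes):
        raise ValueError("replication factor exceeds number of nodes")
    placement = {}
    for block in range(num_blocks):
        start = block % len(nodes)
        # Take `replication` consecutive nodes, wrapping around the list.
        placement[block] = list(islice(cycle(nodes), start, start + replication))
    return placement


def readable_blocks(placement, failed):
    """Return the blocks that still have at least one replica on a live node."""
    failed = set(failed)
    return [b for b, replicas in placement.items()
            if any(n not in failed for n in replicas)]


nodes = ["node1", "node2", "node3", "node4", "node5"]
placement = place_blocks(num_blocks=10, nodes=nodes)
# With three replicas per block, losing any two nodes leaves every block readable.
survivors = readable_blocks(placement, failed=["node2", "node4"])
print(len(survivors), "of", len(placement), "blocks readable")
```

In this toy model, adding a node only means extending the `nodes` list, which is the sense in which the system grows "simply by adding nodes". A real cluster places replicas with its own policy, so the round-robin rule here is only an assumption.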


PLACEMENT RECORDS

We are a proud institution, having helped most of our students in their career-building process. We have a placement record which is genuinely far ahead of any of our competitors.

We have a client base across India and abroad. We work with MNCs and MSI, supply all our clients with trained manpower, and ensure that our clients are satisfied with the manpower supplied. We ensure this with quality training.

We provide 100% genuine placement assistance and guidance and help you to begin an innovative career. We promise to provide interviews until you get a job.

We conduct 25 interviews per month and place 40 students per month, and have placed 5000+ students so far.

CERTIFICATE COURSE ON HADOOP ADMINISTRATION

The Hadoop Cluster Administration training course is designed to provide the knowledge and skills needed to become a successful Hadoop Architect. It starts with the fundamental concepts of Apache Hadoop and the Hadoop cluster. It covers how to deploy, configure, manage, monitor, and secure a Hadoop cluster. The course also covers HBase administration. There will be many challenging, practical and focused hands-on exercises for the learners. By the end of this Hadoop Cluster Administration training, you will be prepared to understand and solve real-world problems that you may come across while working on a Hadoop cluster.


COURSE OUTLINE

1. Introduction to Big Data

What is Big Data ?

Big Data Facts

The Three V’s of Big Data

2. Understanding Hadoop

What is Hadoop ?

Why learn Hadoop ?

Relational Databases Vs. Hadoop

Motivation for Hadoop

6 Key Hadoop Data Types

3. The Hadoop Distributed File System (HDFS)

What is HDFS ?

HDFS components

Understanding Block Storage

The Name Node

Data Node Failures

HDFS Commands

HDFS File Permissions

4. The MapReduce Framework

Overview of MapReduce

Understanding MapReduce

The Map Phase

The Reduce Phase

WordCount in MapReduce

Running MapReduce Job

5. Planning Your Hadoop Cluster

Single Node Cluster Configuration

Multi-Node Cluster Configuration


6. Cluster Maintenance

Checking HDFS Status

Breaking the Cluster

Copying Data Between Clusters

Adding And Removing Cluster Nodes

Rebalancing the cluster

Name Node Metadata Backup

Cluster Upgrading

7. Installing and Managing Hadoop Ecosystem Projects

Sqoop

Flume

Hive

Pig

HBase

Oozie

8. Managing and Scheduling Jobs

Managing Jobs

The FIFO Scheduler

The Fair Scheduler

How to stop and start jobs running on the cluster

9. Cluster Monitoring, Troubleshooting, and Optimizing

General System conditions to Monitor

Name Node and Job Tracker Web UIs

View and Manage Hadoop’s Log files

Ganglia Monitoring Tool

Common cluster issues and their resolutions

Benchmark your cluster’s performance

10. Populating HDFS from External Sources

How to use Sqoop to import data from RDBMSs to HDFS

How to gather logs from multiple systems using Flume

Features of Hive, HBase and Pig

How to populate HDFS from external Sources
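
Module 4 of the outline lists WordCount as its MapReduce example. As a rough preview, and not actual Hadoop API code, the sketch below mimics the map, shuffle, and reduce phases in plain Python on a single machine. The function names and the sample input are illustrative assumptions.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by key, as the framework would."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: sum the counts collected for one word."""
    return key, sum(values)


lines = ["big data needs Hadoop", "Hadoop stores big data"]
pairs = [pair for line in lines for pair in map_phase(line)]
counts = dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
print(counts)
```

On a real cluster the map and reduce functions run in parallel on many nodes and the framework performs the shuffle between them. Here all three steps run in one process so that the data flow is easy to follow.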


COURSE FEE

Course Fee : Rs. 8,000/-

Duration : 48 hrs

COURSE INCLUDES

1. 80% Instructor-led Training (38 hrs)

2. 20% Online Training (10 hrs)

3. Course Materials

4. Certificate

5. Placement Guidance

6. Recaps & Tests

VALUE ADDITION (FREE) FOR ALL STUDENTS

1. 2 e-learning courses

2. Soft copy of all software used in the course

3. Access to E-library during course period

ELIGIBILITY

Basics of Linux


OUR CLIENTS WHO RECRUIT ISM STUDENTS

These are a few of the clients who recruit ISM students. They are only a few from the list; we have a client list of 1500+.


INDIA’S LEADING & LARGEST SOFTWARE TRAINING INSTITUTE

Bangalore: #29/18, 17th E Main, 5th Block, Rajajinagar, Near Madduramma Temple, JD Halli, Bangalore-560010

Ph: 91 80 40494949, 23100524, 91 94484 74282

Hyderabad: #6-3-347/22/2, 1st Floor, Aishwarya Nilayam, Near Saibaba Mandir, Dwarakapuri Colony, Panjagutta, Hyderabad-500082

Ph: 91 40 40040518, 23353654, 91 89789 93264

[email protected] www.ismuniv.com


ISO 9001-2008
