big data hadoop training

6
SPIRITSOFTS ONLINE TRAINING INSTITUTE Hadoop Online training By Spiritsofts with real time experts are providing Big Data Training Online. Learn Hadoop training online, we are providing Big Data classes throughout the world. http://www.spiritsofts.com/hadoop-online-training Hadoop on-line coaching Course Content Administrator Training for Apache Hadoop Introduction to Big Data and Hadoop • What is Big Data? • What area unit the challenges for process Big data? • What technologies support Big data? • Distributed systems • What is Hadoop? • Why Hadoop? • History of Hadoop • Use Cases of Hadoop • Hadoop eco System • HDFS • Map scale back • Statistics

Upload: srinivas-k

Post on 11-Apr-2017

283 views

Category:

Education


0 download

TRANSCRIPT

Page 1: Big data hadoop training

SPIRITSOFTS ONLINE TRAINING INSTITUTE

Hadoop Online training By Spiritsofts with real time experts are providing Big Data Training Online. Learn Hadoop training online, we are providing Big Data classes throughout the world.

http://www.spiritsofts.com/hadoop-online-training

Hadoop on-line coaching Course Content

Administrator Training for Apache Hadoop

Introduction to Big Data and Hadoop

• What is Big Data?

• What area unit the challenges for process Big data?

• What technologies support Big data?

• Distributed systems

• What is Hadoop?

• Why Hadoop?

• History of Hadoop

• Use Cases of Hadoop

• Hadoop eco System

• HDFS

• Map scale back

• Statistics

Understanding the Cluster

• Typical work flow

• Writing files to HDFS

Page 2: Big data hadoop training

SPIRITSOFTS ONLINE TRAINING INSTITUTE• Reading files from HDFS

• Rack Awareness

• 5 daemons

Best Practices for Cluster Setup

• Best Practices

• How to decide on the proper hadoop distribution

• How to decide on right hardware

Cluster Setup

• Install Pseudo cluster

• Install Multi node cluster

• Configuration

• Setup cluster on Cloud - EC2

• Tools

• Security

• Benchmarking the cluster

Routine Admin procedures

• Metadata & knowledge Backups

• File system check (fsck)

• File system Balancer

• Commissioning and decommissioning nodes

• Upgrading

• Using DFS Admin

Monitoring the Cluster

• Using the online user interfaces

Page 3: Big data hadoop training

SPIRITSOFTS ONLINE TRAINING INSTITUTE• Hadoop Log files

• Setting the log levels

• Monitoring with Nagios

Install , Configure and use

• PIG

• HIVE

• HBASE

• Flume and Sqoop

• Zookeeper

Developer coaching for Apache Hadoop

Introduction to Big Data and Hadoop

• What is Big Data?

• What area unit the challenges for process Big data?

• What technologies support Big data?

• Distribution systems.

• What is Hadoop?

• Why Hadoop?

• History of Hadoop

• Use Cases of Hadoop

• Hadoop eco System

• HDFS

• Map scale back

Page 4: Big data hadoop training

SPIRITSOFTS ONLINE TRAINING INSTITUTE• Statistics

Understanding the Cluster

• Typical work flow

• Writing files to HDFS

• Reading files from HDFS

• Rack Awareness

• 5 daemons

Developing the Map scale back Application

• Configuring development atmosphere - Eclipse

• Writing Unit check

• Running regionally

• Running on Cluster

• Map Reduce workflows

How Map Reduce Works

• Anatomy of a Map Reduce job run

• Failures

• Job programing

• Shuffle and type

• Task Execution

Map Reduce varieties and Formats

• Map Reduce varieties

• Input Formats - Input splits & records, text input, binary input, multiple inputs & information input

• Output Formats - text Output, binary output, multiple outputs, lazy output and information output

Map Reduce options

• Counters

Page 5: Big data hadoop training

SPIRITSOFTS ONLINE TRAINING INSTITUTE• Sorting

• Joins - Map aspect and scale back aspect

• Side knowledge Distribution

• Map Reduce Combiner

• Map Reduce Practitioner

• Map Reduce Distributed Cache

Hive and PIG

• Fundamentals

• When to Use PIG and HIVE

• Concepts

HBASE

• CAP Theorem

• Hbase design and ideas

• Programming