big data analytics with hadoop 40 hour course€¦ · overview learning objectives understanding...
TRANSCRIPT
BIG DATA ANALYTICS WITH HADOOP 40 Hour Course
1
OVERVIEW
Learning Objectives
Understanding Big Data
Understanding various types of data that can be stored in Hadoop
Setting up and Configuring Hadoop in
Pseduo Distributed Mode
Distributed Mode
Understanding how Big Data & Hadoop fit in the current environment and infrastructure
WWW.WISDOMSPROUTS.COM 2
Work with Map Reduce programs
Code the Ecosystem projects
Performing Data Analytics using PIG & HIVE
Understand and work on real time use cases
Implementing a Hadoop project
Working on live/real life project on big data analytics using Hadoop eco-system
And Much More..
WWW.WISDOMSPROUTS.COM 3
COURSE HIGHLIGHTS
Detailed explanation of every topic with real world example. Examples include both Industry based as well as based on day to day activities.
Labs, Assignments and Test for every topic. Labs would be performed on Virtual Machine which will be shared with you.
Project A real use case based project will be assigned with proper documentation and all required files.
.
WWW.WISDOMSPROUTS.COM 4
COURSE FEES
Simple fee structure. No hidden costs.
25,000 INR
20,000 INR
WWW.WISDOMSPROUTS.COM 5
Support
Anytime during the Course duration and even after the course completion, you can post your queries through mail on [email protected] and we will get back to you within 24 hours
WWW.WISDOMSPROUTS.COM 6
CONTENT PROVIDED
A Hadoop Reference guide having all the topics covered in the sessions.
Lab Manual with all Labs covered.
Exercise Manual with some assignments.
Question Bank.
Government Recognised Certificate
WWW.WISDOMSPROUTS.COM 7
MODULE 1
Introduction to Big Data
8
INTRODUCTION TO BIG DATA ANALYTICS
Big Data what?
It’s characteristics.
Some facts and figures
Importance of Big Data
Need of understanding and analyzing Big Data.
Basics of Data Analytics
Problems with existing systems.
WWW.WISDOMSPROUTS.COM 9
MODULE 2
Introduction to Hadoop
10
INTRODUCTION TO HADOOP
What is Hadoop Architecture
Hadoop Job Process
File Anatomy Read Operations
Write Operations
Useful Configurations core-site.xml
hdfs-site.xml
mapred-site.xml
WWW.WISDOMSPROUTS.COM 11
HDFS ( HADOOP DISTRIBUTED FILE SYSTEM )
Significance of HDFS in Hadoop
HDFS Features
Daemons of Hadoop and functionalities NameNode
DataNode
JobTracker
TaskTracker
Secondary NameNode
WWW.WISDOMSPROUTS.COM 12
Data Storage in HDFS Blocks
Heartbeats
Data Replication
Accessing HDFS CLI (Command Line Interface) Unix and Hadoop Commands
Java Based Approach
WWW.WISDOMSPROUTS.COM 13
MAP REDUCE
Introduction to MapReduce MapReduce Architecture
MapReduce Programming Model
MapReduce Algorithm and Phases
Basic MapReduce Program Driver Code
Mapper Code
Reducer Code
WWW.WISDOMSPROUTS.COM 14
LABS
Configuring a pseudo distributed Hadoop Cluster.
Working with HDFS command line options.
Running a Word Count program.
WWW.WISDOMSPROUTS.COM 15
MODULE 3
Hadoop Ecosystem
16
HADOOP ECOSYSTEM
Hadoop Ecosystem What is ecosystem
Different ecosystem projects
Sqoop
Hive
Pig
Flume
Ambari
Hue
WWW.WISDOMSPROUTS.COM 17
Revision Test
25 questions
20 minutes
WWW.WISDOMSPROUTS.COM 18
LABS
Import data using Sqoop and query it using Hive.
Configuring a Flume agent.
Mini Project Using all Ecosystem projects on one sample weblogs data set
WWW.WISDOMSPROUTS.COM 19
MODULE 4
Brief Walk with HDFS and MapReduce
20
DEEPER DIVE
Advanced HDFS Secondary NameNode
Federation
High Availability
Advanced MapReduce Demo of Precedence levels
Partioners
Combiners
WWW.WISDOMSPROUTS.COM 21
MODULE 5
Hadoop Administration
22
HADOOP ADMINISTRATION
Cluster Planning Understanding hardware and software requirements of a Hadoop cluster
Different modes of operation of Hadoop
Precedence levels
Some dos and don’ts
WWW.WISDOMSPROUTS.COM 23
LABS
Hive Creating different types of tables
Executing different queries
Pig Working with different data types
Modes of execution
Running some pig application driven commands
Sqoop Import
Export
WWW.WISDOMSPROUTS.COM 24
MODULE 6
Data Visualization
25
DATA VISUALIZATION
Working with Hadoop ODBC Connector.
Data Visualization using Excel. Exporting Hive data
Creating graphs and interactive charts for your hive data.
Analysing hive data using power view in excel.
WWW.WISDOMSPROUTS.COM 26
MODULE 7
Use Cases Project Work
27
PROJECT
Twitter Use Case Why do companies use twitter data
How to analyze twitter data
How to
Create Twitter API
Load Some tweets into HDFS
Query twitter logs.
Revision Test
50 questions
30 minutes
WWW.WISDOMSPROUTS.COM 28
HOPE TO SEE ON-BOARD SOON….
29