Big Data Hadoop Training Course

Upload: rms-software-technologies

Post on 27-Jan-2015


DESCRIPTION

Big Data Hadoop Training Course Agenda. Please contact [email protected]

TRANSCRIPT

Page 1: Big Data Hadoop Training Course

@RMSSoftwareTech [email protected]

http://www.rmssoftwaretech.com

Apache Hadoop (Big Data)

Big Data Training

Page 2: Big Data Hadoop Training Course

2 © 2014 RMS Software Tech (rmssoftwaretech.com)

Logos & Trademarks

• Note: Any logos used in this presentation are owned by their respective companies and are only used in this slide deck for educational purposes. No other companies are responsible for or provide attribution for any of the material in these slides.

• This slide deck is released under a Creative Commons License and can be reused in your own presentations. However, please research the specific meanings of these symbols: [Creative Commons license symbols from the original slide]

• You may be able to use the slide deck for purposes beyond the CC license if you email me with a special request.

• All third-party trademark rights acknowledged.

Page 3: Big Data Hadoop Training Course

Profile : RMS Software Technologies

• About us:
◦ Leading provider of software solutions, system integration services, and professional training.
◦ Based in San Jose, CA and Mumbai, India; started in 2012.

• Professional Courses:
◦ Expert training team of 10 people who provide training in various leading technologies like iPhone iOS, Android, Java, Big Data Hadoop, QA & Agile Scrum Methodologies.
◦ Team of developers using AngularJS on client projects.
◦ Focused on interns and engineers looking to learn exciting new technologies.

• Track Record:
◦ We work with emerging technologies to create mobile applications, rich-client desktop software, and large-scale systems (CRM, ERP).
◦ We create high-quality solutions for hard problems, to help our customers thrive.
◦ We can share this expertise with you developers in the form of workshop-style, hands-on training classes.

Page 4: Big Data Hadoop Training Course

Course : Prerequisites and Equipment

• Prerequisites:
◦ Students should have experience with a database management system (DBMS) such as Oracle, Informix, or Sybase. No prior experience with Big Data, NoSQL, or Hadoop is required for the course.

• Equipment:
◦ Please use a laptop (Windows, Linux, or Mac).

Page 5: Big Data Hadoop Training Course

Training Agenda

Training Schedule & Agenda

Week 1 : Hadoop Overview

Week 2 : HDFS Deep Dive

Week 3 : MapReduce and Pig

Week 4 : Hive and HBase

Week 5 : ZooKeeper, Oozie, Flume, Sqoop, Talend

Week 6 : Practice Questions, Q & A with Final Project

We believe this curriculum covers the basics well and positions students to use Hadoop effectively and efficiently. It provides a good overview of Hadoop and Big Data.

Page 6: Big Data Hadoop Training Course

Week 1 : Hadoop Overview

•  Brief History of Hadoop

•  RDBMS/SQL vs. Hadoop

•  Structured vs. Unstructured data

•  Introduction to Hadoop Ecosystem (HDFS, MapReduce, Pig, Hive, HBase)

•  HDFS Overview (NameNode vs. DataNode)

•  MapReduce overview (JobTracker vs. TaskTracker)

•  Hadoop XML files for configuration

•  Hadoop Ecosystem (Hive, Pig, HBase, ZooKeeper, Mahout, Oozie, Talend, Sqoop, Flume)

•  Lab #1 Virtual Machine Setup

Page 7: Big Data Hadoop Training Course

Week 2 : HDFS Deep Dive

•  NameNode Architecture

•  DataNode Architecture

•  Write Pipeline

•  Read Pipeline

•  HDFS disk space quotas and file count quotas

•  Quick intro to the Java API

•  Lab #2.

Page 8: Big Data Hadoop Training Course

Week 3 : MapReduce and Pig

•  MapReduce Architecture

•  Combiner, Partitioner

•  JobTracker & TaskTracker

•  Job Scheduling

•  Distributed Cache

•  Counters

•  MapReduce configuration files

•  Simple MapReduce example : WordCount

•  Next Gen MapReduce : YARN.

•  Lab #3 : MapReduce

•  Lab #4 : Pig
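The WordCount example listed above can be sketched without a cluster. The following is a minimal, single-process illustration in plain Python of the roles of the mapper, combiner, partitioner, and reducer; it is not Hadoop's API. All function names (`map_phase`, `combine`, `partition`, `reduce_phase`, `word_count`) and the byte-sum partitioning rule are assumptions made for illustration only.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every whitespace-separated token."""
    return [(word.lower(), 1) for word in line.split()]


def combine(pairs):
    """Combiner: pre-sum the counts of one mapper's output before the shuffle."""
    local = defaultdict(int)
    for word, count in pairs:
        local[word] += count
    return list(local.items())


def partition(word, num_reducers):
    """Partitioner: choose a reducer index for a key (illustrative rule only)."""
    return sum(word.encode("utf-8")) % num_reducers


def reduce_phase(word, counts):
    """Reducer: sum all partial counts received for one word."""
    return word, sum(counts)


def word_count(lines, num_reducers=2):
    # Shuffle: one bucket per reducer, mapping word -> list of partial counts.
    buckets = [defaultdict(list) for _ in range(num_reducers)]
    for line in lines:  # each input line stands in for one map task
        for word, count in combine(map_phase(line)):
            buckets[partition(word, num_reducers)][word].append(count)

    # Each reducer processes only the keys that were routed to its bucket.
    result = {}
    for bucket in buckets:
        for word, counts in bucket.items():
            key, total = reduce_phase(word, counts)
            result[key] = total
    return result


if __name__ == "__main__":
    print(word_count(["the quick brown fox", "the lazy dog the end"]))
```

In this sketch the combiner reduces the second line's two "the" pairs to a single partial count before the shuffle, which is the kind of saving a combiner is meant to provide.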

Page 9: Big Data Hadoop Training Course

Week 4: Hive and HBase

• Hive architecture.

• Hive vs. RDBMS.

• HiveQL and Hive Shell

• Managing Tables

• Querying Data

• Data Types and Schemas

•  Introduction to UDF (User Defined Functions)

• HBase Architecture

• HBase vs. Cassandra

Page 10: Big Data Hadoop Training Course

Week 4: HBase

• Bloom Filters and Block indexes

• Table Scans and Filters

•  Lab # Intro to HBase command line.
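As background for the Bloom filter topic above, here is a minimal sketch of the data structure in plain Python. It is a generic illustration, not HBase's implementation: the class name `BloomFilter`, its parameters, and the use of seeded SHA-256 hashes are all assumptions chosen for readability. The property it demonstrates is the one that makes Bloom filters useful for skipping files during reads: a negative answer is definite, while a positive answer only means "possibly present".

```python
import hashlib


class BloomFilter:
    """Illustrative Bloom filter; sizes and hash choice are assumptions."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        # One byte per bit keeps the sketch simple (not space-efficient).
        self.bits = bytearray(num_bits)

    def _positions(self, key):
        # Derive num_hashes bit positions by hashing the key with different seeds.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos] = 1

    def might_contain(self, key):
        # False means the key was definitely never added;
        # True means it may have been added (false positives are possible).
        return all(self.bits[pos] for pos in self._positions(key))


if __name__ == "__main__":
    bf = BloomFilter()
    bf.add("row-001")
    print(bf.might_contain("row-001"))
```

A larger `num_bits` lowers the false-positive rate at the cost of memory, which is the basic trade-off when sizing such a filter.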

Page 11: Big Data Hadoop Training Course

Week 5: ZooKeeper, Oozie, Flume, Sqoop, Talend

• Flume overview

• Flume usage

• Sqoop overview

• Sqoop usage

• Hadoop workflow

•  JobControl

• Oozie

• Talend

• Sqoop

Page 12: Big Data Hadoop Training Course

Week 6: Project and Practice Questions

• Sample Project

• Practice Questions

• Q & A

Page 13: Big Data Hadoop Training Course

Foundation for tomorrow

@RMSSoftwareTech

[email protected]