big data hadoop training course

Post on 27-Jan-2015

109 Views

Category:

Education

2 Downloads

Preview:

Click to see full reader

DESCRIPTION

Big Data Hadoop Training Course Agenda Please contact info@rmssoftwaretech.com

TRANSCRIPT

@RMSSoftwareTech training@rmssoftwaretech.com

http://www.rmssoftwaretech.com

Apache Hadoop (Big Data)

Big Data Training

2 © 2014 RMS Software Tech (rmssoftwaretech.com)

Logo’s & Trademarks

• Note: Any logos used in this presentation are owned by their respective companies and are only used in this slide deck for educational purposes. No other companies are responsible for or provide attribution for any of the material in these slides.

• This slide deck is released under a Creative Commons License and can be reused in your own presentations, however please research the specific meanings of these symbols:

• You may be able to use the slide deck for purposes beyond the CC license if you email me with the special request.

• All third party trademark rights acknowledged

3 © 2014 RMS Software Tech (rmssoftwaretech.com)

Profile : RMS Software Technologies

•  About us : ◦  Leading provider of Software solutions, System integration Services and

Professional Training ◦  Based in San Jose, CA & Mumbai, India started in 2012.

• Professional Courses : ◦  Expert Training team of 10 people who provide training in various leading

technologies like iPhone iOS, Android, Java, Big Data Hadoop, QA & Agile Scrum Methodologies.

◦  Team of developers using AngularJS at the client projects. ◦  Focused on Interns and engineers looking to learn new exciting technologies.

• Track Record: ◦  We work with emerging technologies to create mobile applications, rich-client desktop

software, and large-scale systems (CRM, ERP). ◦  We create high quality solutions for hard problems, to help our customers thrive ◦  We can share this expertise with you developers, in the form of workshop style, hands-on

training classes.

4 © 2014 RMS Software Tech (rmssoftwaretech.com)

Course : Prerequisites and Equipment

•  Prerequisites : ◦  Students should have experience with Database (DBMS) like Oracle,

Informix, Sybase. No prior experience of Big Data or NOSQL and Hadoop is required for the course.

• Equipment : ◦  Please use - Laptop (Windows, Linux, or Mac).

5 © 2014 RMS Software Tech (rmssoftwaretech.com)

Training Agenda

Training Schedule & Agenda

Week 1 : Hadoop Overview

Week 2 : HDFS Deep Dive

Week 3 : MapReduce and Pig

Week 4 : Hive and HBase

Week 5 : Zookeeper, Oozie, Flume, Talend

Week 6 : Practice Questions, Q & A with Final Project We believe this curriculum covers the basics well, and positions students to use Hadoop effectively and efficiently. It provides a good overview on Hadoop and Big Data

6 © 2014 RMS Software Tech (rmssoftwaretech.com)

Week 1 : Hadoop Overview

•  Brief History of Hadoop

•  RDBMS/SQL vs. Hadoop

•  Structured vs. Unstructured data

•  Introduction to Hadoop Ecosystem (HDFS, MapReduce, Pig, Hive, HBase)

•  HDFS Overview (NameNode vs. DataNode)

•  MapReduce overview (JobTracker vs. TaskTracker)

•  Hadoop XML files for configuration

•  Hadoop Ecosystem (Hive, Pig, Hbase, Zookeeper, Mahout, Oozie, Talend, Scoop, Flume)

•  Lab #1 Virtual Machine Setup

7 © 2014 RMS Software Tech (rmssoftwaretech.com)

Week 2 : HDFS Deep Dive

•  NameNode Architecture

•  DataNode Architecture

•  Write Pipeline

•  Read Pipeline

•  HDFS Disk space quotas and number of file quotas

•  Quick Intro to Java API interface

•  Lab #2.

8 © 2014 RMS Software Tech (rmssoftwaretech.com)

Week 3 : MapReduce and Pig

•  MapReduce Architecture

•  Combiner, Partitioner

•  JobTracker & TaskTracker

•  Job Scheduling

•  Distributed Cache

•  Counters

•  MapReduce configuration files

•  Simple MapReduce example : WordCount

•  Next Gen MapReduce : YARN.

•  Lab #3 : MapReduce

•  Lab #4 : Pig

9 © 2014 RMS Software Tech (rmssoftwaretech.com)

Week 4: Hive and HBase

• Hive architecture.

• Hive vs. RDBMS.

• HiveQL and Hive. Shell

• Managing Tables

• Querying Data

• Data Types and Schemas

•  Introduction to UDF (User Defined Functions)

• HBase Architecture

• HBase vs. Cassandra

10 © 2014 RMS Software Tech (rmssoftwaretech.com)

Week 4: HBase

• Bloom Filters and Block indexes

• Table Scans and Filters

•  Lab # Intro to HBase command line.

11 © 2014 RMS Software Tech (rmssoftwaretech.com)

Week 5: Zookeeper, Oozie, Flume, Sqoop, Talend

• Flume overview

• Flume usage

• Sqoop overview

• Sqoop usage

• Hadoop workflow

•  Jobcontrol

• Oozie

• Talend

• Sqoop

12 © 2014 RMS Software Tech (rmssoftwaretech.com)

Week 6: Project and Practice Questions

• Sample Project

• Practice Questions

• Q & A

Foundation for tomorrow

@RMSSoftwareTech

training@rmssoftwaretech.com

top related