GreyCampus provides instructor-led classes on Hadoop Administration. The course is intended for system administrators, DBAs, Linux admins and software engineers responsible for managing and maintaining Hadoop clusters. It is designed to provide the knowledge needed to become a successful Hadoop Administrator, and it covers Hadoop architecture and its components as well as managing, maintaining, monitoring and troubleshooting a Hadoop cluster. The focus is on hands-on experience, so the course includes multiple assignments, quizzes and a project.
COURSE OBJECTIVES
Upon successful completion of this course, participants should be able to:
• Describe the fundamental concepts of using Big Data
• Identify where Hadoop fits into Big Data
• Understand Hadoop architecture and HDFS
• Gain insight into YARN and MapReduce
• Install and configure Apache ecosystem tools
• Carry out configuration and performance tuning
• Learn about Hadoop clusters
• Manage, maintain, monitor and troubleshoot a Hadoop cluster
COURSE INCLUSION
ONE YEAR ACCESS
Participants will have access to the GreyCampus learn platform for a period of one year. This includes access to the course PPTs, reading material, quizzes, assignments, project, and class videos.
DEDICATED SUPPORT
Participants will receive technical and non-technical support by email within 1 business day. They can send their queries to [email protected] or call the toll-free number 1800 102 0723.
FACT SHEET
HADOOP ADMINISTRATOR TRAINING & CERTIFICATION
BECOME A CERTIFIED HADOOP ADMINISTRATOR
© www.greycampus.com
VIRTUAL MACHINE
Participants will be provided with instructions to set up their virtual machine before the course starts.
HANDS ON PROJECT
At the end of the course, participants submit a project that covers all the key aspects of the course. This allows them to implement the techniques they learnt in the course.
COURSE CERTIFICATION
Upon completing 30 hours of training, participants will be given a project, which they have to submit within 15 days. Successful completion of the project makes participants eligible for the GreyCampus certificate.
30 PDUS
30 PDUs will be sent to PMI credential holders within 2 business days upon request.
SYSTEM REQUIREMENTS
• Min. 2 Mbps Internet Connectivity
• Multimedia PC with Speakers/Headphones and Microphone
• Windows 7 (or newer) / Mac OS 10.7 (Lion) or newer
• Power backup (preferred) through the duration of the Live Online classes
• Power backup: for both Internet Router and PC
COURSE AGENDA
MODULE 1: UNDERSTANDING BIG DATA AND HADOOP
• Big Data
• Limitations and Solutions of existing Data Analytics Architecture
• Hadoop
• Hadoop Features
• Hadoop Ecosystem
• Hadoop 2.x core components
• Hadoop Storage: HDFS
• Hadoop Processing: MapReduce Framework
• Anatomy of File Write and Read
• Rack Awareness
MODULE 2: HADOOP ARCHITECTURE AND HDFS
• Hadoop 2.x Cluster Architecture - Federation and High Availability
• A Typical Production Hadoop Cluster
• Hadoop Cluster Modes
• Common Hadoop Shell Commands
• Installation of Hadoop on Single Node/Multi Cluster Environment
• Hadoop 2.x Configuration Files, Password-Less SSH
• MapReduce Job Execution
• Data Loading Techniques: Hadoop Copy Commands
• Flume
• Sqoop
• Node roles
• Data Processing
• Network configuration
MODULE 3: YARN AND MAPREDUCE
• What Is MapReduce?
• Basic MapReduce Concepts
• YARN Cluster Architecture
• Resource Allocation
• Failure Recovery
• Using the YARN Web UI
• MapReduce Version 1
MODULE 4: LOAD DATA AND RUN APPLICATIONS
• Data Loading Techniques: Hadoop Copy Commands
• Flume
• Sqoop
MODULE 5: INSTALLING AND CONFIGURING APACHE ECOSYSTEM TOOLS
• Hive
• Pig
• Mahout
• HBase
• HCatalog/Hive
• HBase Administration
MODULE 6: ADVANCED CLUSTER CONFIGURATION
• Advanced Configuration Parameters
• Configuring Hadoop Ports
• Explicitly Including and Excluding Hosts
• Configuring HDFS for Rack Awareness
• Configuring HDFS High Availability
MODULE 7: HADOOP SECURITY
• Why Hadoop Security Is Important
• Hadoop’s Security System Concepts
• What Kerberos Is and How it Works
• Securing a Hadoop Cluster with Kerberos
MODULE 8: MANAGING AND SCHEDULING JOBS
• Managing Running Jobs
• Scheduling Hadoop Jobs
• Configuring the Fair Scheduler
MODULE 9: CONFIGURATION AND PERFORMANCE TUNING
• OS
• JVM and Hadoop configuration parameters tuning
MODULE 10: INSTALLING AND CONFIGURING APACHE ECOSYSTEM TOOLS
• Checking HDFS Status
• Copying Data between Clusters
• Adding and Removing Cluster Nodes
• Rebalancing the Cluster
• Cluster Upgrading
• General System Monitoring
• Monitoring Hadoop Clusters
• Common Troubleshooting of Hadoop Clusters
• Common Misconfigurations
• Checking Logs and Log File Locations
TRAINED OVER 15,000 PROFESSIONALS
REACH ACROSS 50+ COUNTRIES
EXAM PASS RATE OF OVER 97%
COURSES ACCREDITED BY LEADING GLOBAL BODIES
ABOUT GREYCAMPUS
GreyCampus is a leading provider of on-demand training that addresses the unique learning needs of professionals, delivered as online self-learning, live online training or in-person classroom training. Our aim is to provide quality training that enables professionals to achieve their certification and career enhancement goals. We offer training for certifications in the areas of Big Data & Hadoop, Project Management, IT Service Management, Quality Management, Python Programming, Agile Training Coaching & Certification and Workplace Tools.
DISCLAIMER
“PMI®”, “PMBOK®”, “PMP®”, “CAPM®” and “PMI-ACP®” are registered marks of the Project Management Institute, Inc.
The Swirl logo™ is a trade mark of AXELOS Limited. ITIL® is a registered trade mark of AXELOS Limited. PRINCE2® is a registered trade mark of AXELOS Limited. IASSC® is a registered mark of International Association for Six Sigma Certification.
ACCREDITATIONS & ASSOCIATIONS
Provider ID: 3871