duration : 6 months · scikit learn. module name segment overview of the course intro to data...

14
DATA SCIENCE MASTERS PROGRAM Boost your professional skills and shoot up your career to a new high with this professional certification course Duration : 6 Months 4.6 Rated by more than 2000 users! Eckovation ( Lifetime Content Access )

Upload: others

Post on 24-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

DATA SCIENCE MASTERS PROGRAMBoost your professional skills andshoot up your career to a new high with this professional certification course

Duration : 6 Months

4.6Rated by more

than 2000 users!

Eckovation

( Lifetime Content Access )

Page 2: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

MODULE NAME SEGMENT

Python and its applications

Installation and configuration

Working with IDLE (Integrated Development & Learning Environment)

“Hello World” program

Data types

Variables

Introduction to Python

String manipulation

String concatenation

Indexing

Slicing of strings

Typecasting and its applications

Escape character

User inputs

PYTHON

Data structures

Lists

Tuples

Di�erences between lists and tuples

Dictionaries

Set

Practical applications of data structures

Page 3: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

If-else loop

For loop

While loop

Problems based on control loops

Control loops

Functions

Inbuilt functions in Python

Function definition

Function calling

Problems based on functions

Creation of Python modules

Packages

Installation and usage of pip (package manager)

Importing modules

Object Oriented Programming

Modules and packages

Introduction to object oriented programming design paradigm

Classes and objects

Constructor function

Class variables

Class methods

Static methods

Practical implementation of object oriented programming

Page 4: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

Introduction to exceptions

Try and except block

‘Finally’ keyword

File handling

Exception handling

Python for Data Science

Networking

Numpy

Scipy

Pandas

Matplotlib

Scikit learn

Page 5: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

MODULE NAME SEGMENT

Overview of the course

Intro to Data Analysis

Intro to R - Basics

Introduction

Data Frames

Lists

Conditions

Loops

Functions

Vectors

Matrices

Factors

Apply functions

R ANALYTICS

Utility functions

Regular Expressions

Time and Date

Basic R Programming

Tidyverse

Types of visualizations

Data wrangling

Data visualization

Grouping and summarizing

Page 6: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

Importing data from flat files with utils

readr & data.table

Importing Excel data

Reproducible Excel work with XLConnectImporting Data in R (Part 1)

Importing data from databases

Importing data from the web

Importing data from statistical so�ware packages

Ticket Sales Data

MBTA Ridership Data

World Food Facts

School Attendance Data

Introduction to dplyr and tbls

Select and mutate

Filter and arrange

Summarize and the pipe operator

Group_by and working with databases

Cleaning Data

Case Study and Assignment 1

dplyr: Data Manipulation

Introduction and exploring raw data

Tidying data

Preparing data for analysis

Putting it all together

Page 7: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

Mutating joins

Filtering joins and set operations

Assembling data

Advanced joining

dplyr: Joining Data

Case study

qplot and wrap-up

Tweets across the United States

Shakespeare gets Sentimental

Analyzing TV News

Singing a Happy Song (or Sad?!)

Language of data

Study types and cautionary tales

Sampling strategies and experimental design

Case study

Exploring Categorical Data

Exploring Numerical Data

Numerical Summaries

Case Study

ggplot 2: Data Visualization

Case Study and Assignment 2: Sentiment Analysis

Data and R

Exploratory Data Analysis

Introduction

Data

Aesthetics

Geometries

Page 8: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

Data cleaning and summarizing with dplyr

Data visualization with ggplot2

Tidy modeling with broom

Joining and tidying

Case Study and Assignment 3: Exploratory Data Analysis

Authoring R Markdown Reports

Embedding Code

Compiling Reports

Configuring R Markdown (optional)

Reporting with R Markdown

Page 9: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

MODULE NAME SEGMENT

Multiple Linear Regression

Simple Linear Regression

Polynomial Regression

Logistic Regression

SVR

Regressions

Classification

Decision Tree

KNN

SVM

Kernel SVM

Naive Bayes

MACHINE LEARNING

Recommended System development

Natural Language Processing

Clustering

Decision Tree

Random Forest

Hierarchical

K means

Text Classification

Language Modeling and Sequence Tagging

Mini Assignments

Page 10: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

Introduction to Deep Learning and Neural Networks

Neural Networks

PCA/LDA

PCA

LDA

Kernel PCA

Tensorflow installation

Introduction to Tensorflow

Sequence to sequence tasks

Vector space modeling

Tensorflow basics

Artificial Neural Network

Convolution Neural Network

Page 11: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

MODULE NAME SEGMENT

Introduction to Big Data

Introduction to the Course

Need for Handling Big Data

Structure of Big Data

Storage Technique

Introduction to Big Data & Hadoop

Hadoop Ecosystem

Application of Big Data

Big Data - Impact on IT

Overview of Big data Solutions

HDFS

Architecture (HDFS)

BIG DATA ANALYTICS

Cloudera VM Installations

Hadoop Distributed File System

Hadoop Clusters

Hadoop Ecosystem

Cloudera VM Overview

Cloudera VM Installation

HDFS

HDFS Daemons

VM Player Installation

Single Node Cluster Installation and Setup

Multi Node Cluster Installation and Setup

Page 12: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

Writing Files to HDFS

Re-replicating Missing replicas

Channel and Sink

Practice

Map Reduce Framework

Flow

Flow Example

Practice

Practice: Mapper

Practice: Reducer

Combiner and Practitioner

Job Cycle

Failures

Advanced Map Reduce

HDFS Shell Command

Data Ingestion Technique: Flume

Data Ingestion Technique: Map Reduce

Practice

Introduction

Architecture

Aggression Flow

Checkpoints and Journals

Data Node Startup

Data Node Heartbeats

HDFS Shell Command

Page 13: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

Practice: Reducer

Combiner and Practitioner

Input and Output Format

YARN

Data Modelling

Custome Input Format

Apache HIVE

Apache HIVE - Deep Concepts

Customising Table Storage Formats

Case Study

Partition

HIVE Positioning and Bucketing

Bucketing

HIVE-Joints

User Defined Functions

HIVE Shell

PIG Running

Apache PIG

Features

Data Modelling

Diagnostic Tools

PIG-Joints

HIVE

PIG

Page 14: Duration : 6 Months · Scikit learn. MODULE NAME SEGMENT Overview of the course Intro to Data Analysis ... Big Data & Hadoop Hadoop Ecosystem Application of Big Data Big Data - Impact

Use Cases

Commands

NoSQL

HBase

Hadoop vs NoSQL

Case Study

Sqoop

Introducing Hadoop with Other Systems

Overview of Connectors

Case Study

Oozie

Zookeeper

YARN

Cloudera Manager

Overview of Spark, Impala, Ambari, HCatalog

Real life Case Scenarios & Discussion

Need for NoSQL Database

Data Integration in Hadoop

Data Administration in Hadoop

Recent Advancements