data science & machine learning using python

8
Data Science & Machine Learning Using Python l Introduction To Python Why Python Application areas of python Python implementations Cpython Jython Ironpython Pypy Python versions Installing python Python interpreter architecture Python byte code compiler Python virtual machine(pvm) l l l l l Python l l l l l l l Writing and Executing First Python Program Using interactive mode Using script mode General text editor and command window Idle editor and idle shell Understanding print() function How to compile python program explicitly l l l l l l Python Language Fundamentals Character set Keywords Comments Variables Literals Operators Reading input from console Parsing string to int, float l l l l l l l l Python Conditional Statements If statement If else statement If elif statement If elif else statement Nested if statement l l l l l Looping Statements While loop For loop Nested loops Pass, break and continue keywords l l l l Standard Data Types Int, float, complex, bool, nonetype Str, list, tuple, range Dict, set, frozenset l l l Duration: 4.5 Months

Upload: others

Post on 01-Oct-2021

14 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: data science & machine Learning Using Python

Data Science & Machine LearningUsing Python

l�

Introduction To Python

Why PythonApplication areas of pythonPython implementations Cpython Jython Ironpython PypyPython versionsInstalling pythonPython interpreter architecture Python byte code compiler Python virtual machine(pvm)

l�

l�

l�

l�

l�

Python

l�

l�

l�

l�

l�

l�

l�

Writing and Executing First Python Program

Using interactive modeUsing script mode General text editor and command window Idle editor and idle shell Understanding print() function How to compile python program explicitly

l�

l�

l�

l�

l�

l�

Python Language Fundamentals

Character setKeywordsCommentsVariablesLiteralsOperatorsReading input from consoleParsing string to int, float

l�

l�

l�

l�

l�

l�

l�

l�

Python Conditional Statements

If statementIf else statementIf elif statementIf elif else statementNested if statement

l�

l�

l�

l�

l�

Looping Statements

While loopFor loopNested loopsPass, break and continue keywords

l�

l�

l�

l�

Standard Data Types

Int, float, complex, bool, nonetypeStr, list, tuple, rangeDict, set, frozenset

l�

l�

l�

Duration: 4.5 Months

Page 2: data science & machine Learning Using Python

l�

String Handling

What is stringString representationsUnicode stringString functions, methodsString indexing and slicingString formatting

l�

l�

l�

l�

l�

l�

Python List

Creating and accessing listsIndexing and slicing listsList methodsNested listsList comprehension

l�

l�

l�

l�

l�

Python Tuple

Creating tupleAccessing tupleImmutability of tuple

l�

l�

l�

Python Set

How to create a setIteration over setsPython set methodsPython frozenset

l�

l�

l�

l�

Python Dictionary

Creating a dictionaryDictionary methodsAccessing values from dictionaryUpdating dictionaryIterating dictionaryDictionary comprehension

l�

l�

l�

l�

l�

l�

Python Functions

Defining a functionCalling a functionTypes of functionsFunction arguments Positional arguments, keyword arguments Default arguments, non-default arguments Arbitrary arguments, keyword arbitrary argumentsFunction return statementNested functionFunction as argumentFunction as return statementDecorator functionClosureMap(), filter(), reduce(), any() functionsAnonymous or lambda function

l�

l�

l�l�

l�

l�

l�

l�

l�

l�

l�

l�

l�

l�

l�

Modules & Packages

Why modulesScript v/s moduleImporting moduleStandard v/s third party modulesWhy packagesUnderstanding pip utility

l�

l�

l�

l�

l�

l�

File I/O

Introduction to file handlingFile modesFunctions and methods related to file handlingUnderstanding with block

l�

l�

l�

Page 3: data science & machine Learning Using Python

l�

Object Oriented Programming

Procedural v/s object oriented programmingOOP principlesDefining a class & object creationObject attributesInheritanceEncapsulationPolymorphism

l�

l�

l�

l�

l�

l�

l�

Exception Handling

Difference between syntax errors and exceptionsKeywords used in exception handling try, except, finally, raise, assertTypes of except blocks

l�

l�

l�

l�

Regular Expressions(Regex)

Need of regular expressionsRe moduleFunctions /methods related to regexMeta characters & special sequences

l�

l�

l�

GUI Programming

Introduction to tkinter programmingTkinter widgets Tk, label, Entry, Textbox, Button Frame, messagebox, filedialog etcLayout managersEvent handlingDisplaying image

l�

l�

l�

l�

l�

l�

l�

Multi-Threading Programming

Multi-processing v/s Multi-threadingNeed of threadsCreating child threadsFunctions /methods related to threadsThread synchronization and locking

l�

l�

l�

l�

l�

Introduction to Database

Database Concepts What is Database Package? Understanding Data StorageRelational Database (RDBMS) Concept

l�

l�

SQL

l�

l�SQL basicsDML, DDL & DQLDDL: create, alter, dropSQL constraints: Not null, unique, Primary & foreign key, composite key Check, defaultDML: insert, update, delete and mergeDQL : selectSelect distinctSQL whereSQL operatorsSQL likeSQL order bySQL aliasesSQL viewsSQL joins Inner join Left (outer) join Right (outer) join Full (outer) join

l�l�

SQL (Structured Query Language)

l�

l�

l�

l�

l�

l�l�l�l�l�l�l�l�l�

l�

l�

l�

l�

l�

Page 4: data science & machine Learning Using Python

l�

l�Mysql functions String functions Char_length Concat Lower Reverse Upper Numeric functions Max, min, sum Avg, count, abs Date functions Curdate Curtime Now

l�

l�

l�

l�

l�

l�

l�

l�

Introduction to Statistics

Sample or populationMeasures of central tendency Arithmetic mean Harmonic mean Geometric mean Mode Quartile First quartile Second quartile(median) Third quartile Standard deviation

Statistics, Probability & Analytics:

l�

l�

l�

l�

l�

l�

l�l�

l�

l�

l�

l�

l�

l�

l�

l�

Probability Distributions

Introduction to probabilityConditional probabilityNormal distributionUniform distributionExponential distributionRight & left skewed distributionRandom distributionCentral limit theoreml�

l�

l�

l�

l�

l�

l�

l�

l�

l�

l�

Hypothesis Testing

Normality testMean test T-test Z-test ANOVA testChi square testCorrelation and covariance

l�

l�

l�

l�

l�

l�

l�

l�

l�

Numpy Package

Difference between list and numpy arrayVector and matrix operationsArray indexing and slicing

l�

l�

l�l�

l�l�

l�

Introduction to pandas

Labeled and structured dataSeries and dataframe objects

l�

l�

Pandas Package

Page 5: data science & machine Learning Using Python

l�

How to load datasets

From excelFrom csvFrom html table

l�

l�

l�

Accessing data from Data Frame

at & iatloc & ilochead() & tail()

l�

l�

l�

Exploratory Data Analysis (EDA)

describe()groupby()crosstab()boolean slicing / query()

l�

l�

l�

l�

Data Manipulation & Cleaning

Map(), apply()Combining data framesAdding/removing rows & columnsSorting dataHandling missing valuesHandling duplicacyHandling data error

l�

l�

l�

l�

l�

l�

Handling Date and Time

Data Visualization using matplotlib and seaborn packages

l�Scatter plot, lineplot, bar plotHistogram, pie chart, Jointplot, pairplot, heatmapOutlier detection using boxplot

l�

l�

l�

l�

Introduction To Machine Learning

Traditional v/s Machine Learning ProgrammingReal life examples based on MLSteps of ML ProgrammingData Preprocessing revisedTerminology related to ML

l�

l�

Machine Learning:

l�

l�

l�

Supervised Learning

Classification Regressionl�

l�

KNN Classification

Math behind KNNKNN implementationUnderstanding hyper parameters

l�

l�

l�

Performance metrics

Math behind KNNKNN implementationUnderstanding hyper parameters

l�

l�

l�

Unsupervised Learning

clustering

Page 6: data science & machine Learning Using Python

l�

Regression

Math behind regressionSimple linear regressionMultiple linear regressionPolynomial regressionBoston price predictionCost or loss functions Mean absolute error Mean squared error Root mean squared error Least square errorRegularization

l�

l�

l�

l�

l�

l�

l�

l�

l�

l�

l�

Logistic Regression for classification

Theory of logistic regressionBinary and multiclass classificationImplementing titanic datasetImplementing iris datasetSigmoid and softmax functions

l�

l�

l�

l�

l�

Support Vector Machines

Theory of SVMSVM Implementationkernel, gamma, alpha

l�

l�

l�

Decision Tree Classification

Theory of decision treeNode splittingImplementation with iris datasetVisualizing tree

l�

l�

l�

l�

Ensemble Learning

Random forestBagging and boostingVoting classifier

l�

l�

l�

Model Selection Techniques

Cross validationGrid and random search for hyper parameter tuningl�

l�

Recommendation System

Content based techniqueCollaborative filtering techniqueEvaluating similarity based on correlationClassification-based recommendations

l�

l�

l�

l�

Clustering

K-means clusteringHierarchical clusteringElbow techniqueSilhouette coefficientDendogram

l�

l�

l�

l�

l�

Text Analysis

Install nltkTokenize wordsTokenizing sentencesStop words customizationStemming and lemmatizationFeature extractionSentiment analysisCount vectorizerTfidfvectorizerNaive bayes algorithms

l�

l�

l�

l�

l�

l�

l�

l�

l�

Page 7: data science & machine Learning Using Python

l�

Dimensionality Reduction

Principal component analysis(pca)

l�

Open CV

Reading imagesUnderstanding gray scale imageResizing imageUnderstanding haar classifiersFace, eyes classificationHow to use webcam in open cvBuilding image data setCapturing videoFace classification in videoCreating model for gender prediction

l�

l�

l�

l�

l�

l�

l�

l�

l�

Tableau - Home

Tableau - overviewTableau - environment setupTableau - get startedTableau - navigationTableau - design flowTableau - file typesTableau - data typesTableau - show meTableau - data terminology

l�

l�

Tableau

l�

l�

l�

l�

l�

l�

l�

l�

Tableau - Data Sources

Tableau - custom data viewTableau - data sourcesTableau - extracting dataTableau - fields operationsTableau - editing metadataTableau - data joiningTableau - data blending

l�

l�

l�

l�

l�

l�

l�

Tableau – Work Sheet

Tableau - add worksheetsTableau - rename worksheetTableau - save & delete worksheetTableau - reorder worksheetTableau - paged workbook

l�

l�

l�

l�

l�

Tableau – Calculation

Tableau - operatorsTableau - functionsTableau - numeric calculationsTableau - string calculationsTableau - date calculationsTableau - table calculationsTableau - lod expressions

l�

l�

l�

l�

l�

l�

l�

Tableau – Sorting & Filter

Tableau - basic sortingTableau - basic filtersTableau - quick filtersTableau - context filtersTableau - condition filtersTableau - top filtersTableau - filter operations

l�

l�

l�

l�

l�

l�

Page 8: data science & machine Learning Using Python

l�

Tableau - Charts

Tableau - bar chartTableau - line chartTableau - pie chartTableau - crosstabTableau - scatter plotTableau - bubble chartTableau - bullet graphTableau - box plotTableau - tree mapTableau - bump chartTableau - gantt chartTableau - histogramTableau - motion chartsTableau - waterfall chartsTableau - dashboard

l�

l�

l�

l�

l�

l�

l�

l�

l�

l�

l�

l�

l�

l�

l�

Projects

One project using python & sqlOne project using python & mlOne dashboard using tableau

l�

l�

l�

Partners :

PITAMPURA (DELHI)NOIDAA-43 & A-52, Sector-16,

GHAZIABAD1, Anand Industrial Estate, Near ITS College, Mohan Nagar, Ghaziabad (U.P.)

GURGAON1808/2, 2nd floor old DLF, Near Honda Showroom, Sec.-14, Gurgaon (Haryana)

SOUTH EXTENSION

www.facebook.com/ducateducation

Java

Plot No. 366, 2nd Floor, Kohat Enclave, Pitampura, ( Near- Kohat Metro Station)Above Allahabad Bank, New Delhi- 110034.

Noida - 201301, (U.P.) INDIA 70-70-90-50-90 +91 99-9999-3213 70-70-90-50-90 70-70-90-50-90

70-70-90-50-90

70-70-90-50-90

D-27, South Extension-1New Delhi-110049

+91 98-1161-2707

(DELHI)