azure ml studio

71
Azure ML Studio Overview for Data Engineers & Data Scientists

Upload: lynguyet

Post on 04-Feb-2017

226 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Azure ML Studio

Azure ML StudioOverview for Data Engineers

& Data Scientists

Page 2: Azure ML Studio

Rakesh Soni, Big Data Practice Director

Randi R. Ludwig, Ph.D., Data Scientist

Daniel Lai, Data Scientist

Page 3: Azure ML Studio

Intersys Company Summary

Privately held IT services firm 150+ consultants spanning the full IT space Leverage a local, national and/or global model as

appropriate for each customer and engagement

3

Retail and Hospitality Financial Services Healthcare High Tech

Manufacturing Media and Advertising

Overview

Key Industries Core Values Be Accountable Bring Excellence Be Authentic Be in Service to

Others

NYC

AustinPhoenix

Page 4: Azure ML Studio

Core Practice Capabilities

4

Big Data & Analytics

Big Data

Enterprise Search

Business Intelligence

Analytics & Data Science

Information Management

Application ServicesApplication

Development

Modern Web

Mobility

Cloud

DevOps / Agile

Technology Staffing

Infrastructure

Project Management

Packaged Solutions

Quality Assurance

Cloud

AssessmentProject & Program Management Quality Assurance Strategy &

Roadmap

Page 5: Azure ML Studio

Agenda

5

Intersys Overview

Machine Learning

Azure Machine Learning Studio

Main Features of Azure Machine Learning Studio

Demo 1 – Predicting Income Category

Demo 2 – Predicting Patient Readmission

Q&A

Page 6: Azure ML Studio

Machine Learning – What Is It?

Page 7: Azure ML Studio

Machine Learning All Around Us

Strategic Game PlayReinforced learning to take actions with the highest reward

Autonomous TechnologyTraining computers to be make intelligent and intuitive decisions

7

Credit: Google Deep Mind, https://www.youtube.com/watch?v=vFr3K2DORc8

Credit: BBC News, http://www.bbc.com/news/technology-37117831

Page 8: Azure ML Studio

Machine Learning in Real Life

8

Optimize Business Decisions Real time insights into customer behavior

Credit: UiBS Microsoft Partnerhttp://www.uibs.net/uibscloud-cloud-circle-partner

Page 9: Azure ML Studio

Machine Learning – How Does it Work?

9

• Data: examples used to train and validate model

• Model: the system that makes predictions or classifications

• Parameters: the signals or factors used by the model to form its decisions

• Learner: the system that adjusts the parameters — and in turn the model —by looking at differences in

predictions versus actual outcome.

Credit: Google Research, "Machine Learning 101"Reported at: https://martechtoday.com/how-machine-learning-works-150366

Page 10: Azure ML Studio

Azure Machine Learning Studio

Page 11: Azure ML Studio

Azure ML Studio – What is it?

Collaborative, drag-and-drop, fully managed cloud platform Build, test, and deploy

predictive analytics Publish models as web

services to be consumed by custom apps or BI tools.

11

Page 12: Azure ML Studio

Azure ML Studio Public Gallery

12

Add picture of gallery

Don’t want to build a machine learning model from scratch? Avoid reinventing the wheel by browsing the public gallery for existing models that meet your needs.

Page 13: Azure ML Studio

Azure ML Studio Workspace

13

To have full control, open your own Experiment.For any part of the process you can use built in features. Find these via the search bar or browse the categories shown.Once you’ve found the module you need, just drag and drop onto the workspace.

Page 14: Azure ML Studio

Data Import And Export Options

14

Bring in data in many forms. Export wherever you need it.

Page 15: Azure ML Studio

Easy To Create And Visualize Workflow

15

Easy to follow workflow for creating and sharing.As you’re creating your model, you can see dependencies in your process.When sharing your results, the workflow tells your story.

Page 16: Azure ML Studio

Dataset Statistics At Every Step

16

Includes quick column statistics on your whole datasetOnce your data is loaded into Azure ML, you can explore overall distributions of each feature.You can also quickly identify columns with many missing values.

Page 17: Azure ML Studio

Algorithm Selection

17

Accuracy Training time Linearity ParametersLogistic Regression ● ● 5Support Vector Machine* ○ ● 5Neural Network ● 9Boosted decision tree† ● ○ 6Decision forest ● ○ 6

● Good performance

○ Moderate performance

Classification

Accuracy Training time Linearity ParametersLogistic Regression ● ● 4Bayesian Linear Regression ○ ● 2Neural Network ● 9Boosted decision tree† ● ○ 5Decision forest ● ○ 6

Regression

Accuracy Training time Linearity ParametersPrincipal Component Analysis ○ ● 3K-means Clustering ○ ● 4

Anomaly Detection

* Good for large feature sets

† Large memory footprint

Credit: Brandon Rohrer, azure.microsoft.com

Page 18: Azure ML Studio

Publish Output As A Web Service and BI Visualization

18

Page 19: Azure ML Studio

Azure Machine Learning Demo

Page 20: Azure ML Studio

Demo 1 - Predicting Income Category

20

Azure ML Tour

Page 21: Azure ML Studio

Demo 2 – Predicting Patient Readmission

21

Page 22: Azure ML Studio

Azure Machine Learning Studio Summary

Page 23: Azure ML Studio

Azure ML – Summary

Interactive & visual workspace Various data sources supported: SQL Server, HIVE tables, CSV

file etc. Dataset statistics and easy exploration Data cleaning & transformation Many modeling algorithms included out of the box SQL/R/Python code can be included in workflow Many built-in ways to evaluate and compare models using

standard performance metrics

23

Page 24: Azure ML Studio

Resources

Microsoft's provided documentation is quite thorough and helpful:https://azure.microsoft.com/en-us/documentation/services/machine-learning/

24

Page 25: Azure ML Studio

Thank you. Any questions?

25

Page 26: Azure ML Studio

Q&APlease submit your questions into the chat field.

Page 27: Azure ML Studio

Demo Resource Slides

Page 28: Azure ML Studio

Demo 1: This is model that predicts whether a person earns > $50k.

28

Here we show how to build a model from scratch, including training the model, predicting values for the test set, and evaluating results.

Page 29: Azure ML Studio

Input your data

29

Page 30: Azure ML Studio

Summary statistics

30

Page 31: Azure ML Studio

How to Clean Missing Data

31

Page 32: Azure ML Studio

How to Limit the Number of Features in Your Model

32

Page 33: Azure ML Studio

Splitting into Training/Test Sets

33

Page 34: Azure ML Studio

Choosing a Model

34

Page 35: Azure ML Studio

Training the Model

35

Page 36: Azure ML Studio

Which Features are Most Predictive?

36

Page 37: Azure ML Studio

Which Features are Most Predictive?

37

Page 38: Azure ML Studio

Predicting Values for Test Data

38

Page 39: Azure ML Studio

Check Predictions (Scored Labels)

39

Page 40: Azure ML Studio

Evaluate Model

40

Page 41: Azure ML Studio

Check Model Accuracy, Etc.

41

Page 42: Azure ML Studio

End of Demo 1

42

Page 43: Azure ML Studio

Demo 2: This model predicts whether a patient will be readmitted to a hospital for further treatment.

43

This model considers a variety of strong machine learning algorithms. It then tunes the strongest model to be more efficient, evaluation of prediction results, and implements custom code in from R, Python, and SQL for further visualization and examination of data.

Page 44: Azure ML Studio

Explore the Dataset

44

Page 45: Azure ML Studio

Impute Missing Values

45

Page 46: Azure ML Studio

Compare Models: Decision Jungle

46

Page 47: Azure ML Studio

Compare Models: Boosted Decision Tree

47

Page 48: Azure ML Studio

Compare Models: Logistic Regression

48

Page 49: Azure ML Studio

Compare Models: Neural Network

49

Page 50: Azure ML Studio

Cross Validation Results: Variation in Accuracy for All Algorithms

50

Page 51: Azure ML Studio

ROC Chart comparison for models

51

Page 52: Azure ML Studio

Move Forward with Most Accurate Model

52

Page 53: Azure ML Studio

How to Cross Validate a Tuned Model

53

Page 54: Azure ML Studio

Assign Folds

54

Page 55: Azure ML Studio

Set Model Parameter Ranges

55

Page 56: Azure ML Studio

Tune Model Parameters

56

Page 57: Azure ML Studio

Use Model to Predict Values for Test Set

57

Page 58: Azure ML Studio

Visualize Predicted Values

58

Page 59: Azure ML Studio

Permute Features to Find Most Important

59

Page 60: Azure ML Studio

Find Features Most Influential to the Model

60

Page 61: Azure ML Studio

Can Export Results for Multiple Uses

61

Page 62: Azure ML Studio

Export CSV for debugging in notebooks

62

Page 63: Azure ML Studio

Sample R Notebook

63

Page 64: Azure ML Studio

SQL scripts

64

Page 65: Azure ML Studio

65

Page 66: Azure ML Studio

Find optimal cutoff in R script

66

Page 67: Azure ML Studio

R result

67

Page 68: Azure ML Studio

Python script for further visualizations

68

Page 69: Azure ML Studio

Python Visualization

69

Page 70: Azure ML Studio

Evaluate Final Model

70

Page 71: Azure ML Studio

Results

71