azure ml studio

Post on 04-Feb-2017

226 Views

Category:

Documents

3 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Azure ML StudioOverview for Data Engineers

& Data Scientists

Rakesh Soni, Big Data Practice Director

Randi R. Ludwig, Ph.D., Data Scientist

Daniel Lai, Data Scientist

Intersys Company Summary

Privately held IT services firm 150+ consultants spanning the full IT space Leverage a local, national and/or global model as

appropriate for each customer and engagement

3

Retail and Hospitality Financial Services Healthcare High Tech

Manufacturing Media and Advertising

Overview

Key Industries Core Values Be Accountable Bring Excellence Be Authentic Be in Service to

Others

NYC

AustinPhoenix

Core Practice Capabilities

4

Big Data & Analytics

Big Data

Enterprise Search

Business Intelligence

Analytics & Data Science

Information Management

Application ServicesApplication

Development

Modern Web

Mobility

Cloud

DevOps / Agile

Technology Staffing

Infrastructure

Project Management

Packaged Solutions

Quality Assurance

Cloud

AssessmentProject & Program Management Quality Assurance Strategy &

Roadmap

Agenda

5

Intersys Overview

Machine Learning

Azure Machine Learning Studio

Main Features of Azure Machine Learning Studio

Demo 1 – Predicting Income Category

Demo 2 – Predicting Patient Readmission

Q&A

Machine Learning – What Is It?

Machine Learning All Around Us

Strategic Game PlayReinforced learning to take actions with the highest reward

Autonomous TechnologyTraining computers to be make intelligent and intuitive decisions

7

Credit: Google Deep Mind, https://www.youtube.com/watch?v=vFr3K2DORc8

Credit: BBC News, http://www.bbc.com/news/technology-37117831

Machine Learning in Real Life

8

Optimize Business Decisions Real time insights into customer behavior

Credit: UiBS Microsoft Partnerhttp://www.uibs.net/uibscloud-cloud-circle-partner

Machine Learning – How Does it Work?

9

• Data: examples used to train and validate model

• Model: the system that makes predictions or classifications

• Parameters: the signals or factors used by the model to form its decisions

• Learner: the system that adjusts the parameters — and in turn the model —by looking at differences in

predictions versus actual outcome.

Credit: Google Research, "Machine Learning 101"Reported at: https://martechtoday.com/how-machine-learning-works-150366

Azure Machine Learning Studio

Azure ML Studio – What is it?

Collaborative, drag-and-drop, fully managed cloud platform Build, test, and deploy

predictive analytics Publish models as web

services to be consumed by custom apps or BI tools.

11

Azure ML Studio Public Gallery

12

Add picture of gallery

Don’t want to build a machine learning model from scratch? Avoid reinventing the wheel by browsing the public gallery for existing models that meet your needs.

Azure ML Studio Workspace

13

To have full control, open your own Experiment.For any part of the process you can use built in features. Find these via the search bar or browse the categories shown.Once you’ve found the module you need, just drag and drop onto the workspace.

Data Import And Export Options

14

Bring in data in many forms. Export wherever you need it.

Easy To Create And Visualize Workflow

15

Easy to follow workflow for creating and sharing.As you’re creating your model, you can see dependencies in your process.When sharing your results, the workflow tells your story.

Dataset Statistics At Every Step

16

Includes quick column statistics on your whole datasetOnce your data is loaded into Azure ML, you can explore overall distributions of each feature.You can also quickly identify columns with many missing values.

Algorithm Selection

17

Accuracy Training time Linearity ParametersLogistic Regression ● ● 5Support Vector Machine* ○ ● 5Neural Network ● 9Boosted decision tree† ● ○ 6Decision forest ● ○ 6

● Good performance

○ Moderate performance

Classification

Accuracy Training time Linearity ParametersLogistic Regression ● ● 4Bayesian Linear Regression ○ ● 2Neural Network ● 9Boosted decision tree† ● ○ 5Decision forest ● ○ 6

Regression

Accuracy Training time Linearity ParametersPrincipal Component Analysis ○ ● 3K-means Clustering ○ ● 4

Anomaly Detection

* Good for large feature sets

† Large memory footprint

Credit: Brandon Rohrer, azure.microsoft.com

Publish Output As A Web Service and BI Visualization

18

Azure Machine Learning Demo

Demo 1 - Predicting Income Category

20

Azure ML Tour

Demo 2 – Predicting Patient Readmission

21

Azure Machine Learning Studio Summary

Azure ML – Summary

Interactive & visual workspace Various data sources supported: SQL Server, HIVE tables, CSV

file etc. Dataset statistics and easy exploration Data cleaning & transformation Many modeling algorithms included out of the box SQL/R/Python code can be included in workflow Many built-in ways to evaluate and compare models using

standard performance metrics

23

Resources

Microsoft's provided documentation is quite thorough and helpful:https://azure.microsoft.com/en-us/documentation/services/machine-learning/

24

Thank you. Any questions?

25

Q&APlease submit your questions into the chat field.

Demo Resource Slides

Demo 1: This is model that predicts whether a person earns > $50k.

28

Here we show how to build a model from scratch, including training the model, predicting values for the test set, and evaluating results.

Input your data

29

Summary statistics

30

How to Clean Missing Data

31

How to Limit the Number of Features in Your Model

32

Splitting into Training/Test Sets

33

Choosing a Model

34

Training the Model

35

Which Features are Most Predictive?

36

Which Features are Most Predictive?

37

Predicting Values for Test Data

38

Check Predictions (Scored Labels)

39

Evaluate Model

40

Check Model Accuracy, Etc.

41

End of Demo 1

42

Demo 2: This model predicts whether a patient will be readmitted to a hospital for further treatment.

43

This model considers a variety of strong machine learning algorithms. It then tunes the strongest model to be more efficient, evaluation of prediction results, and implements custom code in from R, Python, and SQL for further visualization and examination of data.

Explore the Dataset

44

Impute Missing Values

45

Compare Models: Decision Jungle

46

Compare Models: Boosted Decision Tree

47

Compare Models: Logistic Regression

48

Compare Models: Neural Network

49

Cross Validation Results: Variation in Accuracy for All Algorithms

50

ROC Chart comparison for models

51

Move Forward with Most Accurate Model

52

How to Cross Validate a Tuned Model

53

Assign Folds

54

Set Model Parameter Ranges

55

Tune Model Parameters

56

Use Model to Predict Values for Test Set

57

Visualize Predicted Values

58

Permute Features to Find Most Important

59

Find Features Most Influential to the Model

60

Can Export Results for Multiple Uses

61

Export CSV for debugging in notebooks

62

Sample R Notebook

63

SQL scripts

64

65

Find optimal cutoff in R script

66

R result

67

Python script for further visualizations

68

Python Visualization

69

Evaluate Final Model

70

Results

71

top related