the open analytics platform - files.knime.com - bw... · •special port for pmml models...

64
© 2016 KNIME.com AG. All Rights Reserved. What’s new Bernd Wiswedel

Upload: buitruc

Post on 07-Mar-2018

218 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved.

What’s new

Bernd Wiswedel

Page 2: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 2

What’s new…

• 2+1 feature releases last year: 2.12, (3.0), 3.1 (only KNIME Analytics Platform + Server)

• Changes documented online…

Page 3: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 3

What’s new pages and YouTube

Page 4: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 4

Changelog …

Page 5: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 5

Version # new nodes / sets # features

v2.12 27 53

v3.0 3 28

v3.1 14 21

Version # new nodes / sets # features

v2.12 27 53

v3.0 3 28

v3.1 14 21

Changelog …

New nodes/features by version:

Page 6: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 6

Outline

Interactive feature demos

… by the developers

Page 7: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 7

Outline

• Analytics / Mining

• Statistics

• PMML - Standardizing predictive models

• Streaming Executor

• KNIME Server

• KNIME Productivity Extension: Workflow Diff

Page 8: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 8

Analytics / Mining

Page 9: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 9

Analytics / Mining

• Trees / Forest / Ensembles

• Active Learning

• R Integration

Page 10: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 10

Trees / Forest / Ensembles

• Demo

Page 11: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 11

Active Learning

• Labs Extension

• Involve user to construct training data set

• Workflow loop to query and label ‘interesting’ data points

• Used user-labeled data set on remaining data

Page 12: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 12

Active Learning (example from Node.Pedia)

Page 13: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 13

R Integration

• Rewrite of infrastructure

– Significantly faster

– Concurrent execution

• No change of usage model

Page 14: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 14

Statistics

Christian Dietz

Page 15: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 15

Statistics Nodes

What is it about?

• Several new useful statistic nodes in KNIME.

• Thanks to Bob Muenchen (University of Tennessee).

• Work in progress! We are still adding nodes.

Page 16: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 16

Statistics Nodes

Parametric and nonparametric hypothesis testing

Linear Discriminant Analysis

Page 17: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 17

Statistics Nodes

Cronbach Alpha

Rank Correlation

Page 18: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 18

Statistics Nodes

Odds and Risk Ratios

And more to come…

Page 19: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 19

Statistics Nodes

ARIMA for time series analysis

• “Auto regressive integrated moving average”

• Several ARIMA related nodes in KNIME

Page 20: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 20

PMML - Standardizing Predictive Models

Alexander Fillbrunn

Page 21: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 21

What is PMML?

• Predictive Model Markup Language

• XML based standard for predictive models

• KNIME can export most of its models as PMML

• To consume 3rd-party models, a scoring engine such as Zementis Adapa/UPPI is more suitable

Page 22: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 22

PMML Creation in KNIME

• Special port for PMML models

• Supported by most KNIME learners

– Decision Trees, Neural Nets, …

– Ensembles

• Also used for Preprocessing

– Normalizing, Binning, Missing Values, …

• Modular PMML

– Built step by step parallel to the data flow

Page 23: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 23

Demo: Modular PMML

Page 24: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 24

Decision Tree to Ruleset

• Transforms a decision tree to a PMML ruleset model

– Easier to interpret

• Also outputs rules as a KNIME table

– Easier to export & deploy

– Can be manipulated using standard KNIME nodes

Page 25: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 25

Applying Rulesets

• New node: Rule Engine (Dictionary)

– Input: data and ruleset table

– Output: Results and optional PMML model

• Import rules from other sources

• Mix rules from multiple sources

Page 26: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 26

Streaming Executor

Martin Horn

Page 27: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 27

Streaming

• Default Execution

Page 28: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 28

Streaming

• Streaming Execution

Page 29: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 29

Streaming

• Row-wise

• Process, pass & forget → Faster with less I/O overhead

• Concurrent execution

Page 30: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 30

Streaming

• Demo

Page 31: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 31

Streaming - Demo

Page 32: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 32

Streaming - Demo

Page 33: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 33

Streaming - Demo

Page 34: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 34

Streaming - Demo

Page 35: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 35

Streaming - Demo

Page 36: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 36

Streaming - Demo

Page 37: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 37

Streaming – Pros and Cons

Advantages

• Less I/O overhead (process, pass & forget)

• Parallelization

Disadvantages

• No intermediate results, no interactive execution

• Not all nodes can be streamed

Page 38: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 38

Streaming – Streamed Nodes

• More than 100 Nodes

• Text Processing Nodes

• Image Processing Nodes

• …

Page 39: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 39

Streaming – How-to?

1. Streaming Execution Plugin (Labs) limitations: loops, metanodes

2. Wrapped Metanode

3. Set ‘Simple Streaming’-Job Manager

Page 40: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 40

KNIME Server … and related

Thorsten Meinl

Page 41: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 41

Overview

• MongoDB and JSON

• WebPortal templates

• Advanced scheduled execution

• RESTful webservice interface

• Automation by calling other workflows

Page 42: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 42

MongoDB and JSON (I)

• MongoDB is a NoSQL database based on JSON

• Special set of nodes – due to lack of a standard SQL interface

Page 43: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 43

MongoDB and JSON (II)

• JSON nodes for working with JSON data

– Similar to the XML nodes

• Use combination of MongoDB and JSON nodes

Page 44: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 44

WebPortal templates (I)

• Layout can be configured by templates

– Footer & header

– Main panel

– Login page

– …

• Custom stylesheet and common JS libraries

• Custom templates are part of the configuration and are not overridden by server updates

Page 45: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 45

WebPortal templates (II)

Page 46: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 46

Advanced scheduled execution (I)

• Requires KNIME Server(Space) 4.2

• Allows setting filters for repeating jobs – By day-of-week

– By day-of-month

– By month

• Skip execution if previous job is still running (optional)

• Respects daylight saving for daily schedules

• Scheduled jobs can be edited and disabled

Page 47: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 47

Advanced scheduled execution (II)

Page 48: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 48

RESTful webservice interface (I)

• Main addition to KNIME Server 4.1

• REST = Representational State Transfer – Communication based on HTTP

– Usually clear text (JSON, XML, …)

• Many possible clients – Web browser

– Java applications (e.g. via JAX-RS)

– KREST nodes :-)

• Goal: complete server interface based on REST

Page 49: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 49

RESTful webservice interface (II)

• Functionality currently exposed via REST

– Repository browsing

– Executing jobs • With optional input parameters

• Synchronously, asynchronously

• Retrieving results and/or report

– Show permissions

– Show all running jobs

• Read our corresponding blog posts for more details

Page 50: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 50

Automation by calling other workflows (I)

• Call Remote/Local Workflow nodes

• Called workflow parameterized via quickform nodes

– In principle every quickform node can be used for a REST-enabled workflow

Page 51: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 51

Automation by calling other workflows (II)

• This is how the JSON looks like… {

"hasReport" : false,

"inputParameters" : {

"model-number-20" : {

"integer" : 0

},

"data-record-1" : {

"Col0" : "A11",

"Col1" : 6,

"Col18" : "A192",

"Col19" : "A201"

}

},

"workflow" : "/Summit/workflows/Predictor Flow",

}

Page 52: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 52

Automation by calling other workflows (III)

• Calling a remote workflow

Page 53: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 53

Automation by calling other workflows (IV)

• Also works for local workflows via the Call Local Workflow node

– Part of the Personal Productivity Extensions

Page 54: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 54

Productivity Extensions: Workflow Diff

Ferry Abt

Page 55: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 55

KNIME WorkflowDiff

Page 56: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 56

KNIME WorkflowDiff

Page 57: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 57

KNIME WorkflowDiff

Page 58: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 58

KNIME WorkflowDiff

Page 59: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 59

KNIME WorkflowDiff

Page 60: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 60

KNIME WorkflowDiff

Page 61: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 61

Workflow Diff

• Demo

Page 62: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 62

Workflow Diff - Recap

• Identifies changes in workflow-structure

• Aligns workflows to identify differences

• Available for KNIME Server and KNIME Productivity Extensions

Page 63: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 63

Summary

• Questions / Interested in demo / comments? Talk to us in the breaks / at the booth

• Interested in What’s cooking? After lunch

• Interesting in Big Data? After lunch

Page 64: The Open Analytics Platform - files.knime.com - BW... · •Special port for PMML models •Supported by most KNIME learners –Decision Trees, Neural Nets, … –Ensembles •Also

© 2016 KNIME.com AG. All Rights Reserved. 64

The KNIME® trademark and logo and OPEN FOR INNOVATION® trademark are used by KNIME.com AG under license from KNIME GmbH, and are registered in the United States.

KNIME® is also registered in Germany.