pixie dust overview

9
©2016 IBM Corporation IBM Data Science Experience PixieDust: an Open Source Library that simplifies and improves Jupyter Python Notebooks

Upload: david-taieb

Post on 07-Apr-2017

219 views

Category:

Data & Analytics


0 download

TRANSCRIPT

Page 1: Pixie dust overview

©2016 IBM Corporation IBM Data Science Experience

PixieDust: an Open Source Library that simplifies and improves Jupyter Python

Notebooks

Page 2: Pixie dust overview

©2016 IBM Corporation IBM Data Science Experience

PixieDust: an Open Source Library that simplifies and improves Jupyter Python

Notebooks

Jupyter + Pixiedust = 1. PackageManager2. Visualizations3. Cloud Integration4. Scala Bridge5. Extensibility6. Embedded Apps

https://github.com/ibm-cds-labs/pixiedust

Page 3: Pixie dust overview

©2016 IBM Corporation IBM Data Science Experience

1/6 - Package ManagerInstall Spark packages or plain jars in your

Notebook Python kernel without the need to modify configuration file

Install GraphFrames Spark Package

Uses the GraphFrame Python APIs

Page 4: Pixie dust overview

©2016 IBM Corporation IBM Data Science Experience

2/6 - Visualizations

Call the Options dialog

Performance statistics

Panning/Zooming options

One simple API: display()

Page 5: Pixie dust overview

©2016 IBM Corporation IBM Data Science Experience

3/6 - Cloud IntegrationEasily export your data to csv, json, html, etc. locally on your laptop or into a cloud-based

service like Cloudant or Object Storage

Page 6: Pixie dust overview

©2016 IBM Corporation IBM Data Science Experience

4/6 - Scala BridgeExecute Scala code directly from your python Notebook

%%scalaval demo = com.ibm.cds.spark.samples.StreamingTwitterdemo.setConfig("twitter4j.oauth.consumerKey",”XXXXX")demo.setConfig("twitter4j.oauth.consumerSecret",”XXXXX")demo.setConfig("twitter4j.oauth.accessToken",”XXXXX")demo.setConfig("twitter4j.oauth.accessTokenSecret",”XXXXX")demo.setConfig("watson.tone.url","https://watsonplatform.net/tone-analyzer/api")demo.setConfig("watson.tone.password",”XXXXX")demo.setConfig("watson.tone.username",”XXXX”)

import org.apache.spark.streaming._demo.startTwitterStreaming(sc, Seconds(10))

pythonVar = “pixiedust”Define Python variable

println(pythonVar) Use the python var in Scala

val __fromScalaVar = “Hello from Scala” Define scala variable

print(__fromScalaVar) Use the scala var in Python

Page 7: Pixie dust overview

©2016 IBM Corporation IBM Data Science Experience

5/6 - ExtensibilityEasily extend PixieDust to create your own visualizations using HTML/CSS/JavaScript

Customized Visualization for GraphFrame Graphs

Page 8: Pixie dust overview

©2016 IBM Corporation IBM Data Science Experience

6/6 - Embed Apps in NotebooksEncapsulate your analytics into compelling User

Interfaces better suited for Line of Business Usersfrom pixiedust_twitterdemo import *twitterDemo()

Page 9: Pixie dust overview

©2016 IBM Corporation IBM Data Science Experience

https://github.com/ibm-cds-labs/pixiedust