docker datascience pipeline · save docker image(s) to docker registry move serialized model to...

Docker Datascience Pipeline

Running datascience models on Docker at ING BIG DATA Conference Vilnius 2018

• Lennard Cornelis

• Big Data Engineer at ING

• DB2, Oracle, AIX, Linux, Hadoop, Hive, Sqoop, Ansible

• Let it all work on the Exploration Environment

• @chiefware

Lets introduce myself

Asking yourself how to get to production

Depending on different tools

Why I think you are here

You are running Datascience models

Interested in Docker

Asking yourself how to get to production

What is Docker

Docker is a tool designed to make it easier to create, deploy, and run applications by using containers. Containers allow a developer to package up an application with all of the parts it needs, such as libraries and other dependencies, and ship it all out as one package. By doing so, thanks to the container, the developer can rest assured that the application will run on any other Linux machine regardless of any customized settings that machine might have that could differ from the machine used for writing and testing the code

Agile way of working in Squads,Chapters and Tribes

big data prototype environment

Gitlab runner

Citrix Server

Dev node

Hadoop

Artifactory

Gitlab

Automation Server

Access to cluster rdp, browser, putty

Node for datascientists with tools and xrdp

Datanode cluster

Pip repo

Datasciences projects

Details about processServer

Pipeline

PrototypingDevelopment

TestAcceptance Production

Docker on big data prototype environment

Gitlab runner

Citrix Server

Dev node

Hadoop

Openshift

POD/JOB

Artifactory

Gitlab

Automation Server

Access to cluster rdp, browser, putty

Node for datascientists with tools and xrdp

Datanode cluster

Orchestration Containers

docker containers

Docker base Images

Docker files and Datasciences projects

Netezza

Rapid experimentation

Live modelDeploymen

Code versioning

Build Docker image(s) through Gitlab CI/Jenkins

Put serialized model in shared

docker datascience pipeline · save docker image(s) to docker registry move serialized model to...

Documents

docker docker docker

datascience introduction websci summer school 2014

part1: quest for datascience 101

serialized inventorycs

ink serialized format specification - download.microsoft.com

datascience&ai - cuti.org.uy

datascience& machine learning - electrocloud labs

unlinkability: a+datascience+perspec5ve+ ·...

docker docker docker chef

winning the i-com datascience hackathon...

splunk for datascience (.conf2014)

2012 production totals - great eastern cutlery · bark...

exploratory analysis part1 coursera datascience...

ticketmaster datascience

garuda robotics x datascience sg meetup (sep 2015)

docker 101 - developermarch€¦ · -docker machine -docker...

ink serialized format (isf) specification

asegúr@it 7: serialized sql injection

docker docker - docker security - docker

realizing)the potential)of)data science ·...