Doing data science with F#

Tomas Petricek | @tomaspetricek | PhD Student at Cambridge & Coordinator of http://fsharp.org

Upload: tomas-petricek

Post on 18-Nov-2014


DESCRIPTION

The ability to take data, understand it, visualize it and extract useful information from it is becoming a hugely important skill. How can you turn all those logs, histories of purchases and trades, or open government data into useful information that helps your business make money? In this talk, we’ll look at doing data science using F#. The F# language is perfectly suited for this task: type providers integrate external data directly into the language, so your language suddenly _understands_ CSV, XML, JSON, REST services and other sources. The interactive development style makes it easy to explore data and test your algorithms as you’re writing them. A rich set of libraries for working with data frames, time series and visualization gives you all the tools you need. And finally, F# integrates easily with statistical environments like R and MATLAB, giving you access to the industry-standard libraries.

TRANSCRIPT

Page 1: Doing data science with F#

Doing data science with F#

Tomas Petricek | @tomaspetricek

PhD Student at Cambridge & Coordinator of http://fsharp.org

Page 2: Doing data science with F#

F# Software Foundation

http://www.fsharp.org

software stacks

trainings teaching F# user groups snippets

mac and linux community books and tutorials

consulting open-source MonoDevelop

contributions research support cross-platform mailing lists

Page 3: Doing data science with F#

Community matters!

Page 4: Doing data science with F#

All the Data of the World

Page 5: Doing data science with F#

kaggle vega grammar data sources presentation

R provider visualization

F# Data Science Working Group

www.fslab.org

data acquisition

statistics data cleaning machine learning

data transformation visualization type providers

time-series data aggregation

Page 6: Doing data science with F#

Acquire

Analyze

Visualize

Page 7: Doing data science with F#

Demo: Analyzing Titanic survivors

Page 8: Doing data science with F#

Deedle data frame

Data exploration

Indexing and aggregation

F# Charting library

Simple & composable

Interactive style

www.fslab.org
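
The "indexing and aggregation" point can be illustrated with a minimal sketch. It assumes the Deedle NuGet package and F# Interactive (`dotnet fsi`); the `Passenger` record and its values are made up for illustration and are not the demo's Titanic data.

```fsharp
#r "nuget: Deedle"
open Deedle

// Hypothetical record type and made-up rows standing in for a real data set.
type Passenger = { Pclass: int; Age: float }

let passengers =
    Frame.ofRecords
        [ { Pclass = 1; Age = 20.0 }
          { Pclass = 1; Age = 40.0 }
          { Pclass = 3; Age = 10.0 }
          { Pclass = 3; Age = 20.0 }
          { Pclass = 3; Age = 30.0 } ]

// Group rows by class (this builds a two-level row index),
// then average the Age column over the first level of that index.
let meanAgeByClass : Series<int, float> =
    passengers
    |> Frame.groupRowsByInt "Pclass"
    |> Frame.getCol "Age"
    |> Stats.levelMean fst

printfn "%A" (meanAgeByClass |> Series.observations |> List.ofSeq)
```

Evaluating the script one expression at a time in F# Interactive is what the slide calls the interactive style: each intermediate frame or series can be inspected before the next step is written.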

Page 9: Doing data science with F#

Demo: Understanding the world

Page 10: Doing data science with F#

F# Data type providers

First-class data

CSV, REST, WorldBank…

R Type provider

Statistics & visualization

5000 tested packages

www.fslab.org
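
What "first-class data" means for CSV can be sketched as follows. The sketch assumes the FSharp.Data NuGet package and F# Interactive; the inline sample, the type name `Passengers` and every row are made up for illustration and do not come from the talk's demo.

```fsharp
#r "nuget: FSharp.Data"
open FSharp.Data

// Made-up inline sample; the provider infers column names and types from it
// at compile time (a file path or URL to a sample could be used instead).
[<Literal>]
let Sample = "Name,Pclass,Survived\nAlice,1,true\nBob,3,false"

type Passengers = CsvProvider<Sample>

// Parse another made-up document that has the same shape as the sample.
let data =
    Passengers.Parse("Name,Pclass,Survived\nCarol,2,true\nDave,3,false\nEve,3,true")

// Columns are ordinary typed properties: Name is a string,
// Pclass is an int and Survived is a bool.
let survivors =
    data.Rows
    |> Seq.filter (fun row -> row.Survived)
    |> Seq.map (fun row -> row.Name, row.Pclass)
    |> List.ofSeq

printfn "%A" survivors
```

Because the columns are checked by the compiler, a misspelled column name or a wrong type is reported while editing the script, before it runs.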

Page 11: Doing data science with F#

Demo: US debt over the last century

Page 12: Doing data science with F#

Deedle data frame

Time-series alignment

Data transformations

Vega visualization

F# wrapper for Vega

Pre-alpha version

www.fslab.org
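
Time-series alignment can be sketched with two toy series. The sketch assumes the Deedle NuGet package and F# Interactive; the names `debt` and `gdp` and all values are made up for illustration and are not the figures used in the demo.

```fsharp
#r "nuget: Deedle"
open Deedle

// Toy yearly series with made-up values; the keys only partly overlap.
let debt = series [ 1990 => 1.0; 2000 => 2.0; 2010 => 3.0 ]
let gdp  = series [ 2000 => 4.0; 2010 => 6.0; 2020 => 8.0 ]

// Arithmetic on series aligns observations by key; a year that is
// present in only one of the inputs becomes a missing value.
let ratio = debt / gdp

// Keep only the years for which both inputs had a value.
let aligned = ratio |> Series.dropMissing

// Putting both series into one frame aligns them on the union of their keys.
let both = Frame.ofColumns [ "Debt" => debt; "GDP" => gdp ]

printfn "%A" (aligned |> Series.observations |> List.ofSeq)
printfn "%d rows in the combined frame" both.RowCount
```

The alignment happens on the keys, so the two inputs do not need to cover the same years or have the same length.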

Page 13: Doing data science with F#

F# for Data Science

acquire, analyze, visualize

interactive experience

safety and efficiency of .NET

ready for production

@tomaspetricek

Page 14: Doing data science with F#

Going forward

Use #fsharp for fun & profit

Join local user groups

Help us build data science tools

fsharp.org | fslab.org | tomasp.net

@tomaspetricek