Apache Flink Community Update, March 2015

Robert Metzger (rmetzger@apache.org)

Post on 15-Jul-2015

Apache Flink

Flink Community Update

March 2015

Robert Metzger

rmetzger@apache.org

Flink Community Updates

• What happened in the Flink community: check out the monthly newsletter on the blog

Subscribe to news@flink.apache.org

flink.apache.org

What happened

• Community decided to release Flink 0.9-milestone1 next week

Flink 0.9 will come a few weeks afterwards

• Flink runner for Google Dataflow API available

• Focus on streaming stability (YARN container restart, Kafka source checkpointing)

Now in master (0.9-SNAPSHOT)

• Expression API renamed to Table API

• Java support for Table API

Now in master: Flink Machine Learning Library

• Merged:

– ALS (Recommendations)

– Linear Regression & Multiple Linear Regression

– Utilities: basic data types (sparse vectors & matrix)

• Overview of open issues: https://issues.apache.org/jira/issues/?jql=component%20%3D%20%22Machine%20Learning%20Library%22%20AND%20project%20%3D%20FLINK

Flink on the Web

• Blog post: Peeking into Apache Flink's Engine Room [1]

• Naive Bayes on Apache Flink [2]

• Announcing Google Cloud Dataflow runner for Apache Flink [3][4][5]

• How to factorize a 700 GB matrix with Apache Flink [6]

[1] http://flink.apache.org/news/2015/03/13/peeking-into-Apache-Flinks-Engine-Room.html

[2] http://www.itshared.org/2015/03/naive-bayes-on-apache-flink.html

[3] http://googlecloudplatform.blogspot.de/2015/03/announcing-Google-Cloud-Dataflow-runner-for-Apache-Flink.html

[4] http://www.data-artisans.com/dataflow.html

[5] http://www.heise.de/developer/meldung/Big-Data-Google-Cloud-Dataflow-bekommt-Runner-fuer-Apache-Flink-2583392.html

[6] http://www.data-artisans.com/als.html

New Wiki pages with system internals

• Data Exchange between tasks

• Type Extraction and Serialization

• Memory Management (Batch API)

• Akka and Actors

Happy Users on Twitter

GitHub stats

Last Month
