Taken some of the hype out of Big Data again - MedTech Pharma, Nürnberg, July 2014

MedTech Pharma Nürnberg 2014 Taking (some of) the mystery out of Big Data

Upload: claus-stie-kallesoe

Post on 27-Jan-2015


DESCRIPTION

I was invited to redo the talk about Big Data that I gave in Berlin earlier this year (those slides are also available here). The slides are similar, but updated to reflect my new company, and some slides are new. Enjoy!

TRANSCRIPT

Page 1:

MedTech Pharma Nürnberg 2014

Taking (some of) the mystery out of Big Data

Page 3:

Contact

Claus Stie Kallesøe

Founder, CEO

[email protected]

+45 30 14 15 36

Page 4:

Introduction

Page 5:

Big Data: either VERY large datasets AND/OR other complexities

Characteristics of big data

Source: IBM methodology

Page 6:

A couple of words about scale

• 100s of megabytes
  • This should not be a problem. It can be handled with Matlab, R or Ruby

• 100/500 gigabytes to 1 terabyte
  • 2-terabyte hard drives can be bought in the local shop for €100
  • Connect one to your laptop and install PostgreSQL or a NoSQL database on it

• > 5 terabytes
  • Now you might have a size issue

Inspired by: http://www.chrisstucchio.com/blog/2013/hadoop_hatred.html
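The rule of thumb on this slide can be written down as a small lookup. This is only a sketch: the function name `suggest_tooling` is hypothetical, and the exact cut-offs between the three tiers (1 GB and 5 TB) are assumptions, since the slide leaves the ranges in between open.

```python
# Decimal units, as used for hard-drive sizes.
MB, GB, TB = 10**6, 10**9, 10**12


def suggest_tooling(size_bytes):
    """Map a dataset size to the tier named on the slide.

    The boundaries (1 GB, 5 TB) are illustrative assumptions; the slide
    only names "100s of megabytes", "100/500 GB to 1 TB" and "> 5 TB".
    """
    if size_bytes < 0:
        raise ValueError("size_bytes must be non-negative")
    if size_bytes < 1 * GB:
        return "scripting tools (Matlab, R, Ruby)"
    if size_bytes <= 5 * TB:
        return "single-machine database (PostgreSQL or NoSQL) on an external drive"
    return "now you might have a size issue"


if __name__ == "__main__":
    for size in (300 * MB, 500 * GB, 6 * TB):
        print(size, "->", suggest_tooling(size))
```

The point is the order of the checks: size is looked at first, and only the last branch is where "Big Data" technology even enters the discussion.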

Page 7:

Big Data - “Definition”

"Big Data is high volume, high velocity, and/or high variety information assets that require new forms of processing to enable enhanced decision making, insight discovery and process optimization."

Page 8:

Cool, but remember where we are!

Gartner Hype Cycle 2013

Page 9:

Big Data in Pharma R&D

Page 10:

What is Big Data in Pharma R&D?

• Many ideas/possibilities across Pharma R&D and market access
  • But many of them are likely NOT “real” Big Data problems!

• Are they relevant and can they bring insights?
  • Yes, very much so

• Should we then find a way to handle them?
  • Absolutely

Page 11:

Disclaimer

• I am a (web) tech geek

• I have nothing against new technologies
  • Like many other geeks, I like them

• But do try to use the right tool for the right job

Page 12:

http://blog.mongohq.com/you-dont-have-big-data/

Page 13:

Another great tool - for some

Q: “Could you help me get to Nürnberg, pls?”
A: “Yes, absolutely. Not a problem”

Q: “Ok, btw I want to try the Endeavour”
A: “...ahh why?”

Q: “Because I have read it’s great”
A: “Yes, but the ICE….”

Page 14:

MapReduce explained in 41 words

Goal: Count the number of books in the library.

Map: You count up shelf #1, I count up shelf #2.

(The more people we get, the faster this part goes.)

Reduce: We all get together and add up our individual counts.

http://www.chrisstucchio.com/blog/2011/mapreduce_explained.html
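The library analogy can be sketched in a few lines of Python. This is a toy, single-machine illustration of the map and reduce steps, not how Hadoop or any other framework implements them; the names `count_shelf` and `count_books` and the example library are made up for the sketch.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def count_shelf(shelf):
    """Map step: one person counts the books on one shelf."""
    return len(shelf)


def count_books(library, workers=4):
    """Count all books: map over the shelves in parallel, then reduce."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Map: each worker counts its own shelves independently.
        per_shelf_counts = list(pool.map(count_shelf, library))
    # Reduce: get together and add up the individual counts.
    return reduce(lambda total, count: total + count, per_shelf_counts, 0)


if __name__ == "__main__":
    # Hypothetical library: each inner list is one shelf.
    library = [["book A", "book B"], ["book C"], []]
    print(count_books(library))
```

More workers only speed up the map step; the reduce step still has to collect every count in one place.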

Page 15:

What is it then? Linked data?

Page 16:

Does it matter what it is?

No!

It’s data - and potential analytics (business) opportunities.

Size and complexity should drive the technology

Page 17:

Technologies

Can we do anything on our own?

Page 18:

For many people/companies ”Big data technology” is a black box

”A lot of stuff”

And then the vendors go:

If { box = magic or money }
then { box = expensive }

Page 19:

Working within a community

A lot of tools available

From: http://people10.com/blog/ruby-on-rails-the-popular-platform-for-web-development/

Page 20:

New visualisations – easy and free

http://philogb.github.io/jit/demos.html

Page 21:

Automated calculations - can bring you far

Job submitted to async calculation server
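The slide does not say how the calculation server is built, so the following is only a sketch of the general pattern (submit a job, carry on, collect the result later), using a thread pool from the Python standard library as a stand-in for the server. The names `submit_job` and `mean_response` and the input values are hypothetical.

```python
from concurrent.futures import Future, ThreadPoolExecutor

# Stand-in for the calculation server: a small pool of background workers.
_executor = ThreadPoolExecutor(max_workers=2)


def mean_response(values):
    """Example calculation: the mean of a list of measurements."""
    return sum(values) / len(values)


def submit_job(func, *args) -> Future:
    """Hand a calculation to the background pool and return at once.

    The caller gets a Future and can ask for the result later.
    """
    return _executor.submit(func, *args)


if __name__ == "__main__":
    job = submit_job(mean_response, [1.0, 2.0, 3.0])
    # ... the caller is free to do other work here ...
    print(job.result(timeout=5))  # blocks only when the result is needed
```

In a real setup the pool would typically be a separate service with a job queue, but the calling pattern stays the same.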

Page 22:

https://circleci.com/

Also a lot of great tools to handle data

Page 23:

Elasticsearch text indexes

• Indexed research assay metadata
  => Google-like search to find the relevant assay

• Indexed SharePoint project workspaces
  => Enable easy, fast cross-project queries to find trends
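To show what a text index does, here is a tiny in-memory inverted index in plain Python. It is a sketch of the underlying idea only, not the Elasticsearch API or its internals, and the assay records are invented for the example.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lower-case the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Build an inverted index: token -> set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            index[token].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every token of the query."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result


if __name__ == "__main__":
    # Hypothetical assay metadata, keyed by assay id.
    assays = {
        "assay-1": "Kinase inhibition assay, human cell line",
        "assay-2": "Receptor binding assay, rat tissue",
    }
    index = build_index(assays)
    print(search(index, "kinase assay"))
    print(sorted(search(index, "assay")))
```

A real search engine adds ranking, stemming and persistence on top, but the lookup itself is this kind of token-to-document map.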

Page 24:

Conclusion - Big Data in Pharma R&D

• Many opportunities across R&D and market access

• More data linking and data analytics than Big Data

• You can use freely available tools on ”normal” hardware

• No magic ”under the hood” - it’s just data

Page 25:

BUT you still need to define the questions you want to answer - before diving into technology!

Page 26:

www.gritsystems.dk

Ask….