big data and wee data by avery rosen

22
MongoDB and Wee Data hacking a workflow

Upload: hakka-labs

Post on 10-May-2015

312 views

Category:

Technology


0 download

DESCRIPTION

Big Data and Wee Data - We all know MongoDB is great for Big Data, but it's also great for work on the other end of the scale -- call it "Wee Data". In this talk, MongoDB expert an Principal at Bringing Fire Consulting, Avery Rosen, talks on how this type of data is far more common than Big Data scenarios. Avery discusses how just about every project starts with it. In this domain, we don't care about disk access and indices; instead, we care about skipping past the wheel inventing and getting right down to playing with the data. MongoDB lets you persist your prototype or small-working-set data without making you deal with freeze-drying and reconstitution, provides structure well beyond csv, gets out of your way as you evolve your schemas, and provides simple tools for introspecting data and crunching numbers.

TRANSCRIPT

Page 1: Big Data and Wee Data by Avery Rosen

MongoDB and Wee Data

hacking a workflow

Page 2: Big Data and Wee Data by Avery Rosen

what it’s all about

• MongoDB lets you persist and investigate your application data while getting out of your way

• p.s. BDD

Page 3: Big Data and Wee Data by Avery Rosen

me

• http://bringingfire.com/

• @4BringingFire

• expertise at the intersection of technology, product formulation, and business strategy

Page 4: Big Data and Wee Data by Avery Rosen

stuff I’m into

• products > tech

• UX testing

• iterative, analytics-driven dev

• agility in production

Page 5: Big Data and Wee Data by Avery Rosen

http://www.shopwiki.com

Page 7: Big Data and Wee Data by Avery Rosen

http://penrosejudaica.com

Page 8: Big Data and Wee Data by Avery Rosen

total projectsbig data projects= ?< 1%NOT AN

ACTUAL FACT

Page 9: Big Data and Wee Data by Avery Rosen

what is wee data?

• query performance is not an issue

• in fact it might fit in RAM

• in fact it might fit in 2MB

Page 10: Big Data and Wee Data by Avery Rosen

where mongo fits

• scales down nicely

• access programmatically without SQL

• persists for free

• query later, too

Page 11: Big Data and Wee Data by Avery Rosen

ORMpuh-lease.

Page 12: Big Data and Wee Data by Avery Rosen

http://penrosejudaica.com

Page 13: Big Data and Wee Data by Avery Rosen

Behavior Driven Development

• “Outside-in” approach

• Everything is behavior

• Acceptance-test-driven planning

• Ubiquitous language

Page 14: Big Data and Wee Data by Avery Rosen

Cucumber

Page 15: Big Data and Wee Data by Avery Rosen

Cucumber

Page 16: Big Data and Wee Data by Avery Rosen

Cucumber

Page 17: Big Data and Wee Data by Avery Rosen

Cucumber

Page 18: Big Data and Wee Data by Avery Rosen

rspec

• specification

• object doubles (mocks, fakes, stubs, spies)

• method stubbing

• expectations

Page 19: Big Data and Wee Data by Avery Rosen
Page 20: Big Data and Wee Data by Avery Rosen

Behavior Driven Development

• http://cukes.info/

• http://rspec.info/

Page 21: Big Data and Wee Data by Avery Rosen

Name Materials Price Description

fancy menorah gold, silver 600

plain menorah pewter 150

the Penrose ETL pipeline

Product Spreadsheets

.csv.csv.csv

CSV files Product Descriptions

Blah blah blah exquisite

craftsmanship

db.products.aggregate( { $match : { product_type : “menorah” } }, { $group : { _id: “$material”, count:

{ $sum : 1 } } )

Buy AdWords?

Page 22: Big Data and Wee Data by Avery Rosen

Q & A