mass tlc big data panel sep 20
DESCRIPTION
TRANSCRIPT
![Page 1: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/1.jpg)
What Does All This
Data Mean?
September 20, 2012IBM Innovation Center
Waltham MA
MassTLC Big Data Seminar
@m
asstlc #bigdata
![Page 2: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/2.jpg)
What Does All This Data Mean?
Agenda •Setting the Context•Introducing the Panel•Panel Discussion•Q&A
– Hashtags: @masstlc #bigdata
![Page 3: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/3.jpg)
Your Panel
• Richard Dale, Managing Director, Big Data Boston
Ventures – Twitter: @rdale
• Irene Greif, Fellow, IBM Visualization– Twitter: @igreif
• Martin Leach, CIO, Broad Institute– Twitter: @mdleach
• Andrew Pandre, Principal, Sears Holding Cos – http://apandre.wordpress.com/
![Page 4: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/4.jpg)
Richard Dale
Managing Director, Big Data Boston VenturesMicro-VC fund investing in big data companies located in or connected to the regional big data cluster
Database techie turned Entrepreneur turned VC– Database Performance Guru, SQL Solutions– Co-founder, Phase Forward– Principal, Sigma Partners– Founder & Managing Director, Big Data Boston Ventures
![Page 5: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/5.jpg)
Setting the Context
• What is Big Data?
• Where does Big Data come from?
• What is Big Data going?
![Page 6: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/6.jpg)
What is Big Data?
a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools
(wikipedia)
![Page 7: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/7.jpg)
What is Big Data?
3 V’s: •volume •velocity•variety
(Doug Laney, Gartner)
![Page 8: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/8.jpg)
What is Big Data?
Data easier and cheaper to collect than to analyze
(??)
![Page 9: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/9.jpg)
What is Big Data?
Data that you can’t process on a single machine, however big your machine (and however long you wait)
or
Data growing faster than Moore’s law
(Richard Dale)
![Page 10: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/10.jpg)
Where Does Big Data Come From?
Behavior•Social Media•User Generated Content•Click streams•Viewing, Purchasing, Liking, Sharing•The Quantified Self
![Page 11: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/11.jpg)
Where Does Big Data Come From?
Observation (in ever finer granularity)•Machines
– Computers, Vehicles, Phones, Industrial Machines•Environments
– RFID, Traffic flow, Nature (and our impact)•People
– The Quantified Self– Medical imaging– Genetic sequencing
![Page 12: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/12.jpg)
Where Does Big Data Come From?
Correlations•Each data item, image or observation can be cross-correlated with any other
•Even if N is tractable, N x N x N x … is not
![Page 13: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/13.jpg)
Technology Landscape
Infrastructure: Storing, Managing, MovingInfrastructure: Storing, Managing, Moving
Analytics: Algorithms, Visualization, Machine Learning
Analytics: Algorithms, Visualization, Machine Learning
Applications: Horizontal and Verticalbusiness or domain applications
Applications: Horizontal and Verticalbusiness or domain applications
Data Services:
Collecting,Collating,
Correlating,Curating
Data Services:
Collecting,Collating,
Correlating,Curating
Source:
![Page 14: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/14.jpg)
Technology Landscape
Infrastructure: Storing, Managing, MovingInfrastructure: Storing, Managing, Moving
Analytics: Algorithms, Visualization, Machine Learning
Analytics: Algorithms, Visualization, Machine Learning
Applications: Horizontal and Verticalbusiness or domain applications
Applications: Horizontal and Verticalbusiness or domain applications
Data Services:
Collecting,Collating,
Correlating,Curating
Data Services:
Collecting,Collating,
Correlating,Curating
Source:
![Page 15: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/15.jpg)
Technology Landscape
Infrastructure: Storing, Managing, MovingInfrastructure: Storing, Managing, Moving
Analytics: Algorithms, Visualization, Machine Learning
Analytics: Algorithms, Visualization, Machine Learning
Applications: Horizontal and Verticalbusiness or domain applications
Applications: Horizontal and Verticalbusiness or domain applications
Data Services:
Collecting,Collating,
Correlating,Curating
Data Services:
Collecting,Collating,
Correlating,Curating
Source:
![Page 16: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/16.jpg)
A Sea of Choices for Data Viz
• BI packages• Dashboard reporting tools • Ad hoc infographics• Whiteboards• Napkin scribbles
![Page 17: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/17.jpg)
Turning Big Data into Big ClarityArt or Science? Let’s ask the Panel!
•Irene Greif, IBM Fellow– Twitter: @igreif
•Martin Leach, CIO, Broad Institute– Twitter: @mdleach
•Andrew Pandre, Principal, Sears Holding Cos – http://apandre.wordpress.com/
![Page 18: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/18.jpg)
Turning Big Data into Big ClarityArt or Science? Let’s ask the Panel!
•Irene Greif, IBM Fellow– Twitter: @igreif
•Martin Leach, CIO, Broad Institute– Twitter: @mdleach
•Andrew Pandre, Principal, Sears Holding Cos – http://apandre.wordpress.com/
![Page 19: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/19.jpg)
IBM Center for Social BusinessIrene Greif, IBM Fellow, Chief Scientist for Social Business
Many Eyes
![Page 20: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/20.jpg)
Turning Big Data into Big ClarityArt or Science? Let’s ask the Panel!
•Irene Greif, IBM Fellow– Twitter: @igreif
•Martin Leach, CIO, Broad Institute– Twitter: @mdleach
•Andrew Pandre, Principal, Sears Holding Cos – http://apandre.wordpress.com/
![Page 21: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/21.jpg)
• The Broad Institute is a non-profit biomedical research institute
• Ten core faculty members and approximately 150 associate members from across MIT and Harvard
• Greater than 1900 research and administrative staff
Programs and Initiativesfocused on specific disease or biology areas
CancerGenome BiologyGenome Sequencing and AnalysisCell CircuitsPsychiatric DiseaseMetabolismMedical and Population GeneticsChemical Biology/Novel TherapeuticsInfectious DiseaseEpigenomics
Platformsfocused technological innovation and application
Genomics PlatformBiological SamplesGenome SequencingGenetic Analysis
Chemical Biology/Novel TherapeuticsImagingMetabolite ProfilingProteomicsRNAiTherapeutics Discovery & Development
The Broad Institute of MIT & Harvard
Martin Leach, CIO
![Page 22: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/22.jpg)
Turning Big Data into Big ClarityArt or Science? Let’s ask the Panel!
•Irene Greif, IBM Fellow– Twitter: @igreif
•Martin Leach, CIO, Broad Institute– Twitter: @mdleach
•Andrew Pandre, Principal, Sears Holding Cos – http://apandre.wordpress.com/
![Page 23: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/23.jpg)
Big Data VisualizationAndrew Pandre, Ph.D.,PrincipalSears Holdings Corporation
Google+ microblog: http://tinyurl.com/VisibleData
Data Visualization Bloghttp://apandre.wordpress.com
![Page 24: Mass tlc big data panel sep 20](https://reader033.vdocuments.us/reader033/viewer/2022051609/54770ad3b4af9f52728b493c/html5/thumbnails/24.jpg)
@masstlc #bigdata