Download - Intro to Python Data Analysis in Wakari
![Page 1: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/1.jpg)
Intro to Python Data Analysis in Wakari
Karissa McKelveySoftware Developer Continuum Analytics
@karissamck
November 8, 2013PyData NYC
![Page 2: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/2.jpg)
$ WHOAMI
karissamck.com@karissamck
![Page 3: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/3.jpg)
truthy.indiana.edu
![Page 4: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/4.jpg)
More Tweets, Mote Votes
![Page 5: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/5.jpg)
Get you excited about data analysis in Wakari
Walk through some basic analysis packages and wakari workflows
Kick-start your journey
MY GOALS
![Page 6: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/6.jpg)
WHO ARE YOU?
![Page 7: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/7.jpg)
![Page 8: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/8.jpg)
![Page 9: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/9.jpg)
![Page 10: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/10.jpg)
![Page 11: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/11.jpg)
![Page 12: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/12.jpg)
![Page 13: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/13.jpg)
![Page 14: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/14.jpg)
Putting Science back in Comp Sci
• Much of the software stack is for systems programming --- C++, Java, .NET, ObjC, web
- Complex numbers? - Vectorized primitives?
• Software stack for scientists is not as helpful as it should be
• Fortran is still where many scientists end up
![Page 15: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/15.jpg)
![Page 16: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/16.jpg)
Why Python?
![Page 17: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/17.jpg)
High Performance with BIG DATA
![Page 18: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/18.jpg)
Packages for data analysis and visualization
![Page 19: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/19.jpg)
Syntax – Gets out of your way
![Page 20: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/20.jpg)
Community Driven
![Page 21: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/21.jpg)
Ready for web applications, too.
![Page 22: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/22.jpg)
![Page 23: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/23.jpg)
• “Python is good for data cleanup, R for statistical models”
“Which is the better Data Analysis language? R or Python?” Quora. http://www.quora.com/Data-Analysis/Which-is-the-better-Data-analysis-language-R-or-Python
![Page 24: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/24.jpg)
• “Python is good for data cleanup, R for statistical models”
• “R is quirky and weird but the statisticians love it and there really isn’t any compelling reason to switch”
“Which is the better Data Analysis language? R or Python?” Quora. http://www.quora.com/Data-Analysis/Which-is-the-better-Data-analysis-language-R-or-Python
![Page 25: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/25.jpg)
• “Python is good for data cleanup, R for statistical models”
• “R is quirky and weird but the statisticians love it and there really isn’t any compelling reason to switch”
• “You’re running an MCMC simulation on a laptop? Perhaps you should write it in C++/FORTRAN”
“Which is the better Data Analysis language? R or Python?” Quora. http://www.quora.com/Data-Analysis/Which-is-the-better-Data-analysis-language-R-or-Python
![Page 26: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/26.jpg)
Ready for DATA, and then some
“You’re running an MCMC simulation on a laptop? Perhaps you should write it in C++/FORTRAN”
![Page 27: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/27.jpg)
Numba: just-in-time compiler to LLVM through @decorators
numba.pydata.org
![Page 28: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/28.jpg)
Numba: just-in-time compiler to LLVM through @decorators*
numba.pydata.org*aka, fast. easy.
![Page 29: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/29.jpg)
![Page 30: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/30.jpg)
Basic packages for data analysis and visualization
![Page 31: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/31.jpg)
NumPy: The foundation of the Python Data Analysis stack
![Page 32: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/32.jpg)
NumPy: Array-oriented
![Page 33: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/33.jpg)
![Page 34: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/34.jpg)
![Page 35: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/35.jpg)
![Page 36: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/36.jpg)
Pandas: Builds upon NumPy
![Page 37: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/37.jpg)
Matplotlib: 2D plotting library
![Page 38: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/38.jpg)
IPython: Interactive Python (+ in the Web)
tab completionmagic %-commands
Inline plots
![Page 39: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/39.jpg)
Anaconda: pulls it all together
![Page 40: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/40.jpg)
![Page 41: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/41.jpg)
wakari.io Browser-based Python & Linux environment
![Page 42: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/42.jpg)
Share files, IPython notebooks, and plots with pay-as-you-go compute
IPython Notebook
Scientific Packages
Terminal
![Page 43: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/43.jpg)
Sharing in Wakari
• Packages IPython notebooks, files, folders, data, and environment
• Get a link
• Share that link.
![Page 44: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/44.jpg)
Reproducible Research
![Page 45: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/45.jpg)
![Page 46: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/46.jpg)
“A rule of thumb among biotechnology venture capitalists is that half of published research
cannot be replicated”
![Page 47: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/47.jpg)
How do we replicate research today?
![Page 48: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/48.jpg)
How do we replicate research today?collaborate on
![Page 49: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/49.jpg)
How do we replicate research today?collaborate on
data analysis
![Page 50: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/50.jpg)
How do we collaborate today?
![Page 51: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/51.jpg)
How do we collaborate today?
![Page 52: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/52.jpg)
How do we collaborate today?
![Page 53: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/53.jpg)
How do we collaborate today?
![Page 54: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/54.jpg)
????????
![Page 55: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/55.jpg)
How do we replicate research today?
![Page 56: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/56.jpg)
wakari.io Browser-based Python & Linux environment
![Page 57: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/57.jpg)
Enterprise or Cloud
Online at wakari.io or install locally for access to your hardware and data
![Page 58: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/58.jpg)
wakari.io Browser-based Python & Linux environment
![Page 59: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/59.jpg)
Coming Soon
![Page 60: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/60.jpg)
Project-based interaction
Projects starting at 10$/month with unlimited team members
user
![Page 61: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/61.jpg)
Interactive Plotting
Next-generation collaborative data manipulation, analysis, and presentation
![Page 62: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/62.jpg)
Talks to see
• Jack Vanderplas (Washington)– Efficient computing with Numpy • 29th Floor combo 3pm (Right now, next door!)
• Julia Evans (N/A)– A practical introduction to IPython Notebook &
pandas • Here, 4:45pm.
![Page 63: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/63.jpg)
Talks to see
• Sarah Guido (Michigan)– A Beginner’s Guide to Machine Learning with
scikit-learn
• Imram Haque (Counsyl)– Beyond the dict
• Peter Wang (Continuum)– Bokeh Workshop
![Page 64: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/64.jpg)
Special Thanks
Ben ZaitlinMark FlorissonClayton Davis
Bryan Van de VenTravis Oliphant
![Page 65: Intro to Python Data Analysis in Wakari](https://reader037.vdocuments.us/reader037/viewer/2022102814/54c65e814a795934598b4608/html5/thumbnails/65.jpg)
Karissa McKelvey@karissamck