![Page 1: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/1.jpg)
Can I add this class?
Lulu Liang (ll882) is handling the waiting list. We expect all majors and minors to be able to enroll.
![Page 2: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/2.jpg)
INFO 2950: Intro to Data Science
Prof. David Mimno
![Page 3: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/3.jpg)
Thank you for your interest, but...
This class is required for InfoSci majors and minors.
If you do not need it, please consider other options.
![Page 4: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/4.jpg)
Where to find things
● Course website: http://mimno.infosci.cornell.edu/info2950
● Question answering: https://campuswire.com/c/G7E579AA4 (code 3402)
● Assignments: CMS (enrollment will sync every 24 hrs)
![Page 5: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/5.jpg)
Textbooks
VanderPlas, Python Data Science Handbook
James, Witten, Hastie, Tibshirani, An Introduction to Statistical Learning
Both are free; links are on the course website.
![Page 6: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/6.jpg)
The wheat is stored...
The information is stored...
The data is stored...
![Page 7: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/7.jpg)
![Page 8: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/8.jpg)
Statistics (20th century version)
Experiments are designed
Computation is hard
Data is expensive
Goal is causation
Wikipedia, Fisher; Gosset
![Page 9: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/9.jpg)
Data Science (21st century)
Observations are gathered opportunistically
Computation is cheap
Data is abundant
Goal is prediction
linksys.com
![Page 10: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/10.jpg)
Drew Conway's Venn diagram
http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram
![Page 11: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/11.jpg)
Data science pattern
1. Map real-world entities to a computational representation
2. Perform mathematical operations on those representations
3. Interpret results of those operations
![Page 12: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/12.jpg)
Data science pattern
1. Map real-world entities to a computational representation
2. Perform mathematical operations on those representations
3. Interpret results of those operations
4. [go to step 1]
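A minimal sketch of one pass through this pattern, not taken from the slides. The student names, the sleep values, the `average` and `interpret` helpers, and the 7-hour threshold are all made up for illustration.

```python
from statistics import mean

# Step 1: map real-world entities to a computational representation.
# Hypothetical data: hours of sleep reported by three made-up students.
sleep_hours = {"student_a": 6.0, "student_b": 8.0, "student_c": 7.0}


def average(values):
    """Step 2: perform a mathematical operation on the representation."""
    return mean(values)


def interpret(avg, threshold=7.0):
    """Step 3: turn the number back into a statement about the world.

    The threshold is an arbitrary illustrative choice, not a recommendation.
    """
    if avg >= threshold:
        return "at or above threshold"
    return "below threshold"


avg_sleep = average(list(sleep_hours.values()))
print(avg_sleep, interpret(avg_sleep))
```

Step 4 would be to question the representation itself (for example, whether self-reported hours are reliable) and start again from step 1.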
![Page 13: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/13.jpg)
Math questions
What representations are good for supporting mathematical operations?
How can we create accurate mathematical models of real-world events?
How can we convince ourselves and others that this isn't just randomness?
![Page 14: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/14.jpg)
The math is the easy part
● Is the data reliable and complete?
● Are we answering the right question?
● How can we balance between what is useful and what is easily available?
● Will anyone believe that we have the right answer? Should they?
Wikipedia "Town hall meeting"
![Page 15: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/15.jpg)
Live experiment! Find a study group
https://forms.gle/NCZ6CSMB6qiiasfUA
![Page 17: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/17.jpg)
Weekly pattern
Monday
Mimno office hours, 1:30-3:30, Gates 205
Tuesday
Presentation of new material
Wednesday Thursday
Presentation of new material; Homework due 11:59pm
Friday
Lab sessions: practice and discuss
![Page 18: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/18.jpg)
For Friday: Install Python 3
● Anaconda is the easiest, most reliable installation: https://anaconda.com/download
● NO PYTHON 2.
○ To check: type print "hello" with no parentheses. You should get an error.
We will work in notebooks, scripts, and the command line (>>>)
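As an alternative to the print test, the interpreter can report its own version. This is a small sketch using the standard `sys` module; the helper name `is_python3` is made up for illustration.

```python
import sys


def is_python3(version_info=sys.version_info):
    """Return True when the given version tuple belongs to Python 3."""
    return version_info[0] == 3


if __name__ == "__main__":
    # sys.version starts with the version number, e.g. "3.x.y ..."
    print("Python version:", sys.version.split()[0])
    if is_python3():
        print("OK: this is Python 3")
    else:
        print("Problem: this is not Python 3")
```

The same code can be pasted into a notebook cell, saved as a script, or typed at the >>> prompt.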
![Page 19: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/19.jpg)
RIP Python 2
Wikipedia, "Headstone"
![Page 20: Can I add this class?](https://reader031.vdocuments.us/reader031/viewer/2022032118/622ed3cb4e847f1743053963/html5/thumbnails/20.jpg)
How to do well in this class
Show up
Don't just read, test yourself
Start early
Snacks!
Healthy sleep