data science orientation
TRANSCRIPT
![Page 1: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/1.jpg)
![Page 2: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/2.jpg)
Haritha ThilakarathneSoftware Engineer – Data Science & AnalyticsTech One Global – Enadoc Dev Center http://haritha.me
![Page 3: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/3.jpg)
![Page 4: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/4.jpg)
![Page 5: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/5.jpg)
Data science is a multidisciplinary blend of data inference, algorithm development, & Technology in order to solve analytically complex problems.
• Making decisions• Confirming hypotheses• Gaining insights• Predicting future
![Page 6: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/6.jpg)
![Page 7: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/7.jpg)
![Page 8: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/8.jpg)
![Page 9: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/9.jpg)
![Page 10: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/10.jpg)
Big Data Manipulation & Analysis
![Page 11: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/11.jpg)
![Page 12: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/12.jpg)
Data Mining
![Page 13: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/13.jpg)
![Page 14: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/14.jpg)
Data Visualization
![Page 15: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/15.jpg)
Detail on distribution of artworks in the Tate collection by birthdate of artists, visualized by Florian Krautli.
![Page 16: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/16.jpg)
Data Collection & Preparation • Extracting data from difficult sources• Filling in missing values•Removing suspicious data•Making formats, encoding, and units consistent•De-duplicating and matching
![Page 17: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/17.jpg)
Correlation and Causation•Correlation – Values track each other• Height and Shoe Size • Grades and Entrance Exam Scores
•Causation – One value directly influences another • Education Level ->Starting Salary • Temperature -> Cold Drink Sales
![Page 18: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/18.jpg)
![Page 19: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/19.jpg)
![Page 20: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/20.jpg)
Overfitting & Underfitting
![Page 21: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/21.jpg)
Languages, Systems, Platforms• Spreadsheets• Programming Languages (R/Python)• Relational Database Management Systems • NoSQL Systems (Cassandra/ DocumentDB/ MongoDB)• Specialized Languages on scalable systems ( MapReduce/
Hadoop)• Systems for data visualization (PowerBI/ Tableau)• Data Processing on Cloud (Azure, Amazon Web Services)
![Page 22: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/22.jpg)
![Page 23: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/23.jpg)
![Page 24: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/24.jpg)
![Page 25: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/25.jpg)
![Page 26: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/26.jpg)
![Page 27: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/27.jpg)
![Page 28: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/28.jpg)
Regression
![Page 29: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/29.jpg)
Regression Goal: Function f applied to training data should
produce values as close as possible in aggregate to actual outputs
![Page 30: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/30.jpg)
Classification
![Page 31: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/31.jpg)
Clustering
![Page 32: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/32.jpg)
![Page 33: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/33.jpg)
Neural Networks
![Page 34: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/34.jpg)
![Page 35: Data Science Orientation](https://reader031.vdocuments.us/reader031/viewer/2022030307/58e52c821a28abac7e8b4fd5/html5/thumbnails/35.jpg)