data scientist enablement roadmap 1.0

20
Data Scientist Enablement Roadmap Advanced Center of Excellence Modern Renaissance Corporation In Collaboration with SONO team and others

Upload: mohan-bavirisetty

Post on 15-Jan-2015

858 views

Category:

Technology


5 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Data Scientist Enablement roadmap 1.0

Data Scientist Enablement Roadmap

Advanced Center of ExcellenceModern Renaissance CorporationIn Collaboration with SONO team and others

Page 2: Data Scientist Enablement roadmap 1.0

We thank our community of committed and passionate volunteers, experts, educators, innovators, benefactors, advisers, advocates and supportersWe are also grateful to the outstanding support and encouragement from SONO team as well as other organizations like Open Courseware Consortium, MIT, IBM, HortonWorks, Stanford University, and Caltech etc.

Acknowledgement

Page 3: Data Scientist Enablement roadmap 1.0

Principles

Philanthropy through Free and Open Education, Knowledge Dissemination and Social Innovation

Synthesize Data Science, Big Data Architecture, Technology Platforms and Systems Engineering for Decision Making

Collaboration, Crowdsourcing and Innovation Diffusion

Emphasis on Knowledge, Skills and Abilities (KSAs) over Abstract Mathematics or Theoretical Profundity

Principles without programs are platitudes.- George Bernard Shaw

Page 4: Data Scientist Enablement roadmap 1.0

Industry needs Data Scientists with versatile background in Machine Learning, Statistics, Big Data Architecture, Advanced Analytics, Evidence-Oriented Systems Engineering.

Motivation

Aspiring Data Scientists, Big Data Engineers also need well-rounded education, mentorship from experts as well as practical skills

Page 5: Data Scientist Enablement roadmap 1.0

Goals

Prepare the students, practitioners to have set of T-shaped practical-skills emphasizing depth and breadth of a range of relevant disciplines and capabilities in Data/Decision Sciences and Big Data Architecture/Engineering.

Make the course delivery easy, engaging and engendering.

Page 6: Data Scientist Enablement roadmap 1.0

Data Science Enablement Roadmap - 2014

1 + 3 Courses gets you Master’s Level Certificate

Fast track toData Science

Ramping up Machine Learning with R

Modern Data Platforms

Advanced Techniques inBig Data Analytics

Page 7: Data Scientist Enablement roadmap 1.0

Data Science Enablement Roadmap - Future

Possible extensions in future

Fast track toData Science

Ramping with R

Modern Data Platforms

Data Mining Process Methodologies and Tools

Machine Learning/AI

Advanced Techniques inBig Data Analytics

Data Visualization

Page 8: Data Scientist Enablement roadmap 1.0

Introductory course with NO pre requisites. Topics include Algorithms, Statistical Inference, Data Analysis, Model Building, Validation, Calibration,Data at rest and in motion, Causality, Meaning of Data, Data Engineering, Hadoop, R, Machine Learning,Data Mining, Visualization, Applications, Case Studies, variety of tools and techniques etc.

Fast track to Data Science (DSE 400)

Page 9: Data Scientist Enablement roadmap 1.0

Prerequisite: DSE 400Applied Statistics, Machine Learning, Data Mining, Graphing, Analytics and VisualizationUse cases, Industry Applications

Ramping up with R (DSE 501)

Page 10: Data Scientist Enablement roadmap 1.0

Prerequisite: DSE 400Employ Hadoop and Hadoop Ecosystem to enable Enterprises handle data explosion and derive actionable analytics.MapReduce, Pig, Hive, NoSQL, Zookeeper etc.Also introduce streams computing with Storm and KafkaCase Studies: Fraud Prevention, Product Recommendation, Epidemic Prediction etc.

Modern Data Platforms (DSE 502)

Page 11: Data Scientist Enablement roadmap 1.0

Modern Data Platforms (Contd ...)

Page 12: Data Scientist Enablement roadmap 1.0

Prerequisite: DSE 400Explore and implement Machine Learning Algorithms such as Classification, Clustering, Ranking, Recommendation, Neural Networks, Adaptive Learning.

Also to include Knowledge Engineering, Expert Systems, Ontologies, NLP and Reasoning

Machine Learning and AI (DSE 503)

Page 13: Data Scientist Enablement roadmap 1.0

Prerequisite: DSE 400Decision TreesRegressionClassificationClusteringAssociation Rules etc.

Data Mining Process and Methodologies (DSE 504)

Page 14: Data Scientist Enablement roadmap 1.0

Prerequisite: DSE 400Story Telling/Data JournalismData Visualization MethodologyTools and TechniquesHTML5, d3.js, BIRT, Prefuse

Data Visualization (DSE 505)

Page 15: Data Scientist Enablement roadmap 1.0

Prerequisite: DSE 400On Demand Data IntegrationData VirtualizationEnterprise Data HubAnalytics DashboardsBig Data AppliancesAnalytics as a ServiceOpen Stack and SavannahPrivacy and Security

Advanced Techniques for Big Data Analytics ( DSE 600)

Page 16: Data Scientist Enablement roadmap 1.0

Advanced Techniques for Big Data Analytics ( contd ...)

Page 17: Data Scientist Enablement roadmap 1.0

DSE 2014 stream is set to commence on Jan 19, 2004For more details, visit DSE 400 Announcement Page <http://bit.ly/18zPE1j>To Enroll for DSE 400 visit Enrollment Page

<http://soknocommunity.com/xoops/modules/xforms/?form_id=2>This presentation can also accessed at

Data Scientist Enrollment Roadmap 1.0 <http://bit.ly/1c2wC2P>We welcome thoughts and suggestions. Write to us at <[email protected]>

Next Steps

Page 18: Data Scientist Enablement roadmap 1.0

Data Jujitsu - The Art of Turning Data into a Product by DJ PatelData Science eBook Dr. Vincent GarvilleData Scientist - Sexiest Job of 21 Century (HBR)Doing Data Science by Rachel ShuttData Visualization: a successful design process by Andy KirkDisruptive Possibilities: How Big Data Changes Everything by Jeffrey NeedhamHow to process, analyze and Visualize Data (MIT OCW) Knowledge-based Systems (MIT OCW) Learning from Data (Caltech)Statistical Thinking and Data Analysis (MIT OCW) The Complete Guide to Business Analytics (Collection) By: Thomas H. DavenportThink Bayes; Think Python; and Think Stats by Aleen Downey

References

Learn from the masters - Johann Wolfgang von Goethe

Page 19: Data Scientist Enablement roadmap 1.0

Google White papers on GFS, MapReduce, BigtableReal-time Analytics with Big Data - Facebook Case StudyInteractive Data Visualization by Scott MurrayData, Models and Decisions (MIT OCW)Communicating Data (MIT Sloan School of Management OCW)Unleashing the Power of Hadoop - DBTA Thought Leadership SeriesHow Educators Can Narrow Big Data Skills Gap - Jeff Bertloucci, Data Visualization with D3.js CookbookWhat is Data Science?Agile Data Science

References (contd …)

Page 20: Data Scientist Enablement roadmap 1.0

Thank You