wit career lecture series - cteixeira data scientist

17
© 2016 The MITRE Corporation. All rights reserved. Christopher Teixeira Lead Data Scientist, MITRE Life of a Data Scientist Twitter: @CT_Analytics LinkedIn: http://lnked.in/CTeixeira The author's affiliation with The MITRE Corporation is provided for identification purposes only, and is not intended to convey or imply MITRE's concurrence with, or support for, the positions, opinions or viewpoints expressed by the author.

Upload: christopher-teixeira

Post on 11-Apr-2017

89 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: WIT Career Lecture Series - CTeixeira Data Scientist

© 2016 The MITRE Corporation. All rights reserved.

C h r i s t o p h e r Te i x e i r aL e a d D a t a S c i e n t i s t , M I T R E

Life of a Data ScientistTw i t t e r : @ C T _ A n a l y t i c sL i n k e d I n : h t t p : / / l n k e d . i n / C Te i x e i r a

The author's affiliation with The MITRE Corporation is provided for identification purposes only, and is not intended to convey or imply MITRE's concurrence with, or support for, the positions, opinions or viewpoints expressed by the author.

Page 2: WIT Career Lecture Series - CTeixeira Data Scientist

| 2 |

© 2016 The MITRE Corporation. All rights reserved.

Math + Baseball = Future Red Sox General Manager…

Mathematics BS from Worcester Polytechnic Institute– Concentration in Applied Statistics– Capstone Project: Statistical Analysis of Defensive

Production in Major League Baseball

Operations Research MS from George Mason University– Concentration in Decision Analysis– Capstone Project: Team Organization for Senior Softball

Approved for Public Release; Distribution Unlimited. Case Number 16-2345

Page 3: WIT Career Lecture Series - CTeixeira Data Scientist

| 3 |

© 2016 The MITRE Corporation. All rights reserved.

How many industries use data science?

Operations Research Analyst at SAIC– NASA: Planning Missions to the Moon– DHS: Securing our Public Transit Systems

Advanced Analytics Senior Consultant at IBM– AETNA: Regionalized Fraud Analysis– DOD: Defeating IEDs

Senior Analytic Consultant at Epsilon– SunTrust: Real-time marketing– Bank of America: Understanding customer sentiment

Approved for Public Release; Distribution Unlimited. Case Number 16-2345

Page 4: WIT Career Lecture Series - CTeixeira Data Scientist

| 4 |

© 2016 The MITRE Corporation. All rights reserved.

Lead Data Scientist at MITRE

Using my skills from both school and previous jobs– Discrete Event Simulation, System Dynamics,

Agent Based Modeling– Data Analysis and Processing– Statistical Analysis and

Systems Engineering Techniques– Data Visualization

Problems I help solve:– How can we use math and statistics to better

serve Veterans waiting for benefits?– Can simulations help us to plan how clean up

America’s nuclear waste more safely, effectively, and within a reasonable cost?

– How can we make use of data and models to run our own organization more effectively?

– Can predictive analytics be used to help increase child welfare?

Chris Teixeira at the White House for the White House Foster Care & Technology Hackathon, May 28, 2016

Approved for Public Release; Distribution Unlimited. Case Number 16-2345

Page 5: WIT Career Lecture Series - CTeixeira Data Scientist

| 5 |

© 2016 The MITRE Corporation. All rights reserved.

Typical Day in the Life of a Data Scientist

Data Analysis20%

Cleaning Data40%

Statistical or Machine Learn-ing Model Build-

ing25%

Creating Visualization10%

Presenting Analysis5%

Typical Day's Activities

Page 6: WIT Career Lecture Series - CTeixeira Data Scientist

| 6 |

© 2016 The MITRE Corporation. All rights reserved.

Typical Day in the Life of a Lead Data Scientist

Data Analysis20%

Cleaning Data10%

Statistical or Machine Learn-ing Model Build-

ing25%

Creating Visualization10%

Presenting Anal-ysis35%

Typical Day's Activities

Page 7: WIT Career Lecture Series - CTeixeira Data Scientist

| 7 |

© 2016 The MITRE Corporation. All rights reserved.

Evaluating Children at Risk

Source: https://www.behance.net/gallery/3751117/Stop-Child-Abuse

Approved for Public Release; Distribution Unlimited. Case Number 16-2345

Page 8: WIT Career Lecture Series - CTeixeira Data Scientist

| 8 |

© 2016 The MITRE Corporation. All rights reserved.

National Public-Private Partnership to Eliminate Abuse and Neglect Fatalities

ABUSE AND NEGLECT

FATALITIES

FFRDC-operated, trusted analytic

environment

PROTECT OUR KIDS

ACT OF 2012and

CECANF Objectives

Identify

Analyze

Control

Manage

Predictive Risk Modeling, Data Visualization Tools, Monitoring

and Reporting

Integrated Stakeholder and Child Welfare Data

Approved for Public Release; Distribution Unlimited. Case Number 16-2345

Page 9: WIT Career Lecture Series - CTeixeira Data Scientist

© 2016 The MITRE Corporation. All rights reserved.For Internal MITRE Use.

| 9 |

N o t t h e s o c c e r c o m p a n y …B u t t h e b e s t p l a c e t o w o r k y o u h a v e p r o b a b l y n e v e r h e a r d o f !

The MITRE Corporation

Approved for Public Release; Distribution Unlimited. Case Number 16-2345

Page 10: WIT Career Lecture Series - CTeixeira Data Scientist

| 10 |

© 2016 The MITRE Corporation. All rights reserved.

Established to Serve the Public Interest

Approved for Public Release; Distribution Unlimited. Case Number 16-2345

Page 11: WIT Career Lecture Series - CTeixeira Data Scientist

| 11 |

© 2016 The MITRE Corporation. All rights reserved.

Today We Operate Seven FFRDCs

Approved for Public Release; Distribution Unlimited. Case Number 16-2345

Page 12: WIT Career Lecture Series - CTeixeira Data Scientist

| 12 |

© 2016 The MITRE Corporation. All rights reserved.

Understanding FFRDCs

Approved for Public Release; Distribution Unlimited. Case Number 16-2345

Page 13: WIT Career Lecture Series - CTeixeira Data Scientist

| 13 |

© 2016 The MITRE Corporation. All rights reserved.

Our Employees

Approved for Public Release; Distribution Unlimited. Case Number 16-2345

Page 14: WIT Career Lecture Series - CTeixeira Data Scientist

© 2016 The MITRE Corporation. All rights reserved.For Internal MITRE Use.

| 14 |

Advice on Getting into Data Science at MITRE

Page 15: WIT Career Lecture Series - CTeixeira Data Scientist

| 15 |

© 2016 The MITRE Corporation. All rights reserved.

Necessary Skills for a Data Scientist

Background in at least one of the following fields:– Statistics– Mathematics– Operations Research– Computer Science

Other skills:– Excellent written and verbal communication skills– Demonstrated ability to manipulate large datasets with at least one modern programming

language (e.g., Python, SAS, MATLAB, C++, R, Java) – Experience leveraging COTS tools or writing programs to visualize multi-dimensional data– Ability to apply, modify and formulate algorithms and processes to solve challenging

problems– Prior experience working with databases (e.g., Oracle, MySQL, MongoDB)

Junior Data Scientist (27721BR)www.mitre.org/careers

Page 16: WIT Career Lecture Series - CTeixeira Data Scientist

| 16 |

© 2016 The MITRE Corporation. All rights reserved.

Advice on Getting a Data Science position

Don’t be afraid to be a bit geeky!Start a Github accountLearn something new and show it off

–A new language (e.g. JavaScript / R / Python)–Analyze your own data (e.g. Fitbit or Apple Health)

Compete on a kaggle teamVolunteer at DataKind

Page 17: WIT Career Lecture Series - CTeixeira Data Scientist

© 2016 The MITRE Corporation. All rights reserved.For Internal MITRE Use.

| 17 |

Questions?