data science

54
• Data Science https://store.theartofservice.com/the-data-science- toolkit.html

Upload: lynn-campbell

Post on 12-Jan-2016

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Data Science

• Data Science

https://store.theartofservice.com/the-data-science-toolkit.html

Page 2: Data Science

Dynamic systems development method - DSDM and the DSDM Consortium: origins

1 People at that meeting all worked for blue-chip organisations such as

British Airways, American Express, Oracle and Logica (other companies

such as Data Sciences and Allied Domecq have since been absorbed

by other organisations).

https://store.theartofservice.com/the-data-science-toolkit.html

Page 3: Data Science

Shugart Associates - History

1 In the story told by Don Massaro the size and basic design was laid out by

him and Jimmy Adkisson with cardboard and scissors during a car

trip to Herkimer to visit Mohawk Data Sciences

https://store.theartofservice.com/the-data-science-toolkit.html

Page 4: Data Science

Hadoop - Commercially supported Hadoop-related products

1 * [ http://www.bigdatapartnership.com

Big Data Partnership], a leading European-based pure-play big data service provider in Data Science, Engineering and certified Training offers vendor neutral and platform independent Hadoop consulting

implementation services and certified training

https://store.theartofservice.com/the-data-science-toolkit.html

Page 5: Data Science

GoPivotal - Services

1 *'Pivotal Data Science Labs' - a data science consultation service provided

by Pivotal’s team of data scientists[http://gopivotal.com/pivota

l-services-and-solutions/pivotal-strategic-services/pivotal-data-

science-labs Pivotal Data Science Labs], retrieved 22-May-2013

https://store.theartofservice.com/the-data-science-toolkit.html

Page 6: Data Science

GoPivotal - Open source software

1 * OpenChorus - a platform for collaborative data science with Pivotal customers, data science

practitioners, open source developers, and partners, for

predictive analytics

https://store.theartofservice.com/the-data-science-toolkit.html

Page 7: Data Science

Prescriptive analytics - Further reading

1 * Laney, Douglas and Kart, Lisa, (March 20, 2012). [

http://www.parabal.com/uploads/docs/Greenplum/Emerging%20Role%20of

%20the%20Data%20Scientist%20and%20the%20Art%20of

%20Data%20Science.pdf Emerging Role of the Data Scientist and the Art

of Data Science] Gartner.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 8: Data Science

Tribe (internet) - Language profiling

1 Not only do Twitter tribes have mutual interestsCragg, M.,

[http://www.theguardian.com/music/2011/apr/02/twitter-tribes-lady-gaga-rihanna The rise of the Twitter tribes], The Guardian, April 2, 2011, but they also

share potentially subconscious language features as found in the 2013 study by researchers from Royal

Holloway University of London and Princeton University|Princeton.[http://www.epjdatascience.com

/content/2/1/3 Word usage mirrors community structure in the online social network Twitter], European Physical Journal|EPJ Data Science, 25

February 2013

https://store.theartofservice.com/the-data-science-toolkit.html

Page 9: Data Science

Punched card - History

1 Mohawk Data Sciences introduced a magnetic tape encoder in 1965, a system

marketed as a keypunch replacement which was somewhat successful, but punched cards were still commonly used for data entry and programming until the mid-1980s when the

combination of lower cost disk drive|magnetic disk storage, and affordable

computer terminals|interactive terminals on less expensive minicomputers made punched

cards obsolete for this role as well

https://store.theartofservice.com/the-data-science-toolkit.html

Page 10: Data Science

Chief data officer - History and evolution

1 More recently, with the adoption of Data science|Data Science the Chief Data Officer is sometimes looked upon as the key strategy person either reporting to the Chief Strategy

Officer or serving the role of CSO in lieu of one. He has responsibility of measurement along

various business lines and consequently defining the strategy for the next growth

opportunities, product offerings, markets to pursue, competitors to look at etc. We see this

in organizations like Chartis, AllState and Fidelity

https://store.theartofservice.com/the-data-science-toolkit.html

Page 11: Data Science

Cheltenham - Economy

1 Vertex Data Science, GE-Aviation, Chelsea Building Society, Endsleigh Insurance, Archant, Nelson Thornes,

UCAS (Universities Colleges Admissions Service), Kohler Mira Ltd|Kohler Mira, Zurich Financial Services

and Spirax-Sarco Engineering all have sites in and around

Cheltenham.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 12: Data Science

Greenplum - Technology

1 Greenplum Chorus is a social network portal for data science

teams.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 13: Data Science

Greenplum - Technology

1 Greenplum Analytics Lab was a data science consultation service, renamed Pivotal Data

Labs in 2013.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 14: Data Science

Dynamic systems development method - DSDM and the DSDM Consortium: origins

1 People at that meeting all worked for blue-chip organisations such as

British Airways, American Express, Oracle and Logica (other companies

such as Data Sciences and Allied Domecq have since been absorbed

by other organisations).

https://store.theartofservice.com/the-data-science-toolkit.html

Page 15: Data Science

Keypunch - Transition to direct data entry

1 Mohawk Data Sciences subsequently produced an improved magnetic tape

encoder in 1965, which was somewhat successfully marketed as

a keypunch replacement

https://store.theartofservice.com/the-data-science-toolkit.html

Page 16: Data Science

Data science

1 Data Science need not be always for big data, however, the fact that data

is scaling up makes big data an important aspect of data science.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 17: Data Science

Data science

1 This means that data science must be practiced as a team, where across the membership of the team there is expertise and proficiency across all

the disciplines.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 18: Data Science

Data science

1 Data science techniques impact how we access data and conduct research across various domains, including the

biological sciences, medical informatics, social sciences and the

humanities.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 19: Data Science

Data science - History

1 Here, for the first time, the term data science is included in the title of the

conference (Data Science, classification, and related methods).

https://store.theartofservice.com/the-data-science-toolkit.html

Page 20: Data Science

Data science - History

1 On 10 November 1998, C.F. Jeff Wu gave his inaugural lecture entitled

Statistics = Data Science? in honor of his appointment to the H. C. Carver Collegiate Professorship in Statistics

at the University of Michigan.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 21: Data Science

Data science - History

1 Later, he presented his lecture entitled Statistics = Data Science? as the first of his 1998 P.C. Mahalanobis

Memorial Lectures. These lectures honor Prasanta Chandra

Mahalanobis, an Indian scientist and statistician and founder of the Indian

Statistical Institute.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 22: Data Science

Data science - History

1 International Statistical Review / Revue Internationale de Statistique,

21-26 In his report, Cleveland establishes six technical areas which he believed to encompass the field of

data science: multidisciplinary investigations, models and methods

for data, computing with data, pedagogy, tool evaluation, and

theory.https://store.theartofservice.com/the-data-science-toolkit.html

Page 23: Data Science

Data science - History

1 Retrieved from Japan Science and Technology Information Aggregator,

Electronic: http://www.jstage.jst.go.jp/browse/dsj/1/0/_contents Shortly thereafter, in January 2003, Columbia University

began publishing The Journal of Data Science,The Journal of Data Science

https://store.theartofservice.com/the-data-science-toolkit.html

Page 24: Data Science

Data science - Domain Specific Interests

1 Data science requires a versatile skill-set

https://store.theartofservice.com/the-data-science-toolkit.html

Page 25: Data Science

Data science - Research Areas

1 As an interdisciplinary subject, data science draws scientific inquiry from a broad range of academic subject areas, mostly related to the hard

scientist. Some areas of research are:

https://store.theartofservice.com/the-data-science-toolkit.html

Page 26: Data Science

Data science - Security Data Science

1 Security data science is data driven, meaning that new insights and value

comes directly from data.http://www.securitydatascience.

org

https://store.theartofservice.com/the-data-science-toolkit.html

Page 27: Data Science

Data science - Clinical Data Science

1 Data science has always been prominent in the field of clinical

trials

https://store.theartofservice.com/the-data-science-toolkit.html

Page 28: Data Science

Data science - Conferences

1 *ICDSE (International Conference on Data Science and Engineering), held by Department of Computer Science,

Cochin University of Science and Technology, [http://icdse.cusat.ac.in

icdse.cusat.ac.in], 2012

https://store.theartofservice.com/the-data-science-toolkit.html

Page 29: Data Science

Data science - Conferences

1 *Annual International Workshop on Dataology and Data Science, held by Research Center on Dataology and

DataScience, Fudan University, China, [http://iwdds.fudan.edu.cn/ iwdds.fudan.edu.cn/], 2010, 2011,

2012

https://store.theartofservice.com/the-data-science-toolkit.html

Page 30: Data Science

Data science - Conferences

1 * Data science workshops -[http://www.datacurry.in],

[http://www.datacurry.com], 2013

https://store.theartofservice.com/the-data-science-toolkit.html

Page 31: Data Science

Medical Image Computing

1 'Medical image computing (MIC)' is an interdisciplinary field at the

intersection of computer science, data science, electrical engineering, physics, mathematics and medicine. This field

develops computational and mathematical methods for solving

problems pertaining to medical images and their use for biomedical research

and clinical care.https://store.theartofservice.com/the-data-science-toolkit.html

Page 32: Data Science

Rediff.com - History

1 * 'Rediff Labs': Showcases Rediff's data driven journalism, data science projects that include interactive web

apps and visualisation of big data

https://store.theartofservice.com/the-data-science-toolkit.html

Page 33: Data Science

Gregory I. Piatetsky-Shapiro

1 'Gregory I. Piatetsky-Shapiro' (born 7 April 1958) is a Data Scientist, co-founder of KDD conferences and Association for

Computing Machinery|ACM SIGKDD association for Knowledge Discovery and Data Mining, and President of KDnuggets, a leading site on Business Analytics, Data Mining, and Data Science. For simplicity,

he usually abbreviates his name as 'Gregory Piatetsky'.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 34: Data Science

Gregory I. Piatetsky-Shapiro - KDnuggets

1 KDnuggets started as a directory of main areas of data mining and data science,

including

https://store.theartofservice.com/the-data-science-toolkit.html

Page 35: Data Science

Gregory I. Piatetsky-Shapiro - KDnuggets

1 Business Analytics, Data Mining, and Data Science, including interviews with many key leaders of the field.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 36: Data Science

Lab notebook - Structure

1 Liehr, Improving the Traditional Information Management in Natural

Sciences, Data Science Journal, 2009, 8, 18-26,

[http://dx.doi.org/10.2481/dsj.8.18 DOI 10.2481/dsj.8.18] Many adhere to the concept that a lab notebook should be thought of as a diary of

activities that are described in sufficient detail to allow another scientist to replicate the steps

https://store.theartofservice.com/the-data-science-toolkit.html

Page 37: Data Science

Rediff - History

1 * 'Rediff Labs': Showcases [http://labs.rediff.com/ Rediff's data

driven journalism], data science projects that include interactive web apps and visualisation of big data. Data driven Lok Sabha Elections

2014 Candidates' Ethics Score their position in the previous elections to make the right decision before you

vote!https://store.theartofservice.com/the-data-science-toolkit.html

Page 38: Data Science

Two Sigma Investments - Fund information

1 * Two Sigma Ventures focuses on venture capital investments in

companies, with a focus on companies operating in the realm of data science, machine learning, and

artificial intelligence.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 39: Data Science

Rexer's Annual Data Miner Survey

1 'Rexer Analytics’s Annual Data Miner Survey' is the largest Statistical

survey|survey of data mining, data science, and analytics professionals

in the industry

https://store.theartofservice.com/the-data-science-toolkit.html

Page 40: Data Science

Zeitschrift für Physik - Topics covered

1 * European Physical Journal Data Science: Data Science

https://store.theartofservice.com/the-data-science-toolkit.html

Page 41: Data Science

1QBit - Locations

1 1QBit is headquartered in Vancouver, British Columbia, Canada. In early 2014, 1QBit was invited to join the OneEleven data community located in Toronto, Ontario, Canada. This

second location serves as the data science and software production arm

of the organization.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 42: Data Science

Filippo Menczer - Education, Career, Service

1 He holds editorial positions for the journals EPJ Data Science and Network Science

https://store.theartofservice.com/the-data-science-toolkit.html

Page 43: Data Science

Filippo Menczer - Research

1 Menczer's research focuses on Web science, social networks, social media, social computation, Web

mining, data science, distributed and intelligent Web applications, and modeling of complex information

networks. He introduced the idea of Focused crawler|topical and adaptive

Web crawlers, a specialized and intelligent type of Web crawler.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 44: Data Science

Carnegie Mellon School of Computer Science - Professional masters

1 * Masters of Science in Computational Data Science (MCDS)[http://mcds.cs.cmu.edu/ Masters in

Computational Data Science]

https://store.theartofservice.com/the-data-science-toolkit.html

Page 45: Data Science

Virtual research environment

1 Data Science Journal, Vol

https://store.theartofservice.com/the-data-science-toolkit.html

Page 46: Data Science

Chapman University - Schmid College of Science and Technology

1 degrees in computational and data sciences

https://store.theartofservice.com/the-data-science-toolkit.html

Page 47: Data Science

Fu Foundation School of Engineering and Applied Science - Facilities

1 SEAS has secured plots for new graduate facilities and the Institute for Data Science and Engineering

https://store.theartofservice.com/the-data-science-toolkit.html

Page 48: Data Science

Fu Foundation School of Engineering and Applied Science - Specialized centers

1 * [http://idse.columbia

.edu/ Institute for Data Sciences and

Engineering]https://store.theartofservice.com/the-data-science-toolkit.html

Page 49: Data Science

Fu Foundation School of Engineering and Applied Science - Specialized centers

1 * [http://idse.columbia.edu/foundations-data-science/ Foundations of Data Science Center]

https://store.theartofservice.com/the-data-science-toolkit.html

Page 50: Data Science

Love Parade stampede - Incident

1 EPJ Data Science 1:7 (2012),

https://store.theartofservice.com/the-data-science-toolkit.html

Page 51: Data Science

The Climate Corporation - History

1 In November 2013 the company launched Climate Basic and Climate

Pro, a set of advisory tools for farmers utilizing data science to help

farmers make optimal decisions.

https://store.theartofservice.com/the-data-science-toolkit.html

Page 52: Data Science

New York City Economic Development Corporation - Applied Sciences NYC

1 The Applied Sciences NYC initiatives also includes the establishment of a campus in

Downtown Brooklyn developed by a consortium led by NYU that focuses on the challenges facing cities, and a new institute

for data sciences at Columbia University.[http://www.nycedc.com/sites/default/files/filemanager/Projects/Applied_Sciences_NYC/

AppliedSciencesInfographic.pdf Applied Sciences NYC infographic]

https://store.theartofservice.com/the-data-science-toolkit.html

Page 53: Data Science

Eudaemons - History

1 As a science experiment, the group's objective was accomplished: to prove that there was a way of statistically

predicting where a ball would fall in a roulette wheel given some input

data. This outcome precursed data science and embodied the infancy of

predictive analytics.

https://store.theartofservice.com/the-data-science-toolkit.html