principles for data citation micah altman, institute for quantitative social science, harvard...

25
Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and the Scholarly Record, the Changing Landscape August 23-24, 2011

Upload: lillian-simmons

Post on 27-Mar-2015

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Principles for Data Citation

Micah Altman, Institute for Quantitative Social Science, Harvard University

Prepared for DataCite's Summer Meeting: Data and the Scholarly Record, the

Changing LandscapeAugust 23-24, 2011

Page 2: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Collaborators*

Principles for Data Citation

Leonid Andreev, Ed Bachman, Adam Buchbinder, Ken Bollen, Bryan Beecher, Steve Burling, Kevin Condon, Jonathan Crabtree, Merce Crosas, Gary King, Patrick King, Tom Lipkis, Freeman Lo, Jared Lyle, Marc Maynard, Nancy McGovern, Lois Timms-Ferrarra, Akio Sone, Bob Treacy

Research SupportThanks to the Library of Congress (PA#NDP03-1), the

National Science Foundation (DMS-0835500, SES 0112072), IMLS (LG-05-09-0041-09), the Harvard University Library, the Institute for Quantitative Social Science, the Harvard-MIT Data Center, and the Murray Research Archive.

* And co-conspirators

Page 3: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Related Work

Principles for Data Citation

M. Altman,2008, "A Fingerprint Method for Verification of Scientific Data" in, Advances in Systems, Computing Sciences and Software Engineering, (Proceedings of the International Conference on Systems, Computing Sciences and Software Engineering 2007) , Springer Verlag.

M. Altman and G. King. 2007. “A Proposed Standard for the Scholarly Citation of Quantitative Data”, D-Lib, 13, 3/4 (March/April).

G. King, 2007, " An Introduction to the Dataverse Network as an Infrastructure for Data Sharing", Sociological Methods and Research, Vol. 32, No. 2, pp. 173-199

Page 4: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Principles for Data Citation

(19 Ways of Looking at Data)

^Citations

AKA

Page 5: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Principles for Data Citation

Com

mon

Prin

cipl

es

Page 6: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Thanks to 37 Participants

Principles for Data Citation

Motivations Elements

Citing Data Virtual Archives

Page 7: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Principles for Data Citation

What are we talking about?

Page 8: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Workflow

Page 9: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Workflow

Page 10: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Workflow

Page 11: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Principles for Data Citation

- Separatescientific principles, use cases, requirements-Distinguish syntax from presentation-Design for ecosystem & lifecycle-Incremental value for incremental effort

Design Principles

Page 12: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Principles for Data Citation

Theory

Page 13: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Principles for Data Citation

Theory +

Data citations should be first class objects for publication -- appear with citation; should be as easy to cite as other works

At minimum, all data necessary to understand assess extend conclusions in scholarly work should be cited

Citations should persist and enable access to fixed version of data at least as long as citing work

Data citation should support unambiguous attribution of credit to all contributors, possibly through the citation ecosystem

Page 14: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Theory + Practice

Principles for Data Citation

Page 15: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Linking Data to Publications through Citation and Virtual Archives

Use Cases

Page 16: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Principles for Data Citation

Use Cases (details)Operational Constraints?

-Syntax-Interoperability-Technical contexts of use

Page 17: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Principles for Data Citation

Operational Requirements?

Syntax Metadata Interoperability Core technical contexts of use

Page 18: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Actors

Page 19: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Actors

Page 20: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Actors

Page 21: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Actors

Page 22: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Use Cases+ Requirements + Scientific Principles

Data citations should be first class objects for publication -- appear in references; be as easy to reference as other works

All data necessary to assess conclusions in scholarly work should be cited Citations should persist and enable access to fixed version of data,

for as long as the citing works exist Citation should support unambiguous attribution of credit to all contributors,

(possibly through the citation ecosystem of metadata, indices, etc.) Separate presentation and content

• The article is (only) a summary of the research• Science requires reproducibility• Scientific disciplines require a common evidence base

Scientific Principles

Requirements

Attribution – legal & scientific Persistence – persistence of reference; identify responsible curators Access – short & long term; machine & human Discovery – locate instances; discover derivative, parent, and related works Provenance – associate scientific claim and evidence; verify fixity of evidence

Key Use Cases

Page 23: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Principles for Data Citation

Simple P

Page 24: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Principles for Data Citation

- Semantic:Persistent ID, Author, Title, Version (or date)

- Presentation:Any styleGrouped other referencesActionable in context

- PolicyIf its scientific evidence, cite itOffer credit to all contributors

Simple Proposal

Page 25: Principles for Data Citation Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for DataCite's Summer Meeting: Data and

Contact Us

Principles for Data Citation

Micah Altman

maltman.hmdc.harvard.edu

The Dataverse Network ®

thedata.org