cornell’s project harvest cni fall 2001 task force meeting anne r. kenney and nancy y. mcgovern

26
Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Upload: dayna-sparks

Post on 27-Dec-2015

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Cornell’s Project Harvest

CNI Fall 2001 Task Force Meeting

Anne R. Kenney and Nancy Y. McGovern

Page 2: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Project Harvest Overview• Subject-based approach: agriculture

– National Preservation Plan– USAIN– Mann Library

• Core Historical Literature

• TEEAL

• USDA

• 75% of core journals now available in electronic form

Page 3: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Focus of Planning Year

• Investigating conditions under which publishers willing to participate in the development of an Subject-Based Digital Archives (SBDA)

• Two pronged iterative cycle: – Explore (potential of SBDA, business model,

broader preservation matrix)– Build (using agriculture as pragmatic

application)

Page 4: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

PBDA

Page 5: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

SBDA

Page 6: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Intersection of Digital Archives

Format-based

Page 7: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern
Page 8: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

USAIN Survey

• Access– 45% indicated need for both print and electronic– 55% indicated e-journal already substituted for

print; – 84% would cancel print if reliable archives built– JSTOR study – 78% of faculty think hard copy

should be retained even if reliable digital archives

Page 9: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

USAIN Survey• Observed loss in e-journals:

• 45% don’t know• 22% yes noted difference• 22% no, no difference

• What to preserve (priority order):1. Preserve content plus journal “look and feel” plus

publisher functionality2. Preserve content plus journal “look and feel”

• How to preserve:• Over 90% rejected single solution; prefer multiple

custodians or 3rd party

Page 10: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Sept. 6 Publishers’ Meeting

• American Dairy Science• Academic/Elsevier• American Phytopathological Society• BioOne• CABI• NRC-Canada • Wiley • NLA and USAIN representation

Page 11: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

What’s the Publisher Incentive to Archive?

• Protect assets, continuing value of material as it ages

• Low additional overhead

• Satisfy customers

• Risk tolerance; sustainable loss

• As calling card for or bi-product of services

Page 12: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Meeting Results

• All publishers intend to establish archives

• Shift from content currency to database development

• Publishers see revenue stream in retrospective holdings

• Publishers less concerned than librarians about “artifactual” archiving

Page 13: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Meeting Results• Differing perceptions around who should

do digital preservation • Librarians want trusted third-party

archiving• Publishers insufficiently aware that others

don’t trust them to safeguard materials and insufficiently aware of what it takes to archive

• Distrust of government (competition)

Page 14: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Meeting Results

• Publishers not enthusiastic about “lit” archives—some would consider it if revenue returned to publisher

• Convergence in formats• Reluctance to force authors to conform • Unwilling to share proprietary publisher DTD• Willing to consider archival DTD as another

output

Page 15: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Trigger Events

• None acknowledged by publishers

• Technology watersheds:– Retrofitting legacy digital files – When paper no longer represents access and

preservation alternative for electronic

Page 16: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

SBDA triggers

• Different subject domains have different half-lives

• When common interests outweigh individual interests

• Stakeholder pressure: when detrimental not to participate

Page 17: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Access and Funding

• Publishers and librarians went into the meeting presuming different things

• Publishers differed on access issues

• Librarians asserted that publishers would have to finance dark archives

Page 18: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

SBDA Distinguishes Between Metadata and Data

• Dark metadata/dark data

• Light metadata/light data

• Light metadata/dark data

• Light metadata/no data

Multiple options for different publishers and audiences

Page 19: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

SBDA Hybrid Model

• Ultimate goal is lightness• Comprehensiveness and buy-in trumps lightness• Commonality over distinctiveness emphasized• Hybrid model enables combinations of light to

dark metadata and data• Access to metadata/data will change over time

and in response to particular circumstances• Offers win/win possibilities

Page 20: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Possible Sustainability Models

• Preservation surcharge on subscription

• Preservation endowment

• Bartered access privileges for preservation

• Business insurance policy model

• Government support

Page 21: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

• Preservation pledge drives

Possible Sustainability Models

Page 22: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Possible Sustainability Models

• Develop new markets

• Harness the free riders• Charge for services, not content and

archiving• Build value-adds on the SBDA

Page 23: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Next Steps

• Developing subject domain profile

• Surveying agricultural publishers to determine level of cooperation in SBDA

• Evaluating existing architectural models

• Writing CLIR report on the significance of the SBDA

Page 24: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

Subject-based Profile• Who are the stakeholders? How many publishers?

Research demographics of new user groups? • How big is the field? How structured and defined

is it? What’s important? Why? Change driven by discipline and by technology

• How standardized is the literature? (xml, etc)• How complex/fixed is it? (database, virtual)• Who owns rights for re-use? Assessment of

economic, first-use, citations, second use, technology

Page 25: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

How Willing to Cooperate?

• Pre- and post-competitive collaboration• Standardized, normalized, and limited

number of formats• Preservation from conception

(requirements of authors; shut off point for non cooperation)

• Archival DTD• Preservation metadata

Page 26: Cornell’s Project Harvest CNI Fall 2001 Task Force Meeting Anne R. Kenney and Nancy Y. McGovern

How Willing to Cooperate?

• Self certification/ external certification

• Light (and common) metadata, move toward light data (monitoring with scheduling)

• Economy of scale

• Willing to financially support the effort