hathitrust: possibilities metadata working group cornell university library march 21, 2014

10
HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014

Upload: jacob-york

Post on 18-Jan-2018

212 views

Category:

Documents


0 download

DESCRIPTION

HathiTrust is a digital library. A continuously growing digital library, currently home of 11M+ books Statistics and Visualizations (http://www.hathitrust.org/statistics_visualizations)http://www.hathitrust.org/statistics_visualizations  Almost 500 terabytes  3.7M volumes (~33% of total) in the public domain  All volumes indexed in full, and searchable in full

TRANSCRIPT

Page 1: HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014

HathiTrust: Possibilities

Metadata Working GroupCornell University LibraryMarch 21, 2014

Page 2: HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014

HathiTrust is a consortium.

A consortium of over 80 libraries http://www.hathitrust.org/partnership

http://www.hathitrust.org/governanceBoard of Governors

Program Steering CommitteeWorking Groups

Committees

Page 3: HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014

HathiTrust is a digital library.

A continuously growing digital library, currently home of 11M+ books

Statistics and Visualizations (http://www.hathitrust.org/statistics_visualizations )

Almost 500 terabytes3.7M volumes (~33% of total) in the public domainAll volumes indexed in full, and searchable in full

Page 4: HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014

HathiTrust is a digital library.

Cornell patrons can use a Cornell NetID to login

Create Collections (public or private)Download PDFs of any item available in full text Obtain “Enhanced Access” if a CUL affiliate has a

certified print disability (http://www.library.cornell.edu/svcs/disability#access)

Page 5: HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014

HathiTrust is a preservation repository.

HathiTrust is TRAC Certified.Every book, and every page of every book, has a persistent identifier.

Cornell University Library deposits books digitized through the Google partnership.

Page 6: HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014

HathiTrust: getting info out

information =metadata

info about resources

+data

resources themselves

http://www.hathitrust.org/data

Page 7: HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014

APIs: BibAPI

returns bibliographic, rights, and volume information when given a single

or multiple standard identifiers:

OCLC number, LCCN, ISSN, ISBN, HathiTrust Volume ID, HathiTrust record number

http://www.hathitrust.org/bib_api

Page 8: HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014

APIs: DataAPI

retrieves content (page images, OCR, METS and or whole volume

packages) and bibliographic info

http://www.hathitrust.org/data_api

Page 9: HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014

OAI Harvesting

OAI feed (MARC21 and Dublin Core) of public domain materials

http://www.hathitrust.org/data

Page 10: HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014

Hathifiles

tab-delimited files identifying the contents of HathiTrust repository

Posted daily: http://www.hathitrust.org/hathifiles

Described: http://www.hathitrust.org/hathifiles_description