hathitrust: possibilities metadata working group cornell university library march 21, 2014
DESCRIPTION
HathiTrust is a digital library. A continuously growing digital library, currently home of 11M+ books Statistics and Visualizations (http://www.hathitrust.org/statistics_visualizations)http://www.hathitrust.org/statistics_visualizations Almost 500 terabytes 3.7M volumes (~33% of total) in the public domain All volumes indexed in full, and searchable in fullTRANSCRIPT
HathiTrust: Possibilities
Metadata Working GroupCornell University LibraryMarch 21, 2014
HathiTrust is a consortium.
A consortium of over 80 libraries http://www.hathitrust.org/partnership
http://www.hathitrust.org/governanceBoard of Governors
Program Steering CommitteeWorking Groups
Committees
HathiTrust is a digital library.
A continuously growing digital library, currently home of 11M+ books
Statistics and Visualizations (http://www.hathitrust.org/statistics_visualizations )
Almost 500 terabytes3.7M volumes (~33% of total) in the public domainAll volumes indexed in full, and searchable in full
HathiTrust is a digital library.
Cornell patrons can use a Cornell NetID to login
Create Collections (public or private)Download PDFs of any item available in full text Obtain “Enhanced Access” if a CUL affiliate has a
certified print disability (http://www.library.cornell.edu/svcs/disability#access)
HathiTrust is a preservation repository.
HathiTrust is TRAC Certified.Every book, and every page of every book, has a persistent identifier.
Cornell University Library deposits books digitized through the Google partnership.
HathiTrust: getting info out
information =metadata
info about resources
+data
resources themselves
http://www.hathitrust.org/data
APIs: BibAPI
returns bibliographic, rights, and volume information when given a single
or multiple standard identifiers:
OCLC number, LCCN, ISSN, ISBN, HathiTrust Volume ID, HathiTrust record number
http://www.hathitrust.org/bib_api
APIs: DataAPI
retrieves content (page images, OCR, METS and or whole volume
packages) and bibliographic info
http://www.hathitrust.org/data_api
OAI Harvesting
OAI feed (MARC21 and Dublin Core) of public domain materials
http://www.hathitrust.org/data
Hathifiles
tab-delimited files identifying the contents of HathiTrust repository
Posted daily: http://www.hathitrust.org/hathifiles
Described: http://www.hathitrust.org/hathifiles_description