and the
Illinois Harvest Portal
A Presentation to the UIUC Library Faculty September 20, 2006
Betsy Kruger and Tim Cole
UIUC Digitization Efforts Get Big Boost!!
TOTAL = $900,000 for FY2007 (must be spent by June 30, 2007)
$200,000 + $200,000 + $500,000
• Paula, in consultation with ULs from Chicago and Springfield, requested funding from Steve Rugg (UI Comptroller) to help libraries move into larger digitization projects.
• Provost’s Office agreed to supply matching funds.
• Rep. Naomi Jakobsson expressed interest in obtaining state funding for UI projects. Paula sent a short proposal regarding our interest in a mass digitization project with the Open Content Alliance.
Mass Digitization Working Group
• Tim Cole
• Nuala Koetter
• Betsy Kruger, Chair
• Michael Norman
• Chris Prom
• Beth Sandore
• Sarah Shreeves
• Mary Stuart
• Tom Teper
• David Vess
Working Group’s Goals
• Ensure funds are spent by June 30! Successfully and purposefully!
• Coordinate Library’s participation in the Open Content Alliance
• Explore and begin developing integrated access to UIUC owned/created digital content via a web portal
• Develop selection criteria for digitization
• Document costs related to mass digitization
• Develop in-house expertise
• Attract future funding!
Budget Breakdown
DIGITIZATION SERVICES
Open Content Alliance scanners/staff/services $200,000
Illinois-related content digitization (non-OCA projects) $218,000
OTHER ALLOCATIONS
Illinois Harvest Portal development staff $125,000
Data storage $80,000
Metadata librarian $42,000
Oak Street 3rd floor upgrades $65,000
Material preparation (wages) $80,000
Workstations $6,000
Contingency $84,000
TOTAL $900,000
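The line items above can be checked against the stated total with a few lines of Python. This is only an illustrative sketch: the dollar amounts are copied from the slide, while the variable names and the percentage-share column are additions made here for illustration.

```python
# Line items as listed on the Budget Breakdown slide (US dollars).
digitization_services = {
    "Open Content Alliance scanners/staff/services": 200_000,
    "Illinois-related content digitization (non-OCA projects)": 218_000,
}
other_allocations = {
    "Illinois Harvest Portal development staff": 125_000,
    "Data storage": 80_000,
    "Metadata librarian": 42_000,
    "Oak Street 3rd floor upgrades": 65_000,
    "Material preparation (wages)": 80_000,
    "Workstations": 6_000,
    "Contingency": 84_000,
}

digitization_total = sum(digitization_services.values())
other_total = sum(other_allocations.values())
grand_total = digitization_total + other_total

# Print each line item with its share of the grand total.
for label, amount in {**digitization_services, **other_allocations}.items():
    share = amount / grand_total
    print(f"{label:<58} ${amount:>9,} {share:6.1%}")

print(f"{'Digitization services subtotal':<58} ${digitization_total:>9,}")
print(f"{'Other allocations subtotal':<58} ${other_total:>9,}")
print(f"{'TOTAL':<58} ${grand_total:>9,}")
```

The two subtotals come to $418,000 and $482,000, which together match the $900,000 total on the slide.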
In November 2005, Paula Kaufman asked Karen Schmidt to pull together a small group of Library faculty to recommend whether or not the Library should join the Open Content Alliance, a program of the Internet Archive. We said YES!
• Karen Schmidt
• Beth Sandore
• Nuala Koetter
• Betsy Kruger
• Mary Stuart
• Tom Teper
Internet Archive – a nonprofit organization founded by Brewster Kahle in 1996 to build an “Internet library” offering permanent access for researchers, historians, and scholars to historical collections that exist in digital format. http://www.archive.org/index.php
The Open Content Alliance – a program of the Internet Archive started in early 2005
a group of cultural, technology, nonprofit, and governmental organizations from around the world that will help build a permanent archive of multilingual digitized text and multimedia content. http://www.opencontentalliance.org/
OCA Goal
To bring digital and newly digitized material online under principles of openness.
OCA Principles
• Contributed content is free to all for reading, viewing, listening to, downloading, sharing, crawling, indexing
• Rehost at discretion of contributor
• Open for research and computation
• Services can be built by both commercial and non-commercial parties (e.g., navigation services, print-on-demand, etc.)
A Few of the 60+ OCA Participants
• University of California Libraries
• University of Toronto Library
• Johns Hopkins University Libraries
• UNC Chapel Hill
• National Archives (United Kingdom)
• National Library of Australia
• Yahoo
• RLG
• MSN
• Microsoft
Contributions can be:
• Content
• Facilities
• Services
• Tools
• Funding
Mass Digitization at UIUC
OCA’s Responsibilities:
• Install two “Scribe” scanning systems at Oak Street
• Hire and train staff
• Keep track of our materials
• Fetch descriptive metadata from our OPAC via a Z39.50 connection
• Digitization: creating content files (archival and access copies, PDFs) and structural/administrative metadata
• OCR
• Quality control measures
• Provide access to digital content via the Internet Archive (IA) website
• Long term management of content on IA website
SCRIBE Scanning System
• Non-destructive: Books are not disbound for scanning
• Utilizes digital cameras rather than flatbed scanners
• Book is held face up in a cradle, open at a 90 degree angle, as operator turns pages (snore…)
• Pages held flat by a glass platen that is raised and lowered
• Scanning cost is 10¢ per page
• Up to 500 pages per hour
• Our production will be around 200 books per week for first year.
Mass Digitization at UIUC
UIUC Responsibilities:
• Infrastructure improvements to Oak Street 3rd floor
• Selection of materials for digitization
• Daily or weekly retrieval of books to be sent for scanning
• Charging out materials, delivering materials to scanning center, returning material to shelves post-scanning
• Validation and ingestion of metadata and content files for preservation storage
• Linking from Voyager record to digital content files
• Possibly some level of quality control beyond that performed by OCA
Your Input Needed on Selection for OCA!
• We anticipate digitizing 8,500 – 9,500 volumes this first year.
• Must be in public domain or UIUC must own rights (e.g., some microfilm)
• Uniqueness—We want to avoid duplication with the Michigan Google Project.
• “Collections” vs. hodge-podge selection
• Faculty support/interest and curricular tie-ins particularly attractive.
• We need your suggestions NOW!
• Suggestions accompanied by a little sweat equity are particularly attractive!
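As a rough planning aid, the figures quoted on the slides (10¢ per page, about 200 books per week, 8,500 to 9,500 volumes in the first year) can be combined in a short back-of-the-envelope calculation. The sketch below rests on one loud assumption that the slides do not state: that the entire $200,000 OCA line item goes to per-page scanning charges. The function and constant names are hypothetical.

```python
# Figures taken from the slides.
COST_PER_PAGE_CENTS = 10             # "Scanning cost is 10 cents per page"
BOOKS_PER_WEEK = 200                 # "around 200 books per week for first year"
FIRST_YEAR_VOLUMES = (8_500, 9_500)  # anticipated first-year range

# ASSUMPTION (not stated in the slides): the whole OCA line item
# pays for per-page scanning charges and nothing else.
OCA_LINE_ITEM_DOLLARS = 200_000


def weeks_needed(volumes, books_per_week=BOOKS_PER_WEEK):
    """Weeks of scanning required to finish `volumes` at a steady weekly rate."""
    return volumes / books_per_week


def implied_pages_per_volume(volumes,
                             budget_dollars=OCA_LINE_ITEM_DOLLARS,
                             cost_cents=COST_PER_PAGE_CENTS):
    """Average page count per volume that the assumed budget could cover."""
    affordable_pages = budget_dollars * 100 // cost_cents
    return affordable_pages / volumes


for volumes in FIRST_YEAR_VOLUMES:
    print(f"{volumes:,} volumes: "
          f"{weeks_needed(volumes):.1f} weeks at {BOOKS_PER_WEEK} books/week, "
          f"about {implied_pages_per_volume(volumes):.0f} pages per volume "
          f"within ${OCA_LINE_ITEM_DOLLARS:,}")
```

Under that assumption, the anticipated range corresponds to roughly 42 to 48 weeks of scanning, which is consistent with a single year of production at the quoted weekly rate.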
Illinois Harvest Portal
“A website combined with search, aggregation, and discovery services that will provide organized and thematic access to digitized and born-digital collections of public interest from the University of Illinois.”
• In conjunction with the Illinois Harvest portal project, we will also digitize numerous smaller collections, through outsourcing and some in-house digitization
• Will involve various formats (books, maps, audio, video)
• Most will focus on content about Illinois.
• We welcome suggestions of additional projects.
Illinois Projects Under Consideration
• Illios
• Bronze Tablets
• Illinois counties surveys/maps
• Illinois Chemist
• WILL audio/video content
• INHS Technical Reports
• UI Historic Built Environment
• Engineering Experiment Station Bulletins
• Chicago Foreign Language Press Survey
• University of Illinois Press
• UI Board of Trustees proceedings (1927-)
• Illinois county atlases
• Illinois newspapers
• Library speeches and guest lecturers
Visit the MassDigiWiki!
http://massdigiwiki.pbwiki.com/FrontPage
• Meeting agendas
• Minutes
• Project documents