agenda why discuss digital libraries what is a digital library history meta-data fedora nsdl d space

Post on 14-Jan-2016

215 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Agenda

• Why discuss Digital Libraries• What is a digital Library• History• Meta-data• FEDORA• NSDL• D Space

What is a Digital Library?• There are several definitions of Digital

libraries

• Borgman identifies two major aspects emphasized in these definitions – DL Researchers from Computer Science

focus on DLs as content for user communities and therefore emphasize the enabling technologies

– Library professionals appear to emphasize DLs as services / institutions

DL:Evolution• Digital Library Initiative I

• Beginning in 1992-93, The US Govt. agencies, NSF ,DARPA, and others initiated the digital library efforts in the US with massive funding resulting in 6 major DL initiatives

• Digital Library Initiative II ( 1998-2002)

• E-Lib programme in the UK

Roots of Digital Libraries• DLs have evolved from and are based on the

techniques and principles developed by the early IR researchers such as-– Calvin Mooers ( 1950)– Perry ( 1951)– Taube (1955)

• Information Retrieval Systems of 1980s

• Salton’s Automatic indexing and search systems (1960s)

• Hypertext systems of 1980s

Meta-data• Data about data

• Meta-data standards• Dublin Core• Open Archive Initiative (OAI)

• Meta-data collection and harvesting• Periodic• Federated

What is Meta-data

• Record of the existence of knowledge

(i.e. libraries)

• Linkable to each other

• Governed by standards to ensure compatibility

Dublin Meta-data Core Element Set

• (A.K.A) Dublin Core

• “Born” in Dublin Ohio in 1995 funded by Online Computer Library Center (OCLC) and the National Center for Supercomputing Applications (NCSA)

• The Main standard for meta-data

Dublin Core Elements

• 16 elements make up the Dublin core, used to identify data for easy searches 14 listed below

Subject IdentifierTitle

LanguageAuthor SourcePublisher RelationOther agent RightsDate LeaderObject typeForm

Open Archive Initiative (OAI)

• Harvesting protocol – most used of all the harvesting protocols

• requires Dublin Core

• Runs on XML

Collection and harvesting methods

Periodic

• Fixed interval collection

(e.g. every day)

• Ability to rate relevance when using search engines

Federated

• “on the Fly”

• Smaller amounts of data constantly linked

• Becomes less effective with larger stores of data

Issues with federated harvesting

• 1985

USGS - United States Geological Survey

FGDC - Federal Geographic Data Community• Attempted to make a large federated meta-

data that became much slower as the amount of data grew. Resulting in less efficient searches

Current trends

• Dublin Core more in use in Europe than the United States

• Periodic harvesting is preferred

• Among the first industries to use meta-data and digital libraries are publishers, and other industries that work closely with libraries

Fedora

The problem

• Institutions and organizations face increasing demands to deliver rich digital content

• Delivery is only one aspect of a suite of content management tasks. Content needs to be created, ingested, and stored

• Without standardization, the costs of management tasks become prohibitive

• Content managers need a flexible content repository system that allows them to

uniformly store, manage, and deliver all their existing content and that will accommodate

new forms that will arise in the future.

Costs of not finding information

• IDC studies, found that knowledge workers spend from 15% to 35% of their time searching for information

• 40% of corporate users reported that they can not find the information they need to do their jobs on their intranets

• 90% of the time that knowledge workers spend is spent in recreating information that already exists

• IDC, estimates $5,000 cost per worker per year

• For a 1,000 employee company, the cost of reworking is $12 million a year (15% of time spent in duplicating existing information)

• First digital object repository management system based on the Flexible Extensible Digital Object and Repository Architecture (Fedora).

• Fedora is an open source software.

• Different to the Red Hat Fedora Project

• Powerful digital object model that supports multiple views of each digital object and the relationships among digital objects

• Digital objects can encapsulate locally-managed content or make reference to remote content.

• Dynamic views are possible • Digital objects exist within a repository architecture

that supports a variety of management functions. • All functions of Fedora are exposed as web services

• Fedora an attractive solution in a variety of domains

• Applications: library collections management, multimedia authoring systems, archival repositories, institutional repositories, and digital libraries for education

National Science Digital Library (NSDL)

• Created by the National Science Foundation in 2000.

• 192 projects awarded since start.

• ~50 directly funded by NSDL.

• 400 unique collections.

• Goal of being a free service which directs users to exemplary resources for education.

NSDL

• Provides an organized point of access to STEM content (Science, Technology, Engineering, Mathematics).

• Supports Dublin Core standard.

• Most commonly, XML is used to store and encode documents for NSDL collections.

• Access to most resources discovered through the NSDL is free.

• NSDL is a collection of other digital library collections.– In essence, the NSDL is more of a digital

card-catalog spanning many different collections.

• Common selection mechanisms include: – Peer review boards– Content creation committees– User recommendations

• Advantage over other search methods in that it has a selection process and guaranteed quality results.

DSpace

• Developed by HP and MIT in 2002.• Open source digital repository system.

– Provides digital archiving.

• Available under the BSD open source license.• Free to download and use.• Mission to: “Collect, manage, preserve and

redistribute digital content.”

• Can store many different file types including: Books, Theses, Audio/Video and programs.

• Currently in use at many institutions world-wide including: Cornell, OSU, Vanderbilt and many others.

• DSpace software is maintained by the DSpace Federation– No formal membership structure– Adoption is growing

• DSpace focuses on making content accessible and preserving it over time.

DSpace Data Model

top related