nal-institutional repository: a case study csir metadata harvester i.r.n. goudar head, icast, nal...

34
NAL-Institutional Repository: A Case Study CSIR Metadata Harvester I.R.N. Goudar Head, ICAST, NAL [email protected] National Symposium on Open Access and Building Institutional Repository National Aerospace Laboratories Bangalore- 560 017 21-23 Jan 2009

Upload: lora-stevenson

Post on 23-Dec-2015

218 views

Category:

Documents


0 download

TRANSCRIPT

NAL-Institutional Repository: A Case StudyCSIR Metadata Harvester

I.R.N. GoudarHead, ICAST, [email protected]

National Symposium on Open Access and Building Institutional Repository

National Aerospace LaboratoriesBangalore- 560 017

21-23 Jan 2009

NAL-IR

• Started in 2003 using GSDL

• Adopted E-Prints in 2005

• Plans to Switch over to DSpace

• Presently about 3000 Documents

IR Download Statistics

6000-10000/PM from more than 120 Countries• USA 40%• India 25%• UK 10%• Canada 6%• Japan 5%• China 3%• Germany 3%

• France 3%

Metadata Harvesting

• Harvesting – in the OAI context, harvesting refers

specifically to the gathering together of metadata from a number of distributed repositories into a combined data store

• OAI-PMH (OAI Protocol for Metadata Harvesting) – OAI-PMH is a harvesting protocol for

sharing metadata between services.

• Data Provider (Ex. Institutional repository)

•Maintain repository

•Expose metadata according to a metadata standard (e.g. DC)

•Register with OAI

•Service provider

•Register with OAI

•Extract metadata from registered repositories (‘harvest’)

•Provide services (e.g. central index)

Interoperability through OAI-PMH Protocol*

* http://www.openarchives.org/

IR-1 IR-2

Harvesting Software

• To harvest metadata from the OAI-compliant repositories (data providers), a harvesting software is needed– PKP Harvester from SFU

• http://pkp.sfu.ca/harvester_download

– Arc from ODU• http://oaiarc.sourceforge.net/

CSIR Knowledge Harvester

• Set up at ICAST, NAL

• PKP Harvester

• Presently Covers 4 CSIR Labs

• About 5500 documents

Harvesting CSIR IRs

Tech Reports Pre-prints Journal Articles

Access & Dissemination

NAL NCL NIO NPL SERC Etc

Deposit

Metadata +Full Pub)

Service Provider

ICAST, NAL

Presentation Thesis, etc

Digital Repository

Local Intranetaccess

Remote Internet access

MetadataOAI-PMH

Thank you

?

EPrints and DSpace Widely used IR software

Platform

– EPrints: Unix/ Linux/ Perl/ Apache/ MySQL/

XML/ HTML/

– DSpace: Unix/ Linux/ Java/ Tomcat or

Apache/ XML/ HTML/ Ant/ PostGreSQL

Imply software knowledge required for installing, configuring, and maintaining archives developed using these packages.

OAI-PMH: Structure Model

Se

rvic

e P

rovi

der

e-print

Da

ta

Pro

vid

er e-prints

e-print

Da

ta

Pro

vid

er Images

e-print

Da

ta

Pro

vid

er

OPAC

e-print

Da

ta

Pro

vid

er Museum

e-print

Da

ta

Pro

vid

er Archive

Requests:

Identify

ListMetadataformats

ListSets

ListIdentifiers

ListRecords

GetRecord

Responses:

General information

Metadata formats

Set structure

Record identifier

Metadata

Da

ta

Pro

vid

er Harvester

Repository

Repository

Repository

Repository

Repository

Some Useful References

• http://www.openarchives.org/• To register as data provider

– http://www.openarchives.org/pmh/

• For OAI-related tools– http://www.openarchives.org/pmh/tools/

• OAI Repository Explorer for interactive exploration and validation of OAI repositories– http://re.cs.uct.ac.za/