National Workshop on Institutional Digital Repository: Shibabroto Banerjee, Department of Computer Science and Engineering, IIT Kharagpur

Post on 22-Dec-2015

Page 1: National Workshop on Institutional Digital Repository

Shibabroto Banerjee
Department of Computer Science and Engineering
IIT Kharagpur

Page 2: National Workshop on Institutional Digital Repository

• Create Users and Groups in DSpace
• Authentication Methods
• Access Authorization
• Usage Statistics

Page 3: Community & Collection (Data Model)

[Diagram: Community → Sub-Community → Collection → Item]

• A Community can contain Sub-Communities and Collections.
• A Collection contains Items.
• An Item consists of Metadata, Bitstreams, and a License.
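The hierarchy above can be sketched as nested containers. This is a minimal illustration in Python, not DSpace's own classes; every class name, field name, and sample value below is an assumption chosen to mirror the slide.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Bitstream:
    name: str        # file name of the uploaded content
    size_bytes: int


@dataclass
class Item:
    metadata: Dict[str, str]                              # e.g. {"dc.title": "..."}
    bitstreams: List[Bitstream] = field(default_factory=list)
    license_text: str = ""


@dataclass
class Collection:
    name: str
    items: List[Item] = field(default_factory=list)


@dataclass
class Community:
    name: str
    sub_communities: List["Community"] = field(default_factory=list)
    collections: List[Collection] = field(default_factory=list)

    def count_items(self) -> int:
        """Count items in this community, including all sub-communities."""
        own = sum(len(c.items) for c in self.collections)
        return own + sum(s.count_items() for s in self.sub_communities)


# Hypothetical example data
thesis = Item({"dc.title": "Sample thesis"}, [Bitstream("thesis.pdf", 1024)], "license")
cse = Community("CSE", collections=[Collection("Theses", [thesis])])
root = Community("Institute", sub_communities=[cse],
                 collections=[Collection("Reports", [])])
print(root.count_items())  # prints 1
```

The recursive count shows why the hierarchy matters: anything computed for a community has to include its sub-communities as well.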

Page 4: Creating Community and Collection

• Log in as an administrator
• Click Communities & Collections (under Browse)
• Click Create Community (under Context)
• Give the community a name and, optionally, add a short description and introductory text
• Upload a logo file (optional)
• Finish/Return

Page 5: Create Users

A DSpace user is called an E-person. An E-person can be created in two ways.

Command Prompt

[dspace]/bin/dspace user --add --email <email address of user> -g <Firstname> -s <Surname> --password <Password for the user>

XMLUI / JSPUI Interface

Select “People” (under Administrative – Access Control). Click “Create a new e-person”. Add the new person’s name and contact information. Click “Create E-person”.
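When many users have to be created, the command-line form can be assembled from a script. The sketch below only builds the argument list, using the options shown on the slide; it does not run anything. The install directory and the user details are hypothetical.

```python
from typing import List


def build_add_user_command(dspace_dir: str, email: str, first_name: str,
                           surname: str, password: str) -> List[str]:
    """Return the argument list for the 'dspace user --add' call shown above."""
    return [
        f"{dspace_dir}/bin/dspace", "user", "--add",
        "--email", email,
        "-g", first_name,
        "-s", surname,
        "--password", password,
    ]


# Hypothetical install directory and user details
cmd = build_add_user_command("/dspace", "jane@example.org", "Jane", "Doe", "s3cret")
print(" ".join(cmd))
```

Keeping the arguments as a list (rather than one string) means the list could be handed to `subprocess.run` without shell quoting problems for names that contain spaces.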

Page 6: Create Groups

• Choose “Groups” (under Administration – Access Controls)
• Select “Create a new Group”
• Give the group a name
• Use “Search members” to find the e-people to add to the new group
• Click “Add” next to the e-people you want to add to the group
• Click “Save”

Page 7: Groups and Policies

Default groups
• Administrator: can do anything in the site.
• Anonymous: the group that contains all users.

Policies by object type
• Collection: ADD/REMOVE, DEFAULT_ITEM_READ, DEFAULT_BITSTREAM_READ, COLLECTION_ADMIN
• Item: ADD/REMOVE, READ, WRITE
• Bundle: ADD/REMOVE, READ, WRITE
• Bitstream: READ, WRITE
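How groups and policies combine can be illustrated with a small lookup. This is a sketch under stated assumptions, not DSpace's authorization code: the object identifiers, the "Submitters" and "Editors" groups, and the policy table are all hypothetical.

```python
from typing import Dict, Set, Tuple

ADMIN_GROUP = "Administrator"
ANONYMOUS_GROUP = "Anonymous"

# (object, action) -> groups allowed to perform the action (hypothetical data)
policies: Dict[Tuple[str, str], Set[str]] = {
    ("collection:theses", "ADD"): {"Submitters"},
    ("item:123", "READ"): {ANONYMOUS_GROUP},
    ("item:123", "WRITE"): {"Editors"},
}


def is_authorized(user_groups: Set[str], obj: str, action: str) -> bool:
    """Check whether a user with the given group memberships may act on obj."""
    if ADMIN_GROUP in user_groups:
        return True  # administrators can do anything in the site
    # Every user, logged in or not, is treated as a member of Anonymous.
    effective = user_groups | {ANONYMOUS_GROUP}
    allowed = policies.get((obj, action), set())
    return bool(effective & allowed)


print(is_authorized(set(), "item:123", "READ"))            # True
print(is_authorized({"Submitters"}, "item:123", "WRITE"))  # False
print(is_authorized({ADMIN_GROUP}, "item:123", "WRITE"))   # True
```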

Page 8: Stackable Authentication Methods

Configuration File: [dspace]/config/modules/authentication.cfg

Password
• Uses the built-in e-mail address/password-based log-in
• Users can register themselves and set their own passwords
• Users are not members of any special (dynamic) e-person groups

X.509 Certificate
• This method uses an X.509 certificate sent by the client to establish his/her identity.
• Clients should have a personal Web certificate installed in their browser, issued by a Certifying Authority (CA) recognized by the web server

IP Address based
• IP-based authentication
• Allows a particular set of IP addresses
• E.g., allow all requests from a Class B network except those from a given Class C network

Shibboleth
• Distributed authentication system for securely authenticating users
• Active session: no one can access the protected URL without first authenticating.
• Lazy session: content is publicly accessible, while access is restricted only for particular areas.

LDAP
• New users can register by entering their username and password without being sent the registration token
• Users who do not have a username and password can still register and log in with just their email address
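The idea of a stack of methods, and the "Class B except Class C" example, can be sketched as follows. This is an illustration only, not DSpace's implementation: the network ranges, the password store, and the function names are assumptions, and treating an allowed IP address as a login is a simplification made for this sketch.

```python
import ipaddress
from typing import Callable, Dict, List, Optional

# Hypothetical networks: allow a /16 (a "Class B" sized range)
# except one /24 (a "Class C" sized range) inside it.
ALLOWED = ipaddress.ip_network("172.16.0.0/16")
EXCLUDED = ipaddress.ip_network("172.16.5.0/24")


def ip_allowed(address: str) -> bool:
    """True if the address is inside ALLOWED but outside EXCLUDED."""
    ip = ipaddress.ip_address(address)
    return ip in ALLOWED and ip not in EXCLUDED


# Toy password store for the sketch only; never store plain-text passwords.
PASSWORDS: Dict[str, str] = {"jane@example.org": "s3cret"}


def password_method(request: dict) -> Optional[str]:
    email = request.get("email")
    if email in PASSWORDS and PASSWORDS[email] == request.get("password"):
        return email
    return None


def ip_method(request: dict) -> Optional[str]:
    # Simplification: any request from an allowed address counts as "campus-user".
    return "campus-user" if ip_allowed(request.get("ip", "0.0.0.0")) else None


def authenticate(stack: List[Callable[[dict], Optional[str]]],
                 request: dict) -> Optional[str]:
    """Try each method in order; the first one that succeeds wins."""
    for method in stack:
        identity = method(request)
        if identity is not None:
            return identity
    return None


stack = [password_method, ip_method]
print(authenticate(stack, {"ip": "172.16.1.10"}))  # campus-user
print(authenticate(stack, {"ip": "172.16.5.20"}))  # None
```

The order of the list is the order in which methods are tried, which is the point of making them stackable: a site can enable several methods and decide which takes precedence.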

Page 9: Access Authorization (E-persons & Groups)

E-persons and groups can be authorized to:

• Subscribe to collections in DSpace and receive a daily email listing the new items in those collections.
• Create, modify, and delete communities and collections.
• Submit new items to a collection.
• Withdraw and expunge items from a collection.

Page 10: Withdraw vs Expunge

WITHDRAW
• Removes the item from view
• The item does not show up in search results
• Recoverable

EXPUNGE
• Unrecoverable
• The handle is not reused
• Can only be done by a repository administrator
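The difference between the two operations can be made concrete with a toy model. This is a sketch, not DSpace code: the `Repository` class, its method names, and the handles are hypothetical.

```python
from typing import Dict, List, Set


class Repository:
    """Toy repository; handles and item titles are hypothetical."""

    def __init__(self) -> None:
        self.items: Dict[str, dict] = {}         # handle -> item record
        self.retired_handles: Set[str] = set()   # handles that are never reused

    def add(self, handle: str, title: str) -> None:
        if handle in self.items or handle in self.retired_handles:
            raise ValueError(f"handle {handle} is already used or retired")
        self.items[handle] = {"title": title, "withdrawn": False}

    def withdraw(self, handle: str) -> None:
        self.items[handle]["withdrawn"] = True    # hidden, but still stored

    def reinstate(self, handle: str) -> None:
        self.items[handle]["withdrawn"] = False   # withdrawal is recoverable

    def expunge(self, handle: str, is_admin: bool) -> None:
        if not is_admin:
            raise PermissionError("only a repository administrator may expunge")
        del self.items[handle]                    # unrecoverable
        self.retired_handles.add(handle)          # the handle is not reused

    def search(self) -> List[str]:
        """Return handles visible to users (withdrawn items are excluded)."""
        return [h for h, item in self.items.items() if not item["withdrawn"]]


repo = Repository()
repo.add("123/1", "Paper A")
repo.add("123/2", "Paper B")
repo.withdraw("123/1")
print(repo.search())  # ['123/2']
```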

Page 11: Usage Statistics

The same statistics are shown at the Item, Collection, and Community level:

• Total page visits (all time)
• Total Visits per Month
• File Downloads (all time)
• Top Country Views (all time)
• Top City Views (all time)
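As an illustration of how such figures are derived from raw view events, the sketch below aggregates a handful of hypothetical events. The event format and all data are assumptions; DSpace's own statistics storage is not modelled here.

```python
from collections import Counter
from typing import Dict, List, Tuple

View = Tuple[str, str, str]  # (ISO date, country, city)

# Hypothetical view events for one item
events: List[View] = [
    ("2015-11-03", "India", "Kharagpur"),
    ("2015-11-17", "India", "Kolkata"),
    ("2015-12-01", "India", "Kharagpur"),
    ("2015-12-09", "Germany", "Berlin"),
]


def summarize(views: List[View]) -> Dict[str, object]:
    """Compute total visits, visits per month, and top countries and cities."""
    per_month = Counter(date[:7] for date, _, _ in views)   # "YYYY-MM"
    countries = Counter(country for _, country, _ in views)
    cities = Counter(city for _, _, city in views)
    return {
        "total_visits": len(views),
        "visits_per_month": dict(per_month),
        "top_countries": countries.most_common(3),
        "top_cities": cities.most_common(3),
    }


summary = summarize(events)
print(summary["total_visits"])      # 4
print(summary["visits_per_month"])  # {'2015-11': 2, '2015-12': 2}
```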

Page 12: Statistical Information

1) A customizable general overview of activities in the archive, by default including:
   a. Number of items archived
   b. Number of bitstream views
   c. Number of item page views
   d. Number of collection page views
   e. Number of community page views
   f. Number of user logins
   g. Number of searches performed
   h. Number of license rejections
2) Number of OAI requests
3) Customizable summary of archive contents
4) Broken-down list of item viewings
5) A full breakdown of all performed actions
6) User logins
7) Most popular searches
8) Log level information
9) Processing information

Page 13: Thank you

Page 14: National Workshop on Institutional Digital Repository

• Item Submit and Workflow
• Indexing
• Retrieval

Page 15: Item Submission Workflow

• Select Collection: If not already selected, the user must select a collection to deposit the Item into.
• Describe: The user may enter descriptive metadata about the Item. This step may consist of one or more pages of metadata entry.
• Upload: The user may upload one or more files to associate with the Item.
• Review: The user may review all previously entered information and correct anything as needed.
• License: The user must agree to the repository distribution license in order to complete the deposit.
• Complete: The deposit is now completed. The Item will either become immediately available or undergo a workflow approval process.
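The steps above form a fixed sequence with two preconditions: a collection must be chosen, and the license must be accepted. A minimal sketch of that sequence follows; the `Submission` class and its attribute names are hypothetical and do not correspond to DSpace's submission code.

```python
from typing import List, Optional

STEPS: List[str] = ["Select Collection", "Describe", "Upload",
                    "Review", "License", "Complete"]


class Submission:
    def __init__(self, collection: Optional[str] = None) -> None:
        self.collection = collection
        # Skip the first step when a collection is already selected.
        self.position = 1 if collection else 0
        self.license_accepted = False

    @property
    def step(self) -> str:
        return STEPS[self.position]

    def advance(self) -> str:
        """Move to the next step, enforcing the two preconditions."""
        if self.step == "Select Collection" and not self.collection:
            raise ValueError("a collection must be selected first")
        if self.step == "License" and not self.license_accepted:
            raise ValueError("the distribution license must be accepted")
        if self.position < len(STEPS) - 1:
            self.position += 1
        return self.step


s = Submission(collection="Theses")
print(s.step)       # Describe
print(s.advance())  # Upload
```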

Page 16: Item Submission Workflow (GUI)

[Screenshots: admin login followed by the submission screens, numbered 1 to 5]

Initial Questions -> Describe -> Upload -> Verify -> License -> Complete

Page 17: Indexing

Option I: Faceted/Filtered Search & Browse (via Solr & DSpace Discovery), enabled by default since DSpace 4.0.

Option II: DSpace can apply filters or transformations to files/bitstreams, creating new content. These filters can extract text for full-text searching and create thumbnails for items that contain images.

Page 18: Creating Indexes

Create the DB search/browse tables

• [dspace]/bin/dspace index-db-browse
• [dspace]/bin/dspace index-lucene-init (complete index regeneration)
• [dspace]/bin/dspace index-lucene-update (update the Lucene index)

Configure search and browse to use PostgreSQL

• Alter dspace.cfg to set the browseDAO to postgres
• Alter dspace.cfg to have ItemCount use Postgres

Page 19: Creating Indexes

Filter Media
• Available Media Filters
  • HTML Text, JPEG Thumbnail, PDF text, XPDF, Word, Powerpoint, ImageMagick

Filter Media from Command Prompt
• [dspace]/bin/dspace filter-media

Page 20: Indexing Strategies & Customization

Browse Index Customization
• Add a Series Browse
  • You want to add a new browse using a previously unused metadata element
• Combine more than one metadata field into a browse
  • You may have other title fields used in your repository. You may only want one or two of them added, not all title fields. And/or you may want your series to file in there.
• Separate subject browse
  • You may want to have a separate subject browse limited to only one type of subject.

Page 21: Indexing Strategies & Customization

Search Index Customization in [dspace]/config/dspace.cfg
• Where to put the search index files
  • ${dspace.dir}/search
• Search analyzer to use
  • The standard is DSAnalyzer
• Search Operator
  • AND or OR
• List of metadata fields to index
  • search.index.1 = author:dc.contributor.*
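The `search.index.N = name:field` entries map an index name to a metadata field pattern. The sketch below parses such lines and reports which indexes would cover a given field. It is an illustration only: the second config entry is hypothetical, and the wildcard is interpreted with Python's glob-style matching, which may not match DSpace's own wildcard rules exactly.

```python
from fnmatch import fnmatchcase
from typing import Dict, List

config_lines: List[str] = [
    "search.index.1 = author:dc.contributor.*",
    "search.index.2 = subject:dc.subject.*",  # hypothetical second entry
]


def parse_search_indexes(lines: List[str]) -> Dict[str, str]:
    """Map each index name to its metadata field pattern."""
    indexes: Dict[str, str] = {}
    for line in lines:
        key, _, value = line.partition("=")
        if not key.strip().startswith("search.index."):
            continue  # ignore unrelated configuration lines
        name, _, pattern = value.strip().partition(":")
        indexes[name] = pattern
    return indexes


def indexes_for_field(field: str, indexes: Dict[str, str]) -> List[str]:
    """Return the index names whose pattern matches the metadata field."""
    return [name for name, pattern in indexes.items()
            if fnmatchcase(field, pattern)]


indexes = parse_search_indexes(config_lines)
print(indexes_for_field("dc.contributor.author", indexes))  # ['author']
```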

Page 22: Retrieval

Discovery
• The Discovery module enables faceted searching & browsing for your repository

Sidebar Facet
• Breaks up search results into multiple categories and shows counts for each
• Allows the user to restrict their search results based on those facets

Search Filter
• The user can modify the list of displayed search results by specifying additional filters that will be applied to the list of search results
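Facet counts and search filters can be illustrated on a small in-memory result list. This sketch only shows the idea; it does not use Solr or DSpace Discovery, and the records and function names are hypothetical.

```python
from collections import Counter
from typing import Dict, List

# Hypothetical search results
results: List[Dict[str, str]] = [
    {"title": "Paper A", "author": "Sen", "year": "2014"},
    {"title": "Paper B", "author": "Sen", "year": "2015"},
    {"title": "Paper C", "author": "Roy", "year": "2015"},
]


def facet_counts(items: List[Dict[str, str]], field: str) -> Dict[str, int]:
    """Break the results into categories of one field, with a count for each."""
    return dict(Counter(item[field] for item in items))


def apply_filter(items: List[Dict[str, str]], field: str,
                 value: str) -> List[Dict[str, str]]:
    """Restrict the results to those matching the chosen facet value."""
    return [item for item in items if item[field] == value]


print(facet_counts(results, "author"))  # {'Sen': 2, 'Roy': 1}
narrowed = apply_filter(results, "year", "2015")
print([item["title"] for item in narrowed])  # ['Paper B', 'Paper C']
```

Filters can be chained: applying a second filter to `narrowed` restricts the list further, which mirrors adding several filters to one search.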

Page 23: Retrieval

Enabling Discovery

Browse with Discovery

• [dspace]/bin/dspace index-db-browse -f -d

Configuration Files

• [dspace]/config/modules/discovery.cfg -> General Settings
• [dspace-install-dir]/config/spring/api/discovery.xml

Page 24: Minimum Hardware Configuration

Minimum Required:

A desktop computer with the following configuration:

1. Intel Dual Core processor
2. RAM: 2 GB
3. Hard disk capacity: 320 GB
4. Network: 10/100 Mbps network card
5. WAN: registered domain name and internet connectivity

Preferred:

A desktop computer with the following configuration:

1. Intel Quad Core processor
2. RAM: at least 8 GB
3. Hard disk capacity: 1 TB
4. Network: 10/100/1000 Mbps network card
5. WAN: registered domain name and internet connectivity