National Workshop on
Institutional Digital Repository
Shibabroto Banerjee
Department of Computer Science and Engineering
IIT Kharagpur
• Create Users and Groups in DSpace
• Authentication Methods
• Access Authorization
• Usage Statistics
Community & Collection (Data Model)
Community
  Sub-Community
    Collection
      Item: Metadata, Bitstreams, License
  Collection
    Item: Metadata, Bitstreams, License
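The hierarchy above can be pictured as nested containers. The following is a minimal sketch in Python, not DSpace's own (Java) classes; every class, field and sample value here is hypothetical and only illustrates how communities, collections and items nest.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Item:
    # An item carries descriptive metadata, its files (bitstreams) and a license.
    metadata: Dict[str, str]
    bitstreams: List[str] = field(default_factory=list)
    license: str = ""


@dataclass
class Collection:
    name: str
    items: List[Item] = field(default_factory=list)


@dataclass
class Community:
    name: str
    sub_communities: List["Community"] = field(default_factory=list)
    collections: List[Collection] = field(default_factory=list)

    def count_items(self) -> int:
        # Count items in this community and, recursively, in its sub-communities.
        own = sum(len(c.items) for c in self.collections)
        return own + sum(s.count_items() for s in self.sub_communities)


# Hypothetical sample data.
thesis = Item({"dc.title": "Sample thesis"}, ["thesis.pdf"], "Distribution license")
dept = Community("Computer Science", collections=[Collection("Theses", [thesis])])
root = Community("Institute", sub_communities=[dept])
```

Calling `root.count_items()` walks the whole tree, which mirrors how a community page can report the number of items beneath it.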
Creating Community and Collection
• Log in as an administrator
• Click Communities & Collections (under Browse)
• Click Create Community (under Context)
• Give the community a name and add a short description and introductory text (optional)
• Upload a logo file (optional)
• Finish/Return
Create Users
A DSpace user is called an E-person.
Command Prompt:
[dspace]/bin/dspace user --add --email <email address of user> -g <Firstname> -s <Surname> --password <Password for the user>
XMLUI / JSPUI Interface:
Select “People” (under Administrative – Access Control). Click “Create a new e-person”. Add the new person’s name and contact information. Click “Create E-person”.
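When many e-persons have to be created, the command-line form can be generated from a list. The sketch below only builds the argument lists, using the flags quoted on this slide. The CSV column order, the helper names and the sample users are assumptions made for illustration, and `[dspace]` is left as a placeholder for the real install path.

```python
import csv
import io
from typing import List

DSPACE_BIN = "[dspace]/bin/dspace"  # placeholder; substitute the real install path


def user_add_command(email: str, first: str, surname: str, password: str) -> List[str]:
    # Build the argument list for the `dspace user --add` command from the slide.
    return [DSPACE_BIN, "user", "--add", "--email", email,
            "-g", first, "-s", surname, "--password", password]


def commands_from_csv(text: str) -> List[List[str]]:
    # Assumed column order: email, first name, surname, password.
    return [user_add_command(*row) for row in csv.reader(io.StringIO(text)) if row]


# Hypothetical sample users.
sample = "asha@example.org,Asha,Roy,s3cret\nravi@example.org,Ravi,Sen,pa55word\n"
commands = commands_from_csv(sample)
```

Each entry in `commands` could then be passed to `subprocess.run` on the server where DSpace is installed.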
Create Groups
Choose “Groups” (under Administration – Access Controls)
Select “Create a new Group”
Give the group a name
Use “Search members” to find the e-people to add to the new group
Click “Add” next to the e-people you want to add to the group
Click “Save”
Groups and Policies
Administrator: can do anything in the site.
Anonymous: the group that contains all users.
Policy actions by object type:
Collection
• ADD/REMOVE
• DEFAULT_ITEM_READ
• DEFAULT_BITSTREAM_READ
• COLLECTION_ADMIN
Item
• ADD/REMOVE
• READ
• WRITE
Bundle
• ADD/REMOVE
• READ
• WRITE
Bitstream
• READ
• WRITE
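A policy can be read as "this group may perform this action on this object". The sketch below is a simplified illustration of that idea and not DSpace's authorization code. The object identifiers, the "Editors" and "Submitters" groups and the function name are hypothetical.

```python
from typing import Set, Tuple

ADMIN = "Administrator"
ANONYMOUS = "Anonymous"

# Each policy grants one action on one object to one group.
Policy = Tuple[str, str, str]  # (object id, action, group)


def is_authorized(policies: Set[Policy], obj: str, action: str, groups: Set[str]) -> bool:
    # Administrators can do anything in the site.
    if ADMIN in groups:
        return True
    # Every user, logged in or not, counts as a member of Anonymous.
    effective = groups | {ANONYMOUS}
    return any(o == obj and a == action and g in effective for (o, a, g) in policies)


# Hypothetical policies.
policies = {
    ("item-1", "READ", ANONYMOUS),
    ("item-1", "WRITE", "Editors"),
    ("collection-1", "ADD", "Submitters"),
}
```

With these policies anyone may read `item-1`, only members of "Editors" may modify it, and an administrator passes every check.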
Stackable Authentication Methods
Configuration File: [dspace]/config/modules/authentication.cfg
Password
• Uses the built-in e-mail address/password-based log-in
• Users can register themselves and set their own passwords
• Users are not members of any special (dynamic) e-person groups
Shibboleth
• Distributed authentication system for securely authenticating users
• Active Session: no one can access the URL without first authenticating
• Lazy Session: content is publicly accessible, while only particular areas are restricted
IP Address based
• Allows a particular set of IP addresses
• E.g., allow all requests from a Class B network except those from a Class C network
LDAP
• New users can register by entering their username and password without being sent a registration token
• Users who do not have a username and password can still register and log in with just their e-mail address
X.509 Certificate
• Uses an X.509 certificate sent by the client to establish his/her identity
• Clients should have a personal Web certificate installed in their browser, issued by a Certifying Authority (CA) recognized by the web server
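"Stackable" means the configured methods are tried one after another until one of them recognizes the user. The sketch below illustrates that idea, plus an IP range check, in plain Python. It is not DSpace's implementation: the function names, the in-memory password table, the "Campus" group and the address range are all assumptions made for the example.

```python
import ipaddress
from typing import Callable, Dict, List, Optional, Set

Request = Dict[str, str]
# A method returns the authenticated e-mail address, or None to let the next one try.
Method = Callable[[Request], Optional[str]]

PASSWORDS = {"asha@example.org": "s3cret"}  # hypothetical; real systems store hashes


def password_method(req: Request) -> Optional[str]:
    email = req.get("email", "")
    if email in PASSWORDS and PASSWORDS[email] == req.get("password"):
        return email
    return None


def authenticate(stack: List[Method], req: Request) -> Optional[str]:
    # Try each method in its configured order; the first success wins.
    for method in stack:
        user = method(req)
        if user is not None:
            return user
    return None


def groups_for_ip(ip: str, ranges: Dict[str, str]) -> Set[str]:
    # IP-based step: return the groups whose address range contains the client.
    addr = ipaddress.ip_address(ip)
    return {group for group, cidr in ranges.items()
            if addr in ipaddress.ip_network(cidr)}


# Hypothetical range: one /16 network mapped to a "Campus" group.
CAMPUS_RANGES = {"Campus": "10.5.0.0/16"}
```

Adding another method to the stack is then a matter of appending one more function to the list passed to `authenticate`.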
Access Authorization: E-persons & Groups
• Subscribe to collections in DSpace and receive a daily email containing the new items in those collections.
• Create, modify and delete Communities and Collections.
• Submit new items to a collection.
• Withdraw and expunge items from a collection.
Withdraw Vs Expunge
WITHDRAW
• Removes the item from view
• Item does not show up in search results
• Recoverable
EXPUNGE
• Unrecoverable
• Handle is not reused
• Can only be done by a repository administrator
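The difference between the two operations can be modelled with a small in-memory repository. This is an illustrative sketch only: the `Repository` class, its method names and the sample handle are hypothetical and do not correspond to DSpace's API.

```python
from typing import Dict, List, Set


class Repository:
    def __init__(self) -> None:
        self.items: Dict[str, dict] = {}
        self.retired_handles: Set[str] = set()

    def add(self, handle: str, title: str) -> None:
        # A handle that is in use, or that belonged to an expunged item, is never reused.
        if handle in self.items or handle in self.retired_handles:
            raise ValueError("handle already used: " + handle)
        self.items[handle] = {"title": title, "withdrawn": False}

    def withdraw(self, handle: str) -> None:
        # The item stays stored but is hidden from view and from search.
        self.items[handle]["withdrawn"] = True

    def reinstate(self, handle: str) -> None:
        # Withdrawal is recoverable.
        self.items[handle]["withdrawn"] = False

    def expunge(self, handle: str, is_admin: bool) -> None:
        # Expunging deletes the item for good and is reserved for administrators.
        if not is_admin:
            raise PermissionError("only a repository administrator may expunge")
        del self.items[handle]
        self.retired_handles.add(handle)

    def search(self, word: str) -> List[str]:
        return sorted(h for h, item in self.items.items()
                      if not item["withdrawn"] and word.lower() in item["title"].lower())
```

A withdrawn item disappears from `search` until `reinstate` is called, whereas an expunged item is gone and its handle stays retired.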
Usage Statistics
The following statistics are shown for each Item, Collection and Community:
• Total page visits (all time)
• Total visits per month
• File downloads (all time)
• Top country views (all time)
• Top city views (all time)
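Figures of this kind are simple aggregations over logged usage events. The sketch below shows one way such a report could be computed. The event format, the sample events and the `summarize` function are hypothetical and are not how DSpace stores or queries its statistics.

```python
from collections import Counter
from datetime import date
from typing import Dict, List, Tuple

# (day, country, kind) where kind is "view" or "download"; assumed event format.
Event = Tuple[date, str, str]


def summarize(events: List[Event]) -> Dict[str, object]:
    views = [e for e in events if e[2] == "view"]
    return {
        "total_visits": len(views),
        "visits_per_month": Counter(d.strftime("%Y-%m") for d, _, _ in views),
        "file_downloads": sum(1 for e in events if e[2] == "download"),
        "top_countries": Counter(c for _, c, _ in views).most_common(),
    }


# Hypothetical sample events.
events = [
    (date(2015, 1, 3), "IN", "view"),
    (date(2015, 1, 9), "IN", "download"),
    (date(2015, 2, 1), "US", "view"),
    (date(2015, 2, 7), "IN", "view"),
]
report = summarize(events)
```

The same aggregation applies at item, collection or community level; only the set of events fed in changes.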
Statistical Information
1) A customizable general overview of activities in the archive, by default including:
   a. Number of items archived
   b. Number of bitstream views
   c. Number of item page views
   d. Number of collection page views
   e. Number of community page views
   f. Number of user logins
   g. Number of searches performed
   h. Number of license rejections
2) Number of OAI requests
3) Customizable summary of archive contents
4) Broken-down list of item viewings
5) A full breakdown of all performed actions
6) User logins
7) Most popular searches
8) Log level information
9) Processing information
Thank you
• Item Submit and Workflow
• Indexing
• Retrieval
Item Submission Workflow
• Select Collection: if not already selected, the user must select a collection to deposit the Item into.
• Describe: the user may enter descriptive metadata about the Item. This step may consist of one or more pages of metadata entry.
• Upload: the user may upload one or more files to associate with the Item.
• Review: the user may review all previously entered information and correct anything as needed.
• License: the user must agree to the repository distribution license in order to complete the deposit.
• Complete: the deposit is now completed. The Item will either become immediately available or undergo a workflow approval process.
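The submission steps form a fixed sequence with one gate: the deposit cannot complete until the license is accepted. A minimal sketch of that sequence follows. The step names are taken from the slide, while the `next_step` function is a hypothetical illustration and not part of DSpace.

```python
STEPS = ["Select Collection", "Describe", "Upload", "Review", "License", "Complete"]


def next_step(current: str, license_accepted: bool = False) -> str:
    # Return the step that follows `current` in the submission sequence.
    if current == "Complete":
        return current  # nothing follows a completed deposit
    if current == "License" and not license_accepted:
        return current  # the user must agree to the license to finish
    return STEPS[STEPS.index(current) + 1]
```

For example, `next_step("Upload")` gives "Review", while `next_step("License")` stays on "License" until it is called with `license_accepted=True`.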
Item Submission Workflow (GUI)
Admin Login
Initial Questions -> Describe -> Upload -> Verify -> License -> Complete
Indexing
Faceted/Filtered Search & Browse (via Solr & DSpace Discovery) - enabled by default since DSpace 4.0
DSpace can apply filters or transformations to files/bitstreams, creating new content. These filters extract text for full-text searching and create thumbnails for items that contain images.
Option I
Option II
Creating Indexes
Create the DB search/browse tables
[dspace]/bin/dspace index-db-browse
[dspace]/bin/dspace index-lucene-init (complete index regeneration)
[dspace]/bin/dspace index-lucene-update (update the Lucene index)
Configure search and browse to use PostgreSQL
Alter dspace.cfg to set the browseDAO to postgres
Alter dspace.cfg to have ItemCount use Postgres
Creating Indexes
Filter Media
• Available Media Filters: HTML Text, JPEG Thumbnail, PDF Text, XPDF, Word, PowerPoint, ImageMagick
Filter Media from Command Prompt
• [dspace]/bin/dspace filter-media
Indexing Strategies & Customization
Browse Index Customization
• Add a Series Browse
  • You want to add a new browse using a previously unused metadata element
• Combine more than one metadata field into a browse
  • You may have other title fields used in your repository. You may want only one or two of them added, not all title fields. You may also want your series to file there.
• Separate subject browse
  • You may want to have a separate subject browse limited to only one type of subject.
Indexing Strategies & Customization
Search Index Customization in [dspace]/config/dspace.cfg
• Where to put the search index files
  • ${dspace.dir}/search
• Implement Search Analyzer
  • Standard is DSAnalyzer
• Search Operator
  • AND or OR
• List of metadata fields to index
  • search.index.1 = author:dc.contributor.*
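A `search.index` line pairs an index name with a metadata field pattern. The sketch below parses such lines and reports which index a given field would feed. It treats `*` as a shell-style wildcard, which is an assumption about the matching rule. The second configuration line and both function names are hypothetical.

```python
from fnmatch import fnmatchcase
from typing import List, Tuple


def parse_search_index(line: str) -> Tuple[str, str]:
    # "search.index.1 = author:dc.contributor.*" -> ("author", "dc.contributor.*")
    _key, value = (part.strip() for part in line.split("=", 1))
    index_name, pattern = value.split(":", 1)
    return index_name, pattern


def indexes_for(field: str, config_lines: List[str]) -> List[str]:
    # Names of all indexes whose pattern matches the metadata field.
    return [name for name, pattern in map(parse_search_index, config_lines)
            if fnmatchcase(field, pattern)]


config = [
    "search.index.1 = author:dc.contributor.*",
    "search.index.2 = title:dc.title.*",  # hypothetical second entry
]
```

Here `indexes_for("dc.contributor.author", config)` names the `author` index, so every qualifier of `dc.contributor` becomes searchable as an author.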
Retrieval
Discovery
• The Discovery module enables faceted searching & browsing for your repository
Sidebar Facet
• Breaks up search results into multiple categories and counts for each
• Allows the user to restrict their search results based on those facets
Search Filter
• The user can modify the list of displayed search results by specifying additional filters that will be applied to the list of search results
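Facets and filters are easy to picture on a small result list: a facet counts how many results carry each value of a field, and a filter keeps only the results that carry a chosen value. The sketch below is a plain-Python illustration and not Discovery's Solr-based implementation. The field names and sample records are hypothetical.

```python
from collections import Counter
from typing import Dict, List

Record = Dict[str, List[str]]


def facet_counts(results: List[Record], field: str) -> Counter:
    # Sidebar facet: number of results per value of the given field.
    return Counter(value for r in results for value in r.get(field, []))


def apply_filter(results: List[Record], field: str, value: str) -> List[Record]:
    # Search filter: keep only results that carry the chosen value.
    return [r for r in results if value in r.get(field, [])]


# Hypothetical search results.
results = [
    {"title": ["Indexing in DSpace"], "author": ["Roy, A."], "subject": ["Indexing"]},
    {"title": ["Faceted search"], "author": ["Sen, R."], "subject": ["Indexing", "Search"]},
    {"title": ["Digital libraries"], "author": ["Roy, A."], "subject": ["Repositories"]},
]
```

Clicking a facet value in the sidebar corresponds to calling `apply_filter` with that value and then recomputing the counts on the narrowed list.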
Retrieval
Enabling Discovery
Browse with Discovery
• [dspace]/bin/dspace index-db-browse -f -d
Configuration Files
• [dspace]/config/modules/discovery.cfg -> General Settings
• [dspace-install-dir]/config/spring/api/discovery.xml
Minimum Hardware Configuration
Minimum Required:
A Desktop Computer with the following configuration:
1. Intel Dual Core Processor
2. RAM: 2 GB
3. Hard Disk capacity – 320 GB
4. Network – 10/100 Mbps network card
5. WAN – Registered domain name and internet connectivity
Preferred:
A Desktop Computer with the following configuration:
1. Intel Quad Core Processor
2. RAM: at least 8 GB
3. Hard Disk capacity – 1 TB
4. Network – 10/100/1000 Mbps network card
5. WAN – Registered domain name and internet connectivity