primo at ticer 2009 - afternoon session
DESCRIPTION
Primo at Ticer 2009 - afternoon session. By Jørgen Madsen from The Royal Library of DenmarkTRANSCRIPT
Integrated search – Primo Part IIIntegrated search – Primo Part II
Jørgen Madsen, The Royal Library of DenmarkJørgen Madsen, The Royal Library of Denmark
Tilburg, July 31st 2009Tilburg, July 31st 2009
Presentation - overviewPresentation - overview
The Royal Library explained
The complexity: different roles and different needs
Integrated? Not quite. The pain of articles
Clean-up. Deduplication and its blessings
Future development – the road ahead
The Royal Library of DenmarkThe Royal Library of Denmark
National Library of Denmark
University Library for Copenhagen University (30.000 FTE’s)
Principal collections of about 6M records
ALEPH host for ~100 other research libraries
The Royal Library of DenmarkThe Royal Library of Denmark
Main catalog 5M records
9 separate ALEPH databases
Numerous additional non-catalog databases
Close cooperation with the 8 University faculties and their many institute libraries
The complexityThe complexity
Why are we so complex?
Legal deposit library with strict reading room access (national library)
Lending library (university library)
Digital access to special collections for everyone (national library)
Highly restricted access to licensed material (e-books, e-journals, and databases)
Full integration of faculty and institute libraries – some of which are open lending libraries, some of which are closed to the point of being almost secret libraries
The state of affairsThe state of affairs
Our information universe was, in other words:
Fragmented
Complex
Quite user unfriendly
The state of affairsThe state of affairs
What we want to do is:
Bring everything together in a single interface (including licensed material – articles)
Make it easier to navigate our complex collections
Get our licensed material exposed and made easily accessible
Integrated? It’s complicated…Integrated? It’s complicated…
Local catalog and remote articles:
Our own data is harvested and thus integrated (including e-books, e-jornals, and databases)
Articles are searched remote in the same interface
We can only integrate data that we can get
The article problem is handled via DADS and Deep Search
DADSDADS
DADS: DTV Article Database Service
Data bank of harvested and normalized article (meta)data
Hosted by The Technical Information Centre of Denmark (DTIC)
Contains 100M+ records
Currently a project exists to create a national databank containing everything relevant to all Danish research libraries
Databases: ABI-inform
Biosis Previews
Compendex
Ebsco Academic Search Elite
Ebsco Business Source Premier
FSTA
InspecP
Periodicals Index Online – Proquest
Wilson Art and Humanites
------------------------------------------------
Data aggregators:Swets (+20,000 journals)
DOAJ
Publishers: ACS – American Chemical Society
Blackwell
Cambridge University Press
Elsevier (incl. Acpress)
Emerald
JSTOR
Karger
Nature Publishing Group
Oxford University Press
Science Magazine
Springer (incl. Kluwer)
Swetscan
Sage
…and a couple more…
Deep Search (3rd node) Deep Search (3rd node)
Deep Search is Primo’s way of searching and presenting a remote pool of data
It is configurable and can be set up for any SRU enabled target
Delivery is handled via SFX in our case
Deep Search results can be blended with local Primo results to create one result set
So how does all this work?
IntegratedIntegrated and federated and federated
Integrated and Integrated and federatedfederated
Integrated and federated – access via SFXIntegrated and federated – access via SFX
Integrated and federated – fulltext articleIntegrated and federated – fulltext article
Clean-up - deduplicationClean-up - deduplication
Deduplication
Defined as part of normalization (pipes)
Fully configurable
Merges records and places holdings on one record
Clean-up - deduplicationClean-up - deduplication
DeduplicationDeduplication
Deduplication – under the hood:
Vectors are created in the Back Office:
DeduplicationDeduplication
Deduplication – under the hood:
The vectors created are used to identify matching records:
DeduplicationDeduplication
Deduplication – under the hood:
These matched records have their holdings merged:
Further clean-up - FRBRFurther clean-up - FRBR
FRBR:
FRBR works just like deduplication
The Royal Library has currently disabled FRBR
Version 3 – release end of 2009Version 3 – release end of 2009
Some of the highlights
ILS functionality fully integrated in frontend
WARC pipe - web harvesting
Primo Central – central article and e-book index. Currently:
EBSCOAlexander Street PressIOP PublishingProject MUSEPNASAmerican Institute of PhysicsSPIESIAMOECD
A final wordA final word
If you opt for integrated search, here are a few thoughts:
Re-think your data entirely
Re-think selection – what does it actually mean to ”own” things these days (case in point: Dawsonera)?
It is NOT a catalog! It is ALSO a catalog!