kathryn lybarger fourth friday february 22, 2013

30
Ebook cataloging: trouble, even in batch Kathryn Lybarger Fourth Friday February 22, 2013

Upload: catalina-claybrook

Post on 01-Apr-2015

219 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Kathryn Lybarger Fourth Friday February 22, 2013

Ebook cataloging: trouble, even in batch

Kathryn LybargerFourth FridayFebruary 22, 2013

Page 2: Kathryn Lybarger Fourth Friday February 22, 2013

MARC

A data format used to encode and share bibliographic data

Developed in the 1960’s, still quite popular

Page 3: Kathryn Lybarger Fourth Friday February 22, 2013

Cataloging

Catalog

Library of Congress

OCLCor

SkyRiver

Original Cataloging

Page 4: Kathryn Lybarger Fourth Friday February 22, 2013

Vendors often provide MARC records

Page 5: Kathryn Lybarger Fourth Friday February 22, 2013

Batch loading

Vendor MARC Catalog

Page 6: Kathryn Lybarger Fourth Friday February 22, 2013

All done?

Page 7: Kathryn Lybarger Fourth Friday February 22, 2013

Not quite…

Page 8: Kathryn Lybarger Fourth Friday February 22, 2013

Records may be icky…

Title: CESMM3 price database 2009, edited by Franklin + Andrews

100 1_ Franklin.245 10 CESMM3 price database 2009 ‡h [electronic resource] / ‡c edited by Franklin and Andrews.500 __ Ebook.516 __ Document.538 __ PDF: Adobe PDF700 1_ Andrews.856 40 …

Page 9: Kathryn Lybarger Fourth Friday February 22, 2013

…but worse, non-functional!

Data may be unhelpful, or misleading

Links may not work

This may change over time

Page 10: Kathryn Lybarger Fourth Friday February 22, 2013

A crazy mixed-up record(with 112 holdings)

From one book: Title Author Series Subject headings

From another book: Notes ISBN Link to e-book

Page 11: Kathryn Lybarger Fourth Friday February 22, 2013

URLs from other vendors

Provider-neutral records may have URLs from multiple vendors

An OCLC search for records with URLs from eblib, ebrary, ebscohost AND

myilibrary returned over 43,000.

Even if they are labeled, patrons don’t know which vendor we’re using

Page 12: Kathryn Lybarger Fourth Friday February 22, 2013

URLs that point nowhere

Page 13: Kathryn Lybarger Fourth Friday February 22, 2013

URLs that point somewhere new!

Page 14: Kathryn Lybarger Fourth Friday February 22, 2013

DOI troubles

Page 15: Kathryn Lybarger Fourth Friday February 22, 2013

Books may not be available yet(or ever)

Page 16: Kathryn Lybarger Fourth Friday February 22, 2013

“Slippage”

Some ebooks on a frontlist may never appear on the site

Individual ebooks may just disappear

Page 17: Kathryn Lybarger Fourth Friday February 22, 2013

Lists may be available…

But not forthcoming.

You may have to periodically dig several levels deep on the website to get them:

Page 18: Kathryn Lybarger Fourth Friday February 22, 2013

Or they may just not mention it…

After we confronted one vendor:

“Of the 15 accounts that I spot checked the most usage for (TITLE) was 5 views. However, I see that the 2 titles listed below are in the top 5 most viewed for UKY in 2012.”

Page 19: Kathryn Lybarger Fourth Friday February 22, 2013

Platform change

Page 20: Kathryn Lybarger Fourth Friday February 22, 2013

Solutions?

Start with the best records you can find

Edit MARC records to conform with local standards

Verify access to all titles (periodically)

Report problems when you find them

Page 21: Kathryn Lybarger Fourth Friday February 22, 2013

Vendors may do some editing

But how do you predict what you will need?

Page 22: Kathryn Lybarger Fourth Friday February 22, 2013

MarcEdit

Developed by Terry Reese

MARC editing in a friendly yet powerful text editor

Z39.50 client

(Binary editor!)

Page 23: Kathryn Lybarger Fourth Friday February 22, 2013

Automation

Efficiency, Consistency

.mrk format is text, so you can process with your favorite programming language

Don’t have a favorite language (yet)?

Page 24: Kathryn Lybarger Fourth Friday February 22, 2013

Codecademy #libcodeyear

Page 25: Kathryn Lybarger Fourth Friday February 22, 2013

My automation: Normac

NORMalization and Access Checking

Open source software, releasing soon

Configure per vendor: Fields to add/delete Changes to make How to check links

Page 26: Kathryn Lybarger Fourth Friday February 22, 2013

Generic link checkers may not be effective

Ebook errors can be valid web pages, and errors don’t mean you should give up!

HTTP/1.1 200 OK Full text ebook Web site form to buy the book

HTTP/1.1 404 Not Found No such page on server Broken DOI (that you should report)

Page 27: Kathryn Lybarger Fourth Friday February 22, 2013

Effective link checking (my method)

Database holds a list of links to be checked

Script checks each according to site profile (pausing 10 seconds between each link): Is it a PDF? Does it contain the phrase “This is not

part of your subscription”? Can you click through to fulltext

chapters?

Page 28: Kathryn Lybarger Fourth Friday February 22, 2013

If you see broken links…

Let me know!

I have to know what broken ebooks look like (from a given vendor) before I can detect them

If a vendor has many brokenones I can do a systematic

check

Page 29: Kathryn Lybarger Fourth Friday February 22, 2013

Any questions?

Page 30: Kathryn Lybarger Fourth Friday February 22, 2013

Links

MarcEdit http://people.oregonstate.edu/~reeset/marcedit/html/index.php

Code Academyhttp://www.codeacademy.com

My GitHub repositoryhttps://github.com/zemkat/