nate trail network development & marc standards office 8/1/2006 with help from sydney olive how...

Post on 03-Jan-2016

216 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Nate TrailNetwork Development & MARC Standards Office8/1/2006

With help from Sydney Olive

How to Build, Display and Find METS Objects

 

•A little about the technology

•How we use it

•Searching and Lucene

•Cocoon Tools

•Future Directions

•Conclusions

Outline

Technology Used• XML documents (METS, MODS, etc.)

• METS Object

• XSL processes • Transformation or display

• Cocoon framework (and a little Java)

• User input via URL matches

Cocoon Concepts (1)

–<map:match pattern="navigation/*">

<map:generate src="cocoon:/{1}/mets.xml"/>

<map:transform src="navigation.xsl"/>

<map:serialize type="xml"/>–</map:match>

• Combine small matches to create

a pipeline

• Separate Data from Action and Design

Cocoon Concepts (2)

• Cocoon for Display o Navigation, Description,

Pageturning

o xml data for display

o add header, footer,

breadcrumbs

o VHP presentations

How we use it (3)

Searching and Lucene

• We have XML objects, so we need an XML based search system.

• Cocoon comes with Lucene.

• Lucene indexes are easy to configure and modify: METS In x t mde ed i e

Searching

• Search page – limit search to certain object types,

– search certain fields only, or

– search all (default)

• Canned search as collection presentation tool (LC Publications, baseball)

• New browsing capability

Cocoon Tools (1)

• METS Profile Validation (fs01)

• SQL : query any Oracle, MySQL tables from the browser to get back XML

Cocoon Tools (2)

• Site administration, enhancement of processes

– Get IDs for some set of the database for extraction, cleanup

– Convert from MARC to MODS (input) or MODS to MARC: (extract)

– Convert HTML to XML

– Detect deletions of objects

– Link checking

– Search for MARC records using SRU

Future Directions

• New Behaviors, Profiles (Article OCR) (Multivolume Monograph / rarebook source)

• RSS• Integrate profile validation with METS building• Integrate JHOVE file inspection tool (MIX)• Investigate using JPEG2000 file format for image

manipulation enhancement• Submit objects to a repository• Exchange objects with another entity

Conclusions

• What are we learning from building this application?

Conclusions

• The METS standard is flexible enough to describe multiple kinds of complex objects

Conclusions

• METS Profiles help define an object and it’s range of behaviors

Conclusions

• METS and MODS play well together.

Conclusions

• METS files can be built automatically.

Conclusions

• It may look hard, but METS objects can be parsed, validated and used without difficulty

Conclusions

• MODS is extremely powerful as a structural tool, not just a bibliographic tool

• (but good, consistent, authoritative cataloging makes search and display

much more reliable)

Conclusions

• Display is for today only; well structured data lasts.

Conclusions

Without standardized structures, we end up with egg on our face.

Nate Trail

ntra@loc.gov

7-2193Photos © 2006 Nate Trail

Any Questions?

Get back to work!!

Photos © 2006 Nate Trail

top related