Deep Impact: Metadata and SUNCAT
DESCRIPTION
Presented by Natasha Aburrow-Jones at the CILIP Cataloguing and Indexing Group Conference 2014 at Canterbury on 8 September 2014. Poor quality, non-standardised metadata may not lead directly to the end of the world, but it won't help!
TRANSCRIPT
![Page 1: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/1.jpg)
DEEP IMPACT
METADATA & SUNCAT
Natasha Aburrow-Jones
![Page 2: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/2.jpg)
Introduction to SUNCAT
• SUNCAT: the Serials Union Catalogue for the UK
• Project started in 2003; service launched in 2005 – and still going strong!
• 100 Contributing Libraries – National, University, Specialist
![Page 3: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/3.jpg)
How we accept data - carrier
• MARC Communications Format files ftp’d to a secure area on the SUNCAT server (preferred)
• Word documents
• Excel spreadsheets
• Access databases
• CSV / tab-separated files
• Anything (everything) else
![Page 4: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/4.jpg)
How we accept data - content
• AACR2
• RDA
• Hybrid
• Anything (everything) else
![Page 5: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/5.jpg)
Data normalisation
• For all libraries, some standard normalisation, e.g.,
• In tag 022, change lower case “x” to upper case “X”
• Change 245$h[computer file] to $h[electronic resource]
• Change 6XX$xPeriodicals to $vPeriodicals only when it is the last subfield in the tag
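The three standard rules above can be sketched in a few lines of Python. This is only an illustration, not SUNCAT's actual code: the record model (a list of tag and subfield pairs), the function name `normalise_standard`, the choice of which 022 subfields hold ISSNs, and the tolerance for a trailing full stop after "Periodicals" are all assumptions made for the example.

```python
# Hypothetical record model (not SUNCAT's own): a record is a list of
# (tag, subfields) pairs, and subfields is a list of (code, value) pairs.

# Assumption: only these 022 subfields carry ISSNs.
ISSN_SUBFIELDS = {"a", "l", "m", "y", "z"}


def normalise_standard(record):
    """Apply the three example rules and return a new record."""
    normalised = []
    for tag, subfields in record:
        subfields = list(subfields)
        if tag == "022":
            # Rule 1: lower-case "x" becomes upper-case "X" in ISSNs.
            subfields = [
                (code, value.replace("x", "X")) if code in ISSN_SUBFIELDS
                else (code, value)
                for code, value in subfields
            ]
        elif tag == "245":
            # Rule 2: [computer file] becomes [electronic resource] in $h.
            subfields = [
                (code, value.replace("[computer file]", "[electronic resource]"))
                if code == "h" else (code, value)
                for code, value in subfields
            ]
        elif tag.startswith("6") and subfields:
            # Rule 3: $x Periodicals becomes $v Periodicals, but only
            # when it is the last subfield in the tag.
            code, value = subfields[-1]
            if code == "x" and value.rstrip(" .") == "Periodicals":
                subfields[-1] = ("v", value)
        normalised.append((tag, subfields))
    return normalised


# Made-up sample record for illustration.
sample = [
    ("022", [("a", "1234-567x")]),
    ("245", [("a", "Example journal"), ("h", "[computer file]")]),
    ("650", [("a", "Science"), ("x", "Periodicals.")]),
    ("650", [("x", "Periodicals"), ("z", "Great Britain")]),
]
for tag, subfields in normalise_standard(sample):
    print(tag, subfields)
```

In the sample, the second 650 field is left alone because its `$x` is not the last subfield.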
![Page 6: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/6.jpg)
Data normalisation - tailored
• Bib. data and holdings are tailored for each library, e.g.:
• Transfer 930$y to 852$b
• Transfer 930$m to 852$3
• Transfer 930$1 to 852$h
• If the 022 tag is not in the format of 4 digits dash 4 digits, then reformat
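A library-specific rule set like the one above might look roughly as follows. This is a sketch under stated assumptions rather than SUNCAT's implementation: the function names, the subfield representation (a list of code and value pairs), and the decisions to allow "X" as the final ISSN character and to leave unrepairable values untouched are all hypothetical.

```python
import re

# Subfield mapping taken from the slide: 930$y -> 852$b, 930$m -> 852$3,
# 930$1 -> 852$h. Other 930 subfields are ignored in this sketch.
MAP_930_TO_852 = {"y": "b", "m": "3", "1": "h"}


def transfer_930_to_852(subfields_930):
    """Build 852 subfields from the mapped 930 subfields, in input order."""
    return [
        (MAP_930_TO_852[code], value)
        for code, value in subfields_930
        if code in MAP_930_TO_852
    ]


def reformat_issn(value):
    """Return value as NNNN-NNNN where possible, otherwise unchanged.

    Assumption: the last character may be a check character "X".
    """
    if re.fullmatch(r"\d{4}-\d{3}[\dX]", value):
        return value  # already in the expected shape
    # Keep only digits and the check character, then re-insert the dash.
    compact = re.sub(r"[^0-9Xx]", "", value).upper()
    if re.fullmatch(r"\d{7}[\dX]", compact):
        return compact[:4] + "-" + compact[4:]
    return value  # cannot be repaired automatically; leave as supplied


# Made-up local holdings data for illustration.
local_930 = [("y", "Main Library"), ("m", "Print"), ("1", "PER 050"), ("z", "note")]
print(transfer_930_to_852(local_930))
print(reformat_issn("1234 567x"))
```

Keeping the mapping in a dictionary means each contributing library could have its own table without changing the transfer logic.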
![Page 7: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/7.jpg)
Incoming data
![Page 8: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/8.jpg)
Incoming data (II)
![Page 9: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/9.jpg)
Incoming data (III)
![Page 10: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/10.jpg)
Incoming data (IV)
![Page 11: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/11.jpg)
Normalised data
![Page 12: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/12.jpg)
Impact of (non)-use of data standards
• Lack of consistency across records
• Not matching with other records due to paucity of data / different data used to describe the same item
• Multiple records in the same library catalogued differently
• Data not homogeneous even within one library catalogue, let alone the 100 in SUNCAT
![Page 13: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/13.jpg)
Satellite titles
![Page 14: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/14.jpg)
Existing matching algorithm
• Based on that originally used by the California Digital Library
• Adapted by SUNCAT to include extra MARC fields
• Points based
• Weighted to have non-matches rather than mis-matches
• Good for standardised materials
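The general idea of a points-based matcher that prefers non-matches to mis-matches can be sketched as below. The slides do not give the fields, weights, or threshold used by SUNCAT or the California Digital Library, so every value here (`WEIGHTS`, `THRESHOLD`, the field names, the simple case-insensitive comparison) is a hypothetical choice made purely for illustration.

```python
# Hypothetical weights: points awarded when a field agrees in both records.
WEIGHTS = {"issn": 50, "title": 30, "publisher": 10, "start_year": 10}

# Hypothetical threshold, set high so that doubtful pairs stay unmatched
# (a non-match) instead of being merged wrongly (a mis-match).
THRESHOLD = 80


def match_score(record_a, record_b):
    """Sum the points for every field present and equal in both records."""
    score = 0
    for field, points in WEIGHTS.items():
        value_a = record_a.get(field)
        value_b = record_b.get(field)
        if value_a and value_b:
            if value_a.strip().lower() == value_b.strip().lower():
                score += points
    return score


def is_match(record_a, record_b, threshold=THRESHOLD):
    """Treat two records as the same serial only at or above the threshold."""
    return match_score(record_a, record_b) >= threshold


# Made-up records for illustration.
full = {"issn": "1234-5678", "title": "Example Journal", "publisher": "Example Press"}
same = {"issn": "1234-5678", "title": "example journal "}
sparse = {"title": "Example Journal"}

print(is_match(full, same))
print(is_match(full, sparse))
```

A sparse record scores few points because missing fields earn nothing, which is one way paucity of data leads to records not matching.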
![Page 15: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/15.jpg)
New matching algorithm
![Page 16: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/16.jpg)
Conclusions
• It would be much simpler if everyone followed the existing standards, whether that be for content or carrier!
• BUT – that’s not going to happen.
• So, we know that we’ll have to keep on trying to standardise the non-standard.
• The joys of cataloguing in a shared environment!
![Page 17: Deep Impact: Metadata and SUNCAT](https://reader036.vdocuments.us/reader036/viewer/2022062511/54b6a4e24a7959092b8b4630/html5/thumbnails/17.jpg)
Any questions?