An Arizona Model for Capturing and Describing Documents on
the Web
Richard Pearce-Moses
Director of Digital Government Information
Arizona State Library, Archives and Public Records
rpm at lib.az.us
What Does WWW Stand For?
They both abbreviate to WWW
Rugged Individualism
Lack of standards ~ Lawlessness
[Collage of Robert Conrad as James West in the Wild, Wild West removed to
avoid violation of copyright.]
The Dream
To collect, manage, preserve, and make useful the
enormous amount of digital information
our culture is now producing
The Reality
Two Approaches
Bibliocentric (Item-by-Item)
Tech-centric (Capture-It-All)
Emphasis on Software Tools and Technology
Limited Assistance from Content Providers
Library of Congress & NDIIPP
University of Illinois at Urbana-Champaign
School of Library and Information Science
OCLC
Content Providers
• Tufts University Perseus Project
• Michigan State University Library
• State libraries: Arizona, Connecticut, Illinois, North Carolina, Wisconsin
• UIUC partners: NCSA • WILL-AM/FM/TV • Information Management Services
Digital Archives
Libraries
Artificial collections • Item-Level Control
Archives
Provenance • Original Order • Hierarchy • Aggregate Control
Websites as Archival Collections
Documents of Common Provenance
Organized into Directories (Archival Series)
Publications v. Records
The Art and Craft of Building a Collection
What we do remains the same
How we do it will change
※
Identification/Selection
Acquisition
Description
Reference
Preservation
Identification — Where Do We Look?
Finding the Forest: az.gov • state.az.us
※
Domain Tool
• Identifies all distinct domains
• Reports new sites since previous spider
• Reports when sites disappear
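
The three reports listed for the Domain Tool can be pictured as set comparisons between two spider runs. The following is only a minimal sketch, not the project's actual tool: the function names and the sample host names are hypothetical, and it assumes each spider run yields a plain list of URLs.

```python
from urllib.parse import urlsplit


def distinct_domains(urls):
    """Return the set of host names that appear in a list of URLs."""
    hosts = set()
    for url in urls:
        host = urlsplit(url).hostname  # lowercased by urlsplit; None if absent
        if host:
            hosts.add(host)
    return hosts


def compare_spiders(previous_urls, current_urls):
    """Report hosts that are new, and hosts that disappeared, since the previous spider."""
    previous = distinct_domains(previous_urls)
    current = distinct_domains(current_urls)
    return {
        "new": sorted(current - previous),
        "disappeared": sorted(previous - current),
    }


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    before = ["http://agency-a.az.gov/reports/", "http://agency-b.state.az.us/"]
    after = ["http://agency-a.az.gov/news/", "http://agency-c.az.gov/"]
    print(compare_spiders(before, after))
```

A host counts as "disappeared" here simply because no URL on it turned up in the current run; a real tool would probably want to confirm that over several runs before reporting it.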
Selection: Which Collections Do We Harvest?
Collection-Level Analysis
Macro appraisal sets priorities
Materials appraised as series
Content Providers Taxonomy Tool
Names • Administrative history
Relationships • Subjects • Functions
Selection: Which Documents Do We Harvest?
Identify Series
• Aggregate selection
• Set frequency of harvests
Site Analysis Tool
• Display structure
• Harmonize physical, intellectual structure
• Identify inaccessible content
• Show what’s new
• Show significant changes
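
Two of the Site Analysis Tool functions, showing what is new and showing significant changes, can be sketched by comparing two harvests grouped by directory (the series). This is an illustration under stated assumptions, not the tool itself: each harvest is assumed to be a mapping from URL to extracted text, and "significant" is assumed to mean a text similarity below an arbitrary threshold (`min_similarity`, a hypothetical parameter).

```python
import difflib
import posixpath
from collections import defaultdict
from urllib.parse import urlsplit


def series_of(url):
    """Treat the directory part of the URL path as the series."""
    directory = posixpath.dirname(urlsplit(url).path)
    return directory or "/"


def compare_harvests(previous, current, min_similarity=0.9):
    """Group new and significantly changed documents by series.

    previous and current map URL -> document text. A document counts as
    significantly changed when its similarity to the earlier text falls
    below min_similarity (an assumed, tunable threshold).
    """
    report = defaultdict(lambda: {"new": [], "changed": []})
    for url, text in current.items():
        if url not in previous:
            report[series_of(url)]["new"].append(url)
        else:
            similarity = difflib.SequenceMatcher(None, previous[url], text).ratio()
            if similarity < min_similarity:
                report[series_of(url)]["changed"].append(url)
    return dict(report)
```

Grouping by directory keeps the report at the series level, so a curator reviews a handful of series rather than every document.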
Description
To be able to locate documents
• when the creator or provenance is known
• when the subject is known
• and to aid in selection as to character
Series Description
• Make directory name a meaningful title
• Scope and contents note
• High-level subject headings
• Recorded in site analysis tool database
Document Description
• Creator: taxonomy, internal metadata
• Title: from internal metadata, noun phrases
• Subject: from series metadata, internal metadata
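
One way to read the Document Description slide is that each document inherits what it can from its series and adds whatever its own internal metadata supplies. The sketch below assumes the internal metadata lives in HTML `<title>` and `<meta name=... content=...>` elements and that the series record is a simple dictionary; both are assumptions for illustration, as are all names in the code. Noun-phrase extraction for titles is not attempted.

```python
from html.parser import HTMLParser


class MetadataParser(HTMLParser):
    """Collect the <title> text and <meta name=... content=...> pairs."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.meta = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and attrs.get("name") and attrs.get("content"):
            self.meta.setdefault(attrs["name"].lower(), []).append(attrs["content"])

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def describe_document(html, series):
    """Build a document description from internal and series-level metadata."""
    parser = MetadataParser()
    parser.feed(html)
    return {
        # Creator: internal metadata if present, otherwise the series creator.
        "creator": parser.meta.get("creator") or series.get("creator", []),
        # Title: from internal metadata only.
        "title": parser.title.strip() or None,
        # Subject: series-level subjects first, then the document's own.
        "subject": series.get("subject", []) + parser.meta.get("subject", []),
    }
```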
Access
Finding Aids
• A valuable bird’s-eye view for archivists
• Of limited value to patrons . . .
• Unless they’re transformed into topic maps
Full Text Search Engines
• Ranking Algorithms
• Categorization / Packaging Results
• Based on series-level metadata
• Based on autoclassification
Description and Access
Series-Level Description
name=“Creator” Governor’s Drought Task Force Rural Watershed Alliance
name=“Subject” reservoirs ground water
name=“Subject” drought water conservation
name=“Subject” potable water agriculture
name=“Type” planning reports
Categorized Results
Your search for water, Phoenix
Found documents in the following categories
• water (500+)
• water conservation (357)
• Salt River Project (210)
• drought (110)
• flood control (98)
• xeriscape (25)
Found documents from the following agencies
• Water Resources (135)
• Governor's Drought Task Force (102)
• Phoenix (87)
• Maricopa County (84)
• Corporation Commission (35)
Administration / Curation / Stewardship
Systematic
• Regular Workflows
• Not Idiosyncratic
Collaborative
• Consensual, Not Idiosyncratic
• Avoid Redundant Efforts
Quality Control
• Need for Good Metrics
• Need for Regular Audits
Stay Tuned . . . .