etana-dl managing complex information applications: an archaeology digital library this research is...
TRANSCRIPT
ETANA-DLManaging complex information applications: An archaeology digital library
This research is funded in part by NSF-ITR grant #IIS-0325579
Edward A. Fox, Virginia Tech
James W. Flanagan, Case Western Reserve University
ASOR Annual Meeting, AtlantaNovember 21,
2003
Acknowledgements
• Karen Borstad, MPP
• Douglas Clark, Walla Walla College
• Joanne Eustis, CWRU
• Weiguo Fan, Virginia Tech
• Nick Fischio, CWRU
• Paul Gherman, Vanderbilt U.
• Marcos Goncalves, Virginia Tech
• Larry Herr, Canadian University College
• Christopher Holland, LRP
• Paul Jacobs, Mississippi State U.
• Douglas Knight, Vanderbilt U.
• Stan LaBianca, Andrews U.
• Ming Luo, Virginia Tech
• David McCreery, Willamette U.
• Unni Ravindranathan, Virginia Tech
• Jack Sasson, Vanderbilt U.
• Rao Shen, Virginia Tech
• Ricardo Torres, U. Campinas, Brazil
• Randall Younker, Andrews U.
Overview
An Archaeology Digital Library (DL)
ETANA-DL Architecture
Digital Libraries, 5S Framework -> Structures
Open Archives Initiative, Open Digital Libraries
Canned Demonstration
Conclusions
Discussion: Archaeology DL Requirements
NSF ITR Funding
IT Research Digital library: Integration of DB,
HCI, HT, IR, LIS, MM, …
Complexity! Variety! Distributed! => 5S Framework + OAI / ODL
Archaeology Research
Multiple sites Multiple kinds of
artifacts Multiple terminologies
General/special services Multiple views Hypothesis testing Rapid publication
Map courtesy: www.enchantedlearning.com
Current ETANA-DL Member Locations
Virginia Tech
Mississippi State University
Vanderbilt University
Canadian University College
Walla Walla College
Andrews University
CWRU
Willamette University
ETANA Website
Lahav Website
Nimrin Website
Umayri/MPP Website
ETANA-DL Website
Overview
An Archaeology Digital Library (DL)
ETANA-DL Architecture
Digital Libraries, 5S Framework -> Structures
Open Archives Initiative, Open Digital Libraries
Canned Demonstration
Conclusions
Discussion: Archaeology DL Requirements
ETANA-DL Architecture
Users Services DataETANA-DL
UnionServices Users
DigKit
DigBase
ETANA Digital Library Core Components - DigKit
DigKit (DK) Tools for collecting and recording
archaeological data in the field Metadata will migrate to DigBase
(DB)
Real-time collaborative archaeology: metadata in DB will be rapidly available to others
ETANA Digital Library Core Components - DigBase
DigBase (DB) Central repository - stores metadata Union catalog - for collections that are in
ETANA-DL Various kinds of digital objects – excavation
records, images, text collections, etc. General services - Search, Browse, Annotate,
Recommend, etc. Archaeology-specific services - artifact
analysis, visualizations, artifact interpretation, workflows, etc.
ETANA-DL Architecture
DigBase and DigKit
Lahav
Nimrin
Umayri
Hisban
Megiddo
Jalul
New Sites
DATABASE
WRAPPERS
ETANA-DLUNION
CATALOG
SearchUSER
INTERFACE
Browse
Recommend
Note
Personalize
Review
Visualizations
ArchaeologySpecific
DigKit DigBase
Work in progress
…
Overview
An Archaeology Digital Library (DL)
ETANA-DL Architecture
Digital Libraries, 5S Framework -> Structures
Open Archives Initiative, Open Digital Libraries
Canned Demonstration
Conclusions
Discussion: Archaeology DL Requirements
ContentTypes
TextDocuments
VideoAudio
GeographicInformation
Software,Programs
BioInformation
Images andGraphics
Articles,Reports,Books
Speech,Music
(Aerial)Photos
ModelsSimulations
GenomeHuman,animal,plant
2D, 3D,VR,CAT
Digital Library Content
Computing (flops) - e.g., VT’s Terascale Computing Facility, 10 teraflops - 3rd fastest in world
Digital Content
Com
mun
icat
ions
(ban
dwid
th, c
onne
ctiv
ity)
Digital Libraries in Computing andCommunications Technology Space
Digital Libraries technologytrajectory: intellectualaccess to globally distributed information
less more
5S Model - Informally
Digital libraries are complex information systems that: help satisfy info needs of users (societies) provide info services (scenarios) organize info in usable ways (structures) present info in usable ways (spaces) communicate info with users (streams)
5S in Archaeology - Structures
Streams
Structures
Spaces
Scenarios
Societies
5S
RegionsExample: Madaba Plains
5S in Archaeology – Structures
(contd.)
REGION *PARTITION *SUB-PARTITION
*LOCUS*CONTAINER*FIND
*SITE
has
subdivided
subdivided
has
hascontains
LahavNimrinUmayri
Lahav:FieldNimrin:QuadUmayri:Field
Lahav:AreaNimrin:Quad
Umayri:Square
BonePotterySeed
Figurine
Lahav:BasketNimrin:BagUmayri:Pail
Below, Above, Co-existing
*Specific-FINDing
HumanMandible
…
planned
Site Partition Sub-partition
Locus Container
Lahav FieldI
AreaA8
LocusA8074
Basket224
Nimrin QuadrantNW
Quadrant Value
N25/W50
Locus96
Bag240
Umayri FieldA
Square7J59
Locus001
Pail12
5S Structural Model Organization
Data Organization in ETANA-DL
Bone Seed Figurine
ETANA-DLObject
Count
Animal
……
Species
Name
……
Description
Dimensions
……
Owner
Subpartition
PartitionLocus
ID Container
Collection
……
Database Representation
QUAD NW/EW Locus Animal Bone
NW N40/W25 178 SHEEP/GOAT METAPODIAL
SW S40/W170 1 HOMO SAPIENS
-
NE N50/E50 - UNIDENTIFIED UNIDENTIFIED
…..
•
•
•
A Sample Bone Record in XML
<etana:OBJECT>
<etana:ID> 1 </etana:ID>
<etana:COLLECTION>Nimrin</etana:COLLECTION>
<etana:OBJECTTYPE>Bone</etana:OBJECTTYPE>
<etana:OWNERID>[email protected]</etana:OWNERID>
<etana:PARTITION> NW </etana:PARTITION>
<etana:SUBPARTITION> N40/W25 </etana:SUBPARTITION>
<etana:LOCUS> 178 </etana:LOCUS>
<etana:CONTAINER> 212 </etana:CONTAINER>
<etana:BONE>
<etana:AGES> IRON II </etana:AGES>
<etana:AGE> 900-800 BC </etana:AGE>
<etana:BONENAME> METAPODIAL </etana:BONENAME>
<etana:ANIMAL> SHEEP/GOAT </etana:ANIMAL>
……
</etana:BONE>
</etana:OBJECT>
Overview
An Archaeology Digital Library (DL)
ETANA-DL Architecture
Digital Libraries, 5S Framework -> Structures
Open Archives Initiative, Open Digital Libraries
Canned Demonstration
Conclusions
Discussion: Archaeology DL Requirements
Open Archives Initiative
OAIwww.openarchives.org
DiscoveryCurrent
AwarenessPreservation
Service Providers
Data Providers
Meta
data
harv
estin
g
The World According to OAI
Some OAI Data Providers
Analytical Sciences Digital Library California Digital Library Repository Caltech Archives Oral Histories Online Carnegie Mellon U Informedia Public Domain Video Archive DSpace at MIT Library of Congress Open Archive Initiative Repository Perseus Digital Library The University of Michigan Library The University of Tennessee Library University of Illinois Library U of Pittsburgh Electronic Thesis and Dissertation Archive Virginia Tech ImageBase
repository
repos i tory
OAI protocol
harves ter
supportdata
harvestingdata
items
selective harvesting - datestamps
repos i tory
harvest withindate range
record
record
Data and Service Providers
Data Providers possess metadata and share it (internally / externally) via well-defined OAI protocols (e.g., database servers)
Service Providers harvest data from Data Providers provide higher-level services to users (e.g., search engines)
Who will fit where in ETANA-DL? Data Provider – YOUR PROJECT Service Provider – ETANA-DL
What then is an Open Archive?
Any WWW-based system accessed through the well-defined interface of Open Archives Protocol for Metadata Harvesting
Also known as OAI-Compliant Repository No implications for:
Physical storage of data Cost of data Metadata and data formats Access control to server
Will my current digital system be affected? NO An Open Archive is built separately without
disturbing the data or the current system
Introduction to ODL(Open Digital Libraries)
Open Digital Libraries Framework for componentized Digital Libraries
Design principles for components Protocols for inter-component communications
Built upon OAI
Traditional Digital Libraries
?1010100101010010101010010101010101010101
Program
1010100101010010101010010101010101010101
Document
1010100101010010101010010101010101010101
Document
1010100101010010101010010101010101010101
Document
1010100101010010101010010101010101010101
Program
1010100101010010101010010101010101010101
Program
1010100101010010101010010101010101010101
Image
1010100101010010101010010101010101010101
Image
1010100101010010101010010101010101010101
Image1010100101010010101010010101010101010101
Video
1010100101010010101010010101010101010101
Video
1010100101010010101010010101010101010101
Video?Monolithic
and/orCustom-built
web-basedapplication
Users Digital Library
Digital Objects
Open Digital Libraries Approach
Users ETANA-DL Sites
1010100101010010101010010101010101010101
1010100101010010101010010101010101010101
Bone
Search Filter
Union
Recent
Browse
US
ER
INT
ER
FA
CE
Filter
1010100101010010101010010101010101010101
1010100101010010101010010101010101010101
Seed
1010100101010010101010010101010101010101
1010100101010010101010010101010101010101
Figurine
1010100101010010101010010101010101010101
1010100101010010101010010101010101010101
Pottery
Basic ODL Model: An application for Archaeology
OAI Data Provider
OAI-PMH
ODL Protocol
User Interface
Nimrin
ETANA-DLUnion Catalog
OAI-PMH
ETANA-DL Search Engine
ODL Service ProviderComponent
WWW Interface
ODL Protocol
ODL Protocol
Overview
An Archaeology Digital Library (DL)
ETANA-DL Architecture
Digital Libraries, 5S Framework -> Structures
Open Archives Initiative, Open Digital Libraries
Canned Demonstration
Conclusions
Discussion: Archaeology DL Requirements
Home Page
Login Page
New Account
New Account Creation Page
Navigations
ETANA-DL Tutorial
CollectionsDescription
Items of Interest, Marked Items,Browse, Search, etc.
Collections Description Page
More information
about the site
Search Results
Links toresult pages
Objectsbelonging to site: Lahav,
area:G6,and locus:G6006
Objectsbelonging to site: Lahav
and Area:K7
Search Query /Number of Hits
Detailed Display (Lahav Figurine)
FromLahavdata
collection
Lahav Term
s
Field
Area
Locus
Basket
Detailed Display (Nimrin Seed)
Nimrin Terms
Quad
Quadrant
Locus
Bag
Detailed Display (Umayri Bone)* *Data Integration in
progress
Umayri Terms
Field
Square
Locus
Pail
…..
Advanced Search Page
Site Partition Sub-partition
Locus Container
Lahav FieldI
AreaA8
LocusA8074
Basket224
Nimrin QuadrantNW
Quadrant Value
N25/W50
Locus96
Bag240
Umayri FieldA
Square7J59
Locus001
Pail12
5S Structural Model Organization
Advanced Search Options Example
Search Results
Query
Browsing
Fields in Umayri’sbone collection
Browsing - II
Squares in Field ‘H’
Records inField ‘H’ for Umayri
Adding to Items of Interest
Add item topersonal collection
Items of Interest Display
Objectsuser is
interested
Marking an Item
Mark Items
Marking – writingnotes for
a specific user
Marking Items
Marked Items Display
Sender, Date,Object OAI ID
SenderComments
Options:View Record,
Add record to Items Of Interest,Re-mark item (Redirect),
Unmark item (Remove item from list)
Discussions Page
Discussions about an
object
View/Post messages, create new
threads
Recommendations
Items recommendedon the basis of
similar interests
Overview
An Archaeology Digital Library (DL)
ETANA-DL Architecture
Digital Libraries, 5S Framework -> Structures
Open Archives Initiative, Open Digital Libraries
Canned Demonstration
Conclusions
Discussion: Archaeology DL Requirements
Conclusions
ETANA-DL: integrated services built upon many archaeology projects
Harvesting (, OAI, ODL) 5S Framework: Model, Tailored
Generation Links for more information Welcome collaboration!
Harvesting vs. Federation
Competing approaches to interoperability Federation is when services are run remotely on
remote data (e.g., Meta-searching) Harvesting is when data/metadata is transferred
from the remote source to the destination where the services are located (e.g. Union catalogues)
Federation requires more effort at each remote source but is easier for the local system and vice versa for harvesting
OAI currently focuses on harvesting
5S Model
Models Examples Objectives
Stream Text; video; audio; image Describes properties of the DL content such as encoding and language for textual material or particular forms of multimedia data
Structures Collection; catalog; hypertext; document; metadata; organization tools
Specifies organizational aspects of the DL content
Spaces Measure; measurable, topological, vector, probabilistic
Defines logical and presentational views of several DL components
Scenarios Searching, browsing, recommending,
Details the behavior of DL services
Societies Service managers, learners, Teachers, etc.
Defines managers, responsible for running DL services; actors, that use those services; and relationships among them
5SLGen: Automatic Digital Library Generation
Links to Resources
•ETANA Home Pagehttp://www.etana.org
•ETANA-DL Home Pagehttp://feathers.dlib.vt.edu
•ETANA-DL Prototypehttp://feathers.dlib.vt.edu:8080/etana/servlet/Start
•Proposal submitted to NSF-ITRhttp://feathers.dlib.vt.edu/ETANAProposal.pdf
•Open Archives Initiativehttp://www.openarchives.org
•OAI Metadata Harvesting Protocolhttp://www.openarchives.org/OAI/openarchivesprotocol.htm
•Virginia Tech DLRL Projectshttp://www.dlib.vt.edu/
Overview
An Archaeology Digital Library (DL)
ETANA-DL Architecture
Digital Libraries, 5S Framework -> Structures
Open Archives Initiative, Open Digital Libraries
Canned Demonstration
Conclusions
Discussion: Archaeology DL Requirements