ARCHIVE IMAGING SEARCHABLE VIA THE WEBPAC
Marthie de Kock The Hong Kong Institute of Education
9 December 2002
Education Imaging System (EdIS)
Hong Kong Institute of Education Library
3
Points for discussion
• Scope and functions
• EdIS Phase I
• EdIS Phase II
• Background
• Different document classes
• Data retrieval & searching
• INNOPAC and the Z server
4
Scope
• Provide a sophisticated system to manage the growing body of electronic media, including text, black & white scanned images, colour photos, audio, video and multimedia presentations, held in or available to the HKIEd Library.
• Provide an effective web interface to retrieve on-line digitised materials.
5
System Functions
• Capture of content, storage & management
• Scanning & OCR
• Supports both English and Chinese indexing and full text searching
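The bilingual indexing listed above can be illustrated with a minimal sketch. It is not the EdIS implementation: it assumes a simple inverted index in which English text is split into lowercase words and Chinese text into overlapping two-character tokens (bigrams), a common approach when no word segmenter is available. The function names and the sample records are hypothetical.

```python
from collections import defaultdict
import re

# Assumption: the basic CJK Unified Ideographs block is enough for this sketch.
TOKEN_RE = re.compile("[\u4e00-\u9fff]+|[A-Za-z0-9]+")


def tokenize(text):
    """Split text into lowercase English words and overlapping Chinese bigrams."""
    tokens = []
    for run in TOKEN_RE.findall(text):
        if "\u4e00" <= run[0] <= "\u9fff":
            if len(run) == 1:
                tokens.append(run)
            else:
                # Overlapping bigrams, e.g. a three-character run yields two tokens.
                tokens.extend(run[i:i + 2] for i in range(len(run) - 1))
        else:
            tokens.append(run.lower())
    return tokens


def build_index(docs):
    """Map each token to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            index[token].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every token of the query."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result


# Hypothetical sample records, for illustration only.
docs = {
    1: "Dance education in Hong Kong",
    2: "香港教育學院圖書館",
    3: "Exam papers 教育",
}
index = build_index(docs)
```

With these sample records, `search(index, "dance")` matches record 1 and `search(index, "教育")` matches records 2 and 3, so a single index serves both languages.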
6
Background
First digital library initiative of the HKIEd Library
• Joint project between IBM & the Library, with technical support from ITS
• July 1997 - signed contract with IBM and its Digital Library
• 23 June 1998 - the system was launched
7
Search Interface of EdIS > The Main Screen
8
Contents of EdIS Phase I: Four Document Types
Document types          Digitised items
Newspaper clippings     Image scanning & OCR
Examination papers      Image scanning & OCR
Curriculum materials    Multimedia objects
Student projects        Multimedia objects
9
Document Types: News Clippings & Exam Papers
• News clippings:
  • Past newspaper clippings
  • Scanning, OCR, indexing
  • Wiser News indexing & CMC operations
• Exam papers:
  • Departments
  • Scanning, OCR, indexing
10
Document Types: Curriculums & Student Projects
• Digitising procedures included:
  • Content analysis
  • Categorise multimedia objects
  • Write a summary
  • Digitise materials, saving files with logical file names, web page design & preparing scripts for uploading
  • Upload documents & testing
11
Basic Search Screen of Curriculum Materials
12
Search results screen of [Title = dance]
13
Selecting the target page from the hit list
14
EdIS Phase II
• Include Archive materials
• Improve multimedia searching
• Search Archive materials via INNOPAC
• No response – IBM’s DL and CMC
• June 2001 – new tender specifications
• Vitova
15
EdIS Phase II Development
• Customise system
• Project development – July 2001
• Z server
• System delivered – April 2002
• Interface – uploading of Wiser News
16
System Architecture
Three subsystems:
• Client subsystem
  • The front-end PC workstations with a Netscape or Microsoft web browser are available for record retrieval and viewing.
• Capturing subsystem
  • Used for content preparation (scanning, OCR and indexing)
• Server subsystem
  • The production server stores records and manages the system's operations
17
Configuration
• Hardware:
  • SUN Enterprise 250 server
  • 36 GB data storage space
  • Configured as RAID 0 (disk mirror)
• Operating software:
  • ORACLE Database 8i for SUN Sparc Solaris Unix 2.7
  • Z39.50 server for document searching
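To make the role of the Z39.50 server more concrete, the sketch below builds a search expression in Prefix Query Format (PQF), the textual notation many Z39.50 toolkits accept. It only constructs query strings and does not contact any server. The Use attribute numbers come from the Bib-1 attribute set; the helper names, the quoting rule and the choice of fields are assumptions for illustration, and nothing here is taken from the EdIS or INNOPAC configuration.

```python
# Bib-1 "Use" attribute values (attribute type 1) for a few common access points.
BIB1_USE = {"title": 4, "subject": 21, "author": 1003, "any": 1016}


def quote(term):
    """Wrap a term in double quotes, escaping backslashes and quotes (assumed rule)."""
    return '"' + term.replace("\\", "\\\\").replace('"', '\\"') + '"'


def pqf_clause(field, term):
    """Build one PQF clause, e.g. a title search for a single term."""
    return f"@attr 1={BIB1_USE[field]} {quote(term)}"


def pqf_and(*clauses):
    """Combine clauses with the binary @and operator in prefix notation."""
    if not clauses:
        raise ValueError("at least one clause is required")
    query = clauses[0]
    for clause in clauses[1:]:
        query = f"@and {query} {clause}"
    return query


# A title search like the one shown on the results slide, [Title = dance].
title_query = pqf_clause("title", "dance")
```

Here `title_query` is the string `@attr 1=4 "dance"`, and `pqf_and` nests further clauses in front of it when several criteria are entered together.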
18
Hardware and software
• Application software
  • VitalDoc Document Imaging system - 40-user license
  • Two VitalScan licenses for desktop scanning and OCR
  • Chinese OCR - TsingHua Wintone ver. 8.0
21
Other hardware
• Two scanning/OCR workstations
• Minolta PS7000 Scanner
• Ricoh IS330DC DF and Flatbed scanner
27
Typical Searching Procedure
• Select Class/Database
• Enter Searching Criteria
• Browsing Hit List
• View Result/Content
• Review History
• New Search
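The search steps named on this slide can be read as a small state machine. The sketch below is one possible reading, not a description of the actual EdIS interface: the ordering of the steps, the allowed transitions, and the treatment of "New Search" as a return to class selection are all assumptions, as are the names `Step`, `TRANSITIONS` and `is_valid_session`.

```python
from enum import Enum


class Step(Enum):
    SELECT_CLASS = "Select Class/Database"
    ENTER_CRITERIA = "Enter Searching Criteria"
    BROWSE_HITS = "Browsing Hit List"
    VIEW_RESULT = "View Result/Content"
    REVIEW_HISTORY = "Review History"


# Assumed transitions; "New Search" is modelled as going back to SELECT_CLASS.
TRANSITIONS = {
    Step.SELECT_CLASS: {Step.ENTER_CRITERIA},
    Step.ENTER_CRITERIA: {Step.BROWSE_HITS},
    Step.BROWSE_HITS: {Step.VIEW_RESULT, Step.SELECT_CLASS},
    Step.VIEW_RESULT: {Step.BROWSE_HITS, Step.REVIEW_HISTORY, Step.SELECT_CLASS},
    Step.REVIEW_HISTORY: {Step.SELECT_CLASS},
}


def is_valid_session(steps):
    """Check that a session starts with class selection and follows the transitions."""
    if not steps or steps[0] is not Step.SELECT_CLASS:
        return False
    return all(nxt in TRANSITIONS[cur] for cur, nxt in zip(steps, steps[1:]))


# A typical session: pick a class, search, browse the hit list, open one record.
typical = [
    Step.SELECT_CLASS,
    Step.ENTER_CRITERIA,
    Step.BROWSE_HITS,
    Step.VIEW_RESULT,
]
```

Under these assumptions `is_valid_session(typical)` returns `True`, while a session that jumps straight from class selection to viewing a result is rejected.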
42
Future?
End