Bridging the gap between the paper past and digital future
1
Bridging the gap between the paper past and digital future
2
About us
The pioneer of XML-based digital archiving solutions, enabling intelligent search and future-proof preservation
• Founded in 1999
• Powering 5 of the top ten US publishers' archives
• Recognized leader of top North American university and library digital archive projects
• 400 customers
• 2 million end users
3
Leading Customers
• Washington Post
• Financial Times
• LA Times
• Daily Telegraph
• Reed Elsevier
• Time Warner
• US DoD
• IL Judicial System
• UK Kings College
• Open University IL
• Brooklyn Public Library
• Colorado State Library
• The Scotsman
• Pennsylvania State University Library
• Quincy Public Library
• The British Library
• Oxford University
• Israel National Library
• Tel Aviv University
• And more…
4
Olive Digital Archiving Technology
• Smart digitization and transformation…
• Intelligent search and view
• Long-term preservation
5
Special Features
• Automatic digitization and XML transformation
  - Instantly transforms microfilm, microfiche and digital files into XML/PDF digital repositories - up to 500,000 items a day!
• Content enrichment
  - Structure analysis, automatic metadata extraction, classification, smart tagging
• Future-proof archiving
  - The XML-based, unified content vision protects the content from future technology changes, ensuring long-term preservation
• Research, not search…
  - Olive smart tagging and indexing provides robust data mining and research tools
• Unified viewing interface - browser based
  - Improves information accessibility on any platform
6
Archive Applications & Demos
• NY University demo
• The British Library Project
• Time Magazine Archive
• Newspapers
• Files and records
• Journals / Magazines
• Manchester United
• Multimedia
7
Olive XAP Platform™
• Legacy Content Systems: PDF / Office, Images / Paper, Microfilm, Microfiche
• Olive XML Distiller™
• Olive Warehouse™
• Olive Viewfinder Server™
• XML Schema Transformers: PRISM, OAI, S1000D
• Warehouse Services
• Viewing and ePublishing services
• Web Server (Windows)
• Tools and Applications
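As an illustration of what a schema transformer targeting OAI might emit, the sketch below maps a simple record to an OAI Dublin Core (oai_dc) element. The slides do not describe how Olive's transformers work, so the `to_oai_dc` function, the record fields and the sample values are all hypothetical; only the two namespace URIs come from the OAI and Dublin Core specifications.

```python
import xml.etree.ElementTree as ET

# Namespace URIs defined by the OAI-PMH and Dublin Core specifications.
OAI_DC_NS = "http://www.openarchives.org/OAI/2.0/oai_dc/"
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("oai_dc", OAI_DC_NS)
ET.register_namespace("dc", DC_NS)

# Dublin Core elements this sketch knows how to emit (an arbitrary subset).
DC_FIELDS = ("title", "creator", "date", "type", "identifier")


def to_oai_dc(record: dict) -> str:
    """Serialize a flat record as an oai_dc:dc element (hypothetical helper)."""
    root = ET.Element(f"{{{OAI_DC_NS}}}dc")
    for field_name in DC_FIELDS:
        value = record.get(field_name)
        if value:  # skip fields the record does not carry
            ET.SubElement(root, f"{{{DC_NS}}}{field_name}").text = value
    return ET.tostring(root, encoding="unicode")


# Made-up sample record, for illustration only.
record = {
    "title": "Front page",
    "date": "1999-01-01",
    "type": "Text",
    "identifier": "example-0001",
}
dc_xml = to_oai_dc(record)
print(dc_xml)
```

Keeping the archive's internal record separate from the export format is what lets one repository serve several target schemas; PRISM or S1000D output would need its own mapping function.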
8
Olive XML Distiller™
Instantly transforms microfilm, microfiche and digital files into digital repositories
• Performance - The fastest and most accurate document capturing, digitization and enrichment technology
• Smart transformation - Quality improvement, structure recognition, metadata extraction, tagging and hyperlinking
• Scalable - up to 96 CPUs, parallel image processing architecture - process up to 500,000 pages per day!
• Comprehensive - Dedicated process for microfilm, microfiche, MS-Office and PDF files - Supports 140 languages
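To put the scalability figures in perspective, the following back-of-the-envelope calculation shows what 500,000 pages per day on 96 CPUs means per processor. It assumes, purely for illustration, that work is spread evenly across all CPUs and that the system runs 24 hours a day; the slides state neither.

```python
from __future__ import annotations

SECONDS_PER_DAY = 24 * 60 * 60


def per_cpu_budget(pages_per_day: int, cpus: int) -> tuple[float, float]:
    """Return (pages per CPU per day, seconds available per page on each CPU).

    Assumes an even split of pages across CPUs and round-the-clock operation.
    """
    pages_per_cpu = pages_per_day / cpus
    seconds_per_page = SECONDS_PER_DAY / pages_per_cpu
    return pages_per_cpu, seconds_per_page


pages_per_cpu, seconds_per_page = per_cpu_budget(500_000, 96)
print(f"{pages_per_cpu:.0f} pages per CPU per day")   # 5208 pages per CPU per day
print(f"{seconds_per_page:.1f} s per page per CPU")   # 16.6 s per page per CPU
```

Under these assumptions each CPU has roughly 16.6 seconds to capture, OCR and tag one page.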
9
Olive's XML Distiller™ Technology
Hard Copy, PDF / Office, Microfiche
ECM or Web Server
Capture
• Scanning, check in
• Image Pre-Process:
  - De-skew
  - Cropping
  - Cleaning
• Basic PDF conversion
• Layout zoning
• OCR
• Font encoding fixing
• Bit Map indexing
Tagging
• Metadata extraction
• Classification
• Structural tagging
• Semantic tagging**:
  - Concepts
  - Entities
  - Summaries
• Hyperlinks detection
Output
• Rich tagged PDF
• Olive PrXML schema:
  - ePrint look & feel
  - Rich navigation & search
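The capture, tagging and output stages on this slide can be pictured as a simple three-step pipeline. The sketch below is a toy illustration, not Olive's implementation: the `Page` class and the `capture`, `tag` and `to_xml` functions are hypothetical, whitespace normalization stands in for scanning and OCR, and the output is a made-up XML layout rather than the Olive PrXML schema.

```python
import re
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field


@dataclass
class Page:
    source: str                                   # e.g. "microfiche", "hard copy", "pdf"
    text: str                                     # stand-in for OCR output
    metadata: dict = field(default_factory=dict)  # filled in by the tagging stage
    links: list = field(default_factory=list)     # hyperlinks found in the text


def capture(source: str, raw_text: str) -> Page:
    """Capture stage stand-in: collapse whitespace and drop empty lines."""
    lines = [" ".join(line.split()) for line in raw_text.splitlines()]
    return Page(source=source, text="\n".join(l for l in lines if l))


URL_RE = re.compile(r"https?://[^\s<>\"]+")


def tag(page: Page) -> Page:
    """Tagging stage stand-in: first line as title, regex hyperlink detection."""
    first_line = page.text.split("\n", 1)[0]
    page.metadata = {"title": first_line, "source": page.source}
    page.links = URL_RE.findall(page.text)
    return page


def to_xml(page: Page) -> str:
    """Output stage stand-in: serialize the tagged page as a small XML document."""
    root = ET.Element("document")
    meta = ET.SubElement(root, "metadata")
    for key, value in page.metadata.items():
        ET.SubElement(meta, key).text = value
    ET.SubElement(root, "body").text = page.text
    for url in page.links:
        ET.SubElement(root, "link", href=url)
    return ET.tostring(root, encoding="unicode")


raw = "  Annual   Report 1999 \n\n See http://example.com/archive for details "
xml_out = to_xml(tag(capture("microfiche", raw)))
print(xml_out)
```

Each stage takes and returns a plain `Page`, so stages can be tested or replaced independently, which is the main point of the staged layout.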
10
Olive ViewFinder™
Intelligent document viewing, publishing and delivery system
• Improve information accessibility with a browser-based, unified viewing interface
• Improve productivity - fast access to components or full documents
• Better control - Component level, document usage statistics