addressing metadata in the mpeg-21 and pdf-a iso standards niso workshop: metadata on the cutting...

18
Addressing Metadata in the MPEG-21 and PDF-A ISO Standards NISO Workshop: Metadata on the Cutting Edge May 2004 William G. LeFurgy U.S. Library of Congress [email protected]

Upload: roger-conley

Post on 27-Dec-2015

221 views

Category:

Documents


0 download

TRANSCRIPT

Addressing Metadata in the MPEG-21 and PDF-A ISO StandardsNISO Workshop: Metadata on the Cutting EdgeMay 2004

William G. LeFurgyU.S. Library of [email protected]

MPEG-21 and PDF-A Metadata 2NISO Workshop, May 2004

Presentation Overview

• Outline metadata approaches in two developing International Organization for Standardization specifications:– MPEG-21 (ISO 21000)– PDF-A (ISO 19005)

• Highlight some practical advantages and disadvantages of both

• My personal views, not those of LC

MPEG-21 and PDF-A Metadata 3NISO Workshop, May 2004

Issues to Keep in Mind

• Specs are in process: parts of MPEG-21 await initial approval; PDF-A is not yet approved

• Limited implementation experience• Different communities have not yet

compared different standards in detail• Landscape is sure to change over next

several years

MPEG-21 and PDF-A Metadata 4NISO Workshop, May 2004

MPEG Background• Series of standards from Moving Picture

Experts Group• MPEG-1, -2: Coding methods to compress

A/V streams (DVD, digital TV, MP3 audio)• MPEG-4: Compression plus ability to interact

with A/V objects (Quicktime 6, iTunes AAC)• MPEG-7: Standard way to represent/bundle

multimedia objects using XML; useful for indexing, searching, distributing content

MPEG-21 and PDF-A Metadata 5NISO Workshop, May 2004

MPEG-21 Overview• Builds on MPEG-7 use of XML to

define/constrain/bundle multimedia objects (text, video, audio, etc.)

• Also defines how any kind of user can interact with content

• A comprehensive framework to enable interoperable but controlled use of content from creation, distribution, and consumption

• Protection of commercial IP rights is central

MPEG-21 and PDF-A Metadata 6NISO Workshop, May 2004

MPEG-21 Features

• Multiple modular parts• Part 2, Digital Item Declaration Language

(DIDL) uses XML to express content elements, e.g.: – Container (items grouped as a logical package)– Item (a logical work)– Statement (descriptive, administrative metadata)

• Part 3, Digital Item Identification states how to link unique identifiers to items

MPEG-21 and PDF-A Metadata 7NISO Workshop, May 2004

From: http://www.chiariglione.org/mpeg/standards/mpeg-21/image004.gif

MPEG-21 and PDF-A Metadata 8NISO Workshop, May 2004

MPEG-21 Rights Expression Language • Part 5, Details “machine readable

language to declare rights and permissions”

• Aims to support end-to-end interoperability with access and use controls

• Uses ContentGuard/Microsoft XrML (Extensible Rights Markup Language) spec

MPEG-21 and PDF-A Metadata 9NISO Workshop, May 2004

The REL Data Model

From: http://www.chiariglione.org/mpeg/standards/mpeg-21/image008.gif

MPEG-21 and PDF-A Metadata 10NISO Workshop, May 2004

Digital Repositories and MPEG-21• Los Alamos National Laboratory uses spec to

represent and manage complex digital objects • Serves to package objects for repository

submission, storage, and dissemination as outlined in OAIS model

• Accommodates existing metadata records and enables capture of life cycle management metadata

• Also used in conjunction with NISO OpenURL and OAI-PMH

MPEG-21 and PDF-A Metadata 11NISO Workshop, May 2004

MPEG-21 Pros and Cons• Pro

– International standard– Broad commercial support for earlier MPEG specs– Packages any type of complex digital object per OAIS– Powerful and flexible means to express descriptive,

structural, administrative, and other metadata

• Con– Complex and highly technical– Few tools currently exist– Unclear now the extent to which spec will be adopted– Strong emphasis on commercial rights protection?

MPEG-21 and PDF-A Metadata 12NISO Workshop, May 2004

PDF-A Background

• Goal is to define use of Adobe PDF version 1.4 for long-term preservation purposes

• Effort stems from extensive use of PDF as primary format for records and documents that require long-term retention

• Standard is being developed by ISO working group with representatives from government, industry, academia

MPEG-21 and PDF-A Metadata 13NISO Workshop, May 2004

PDF-A Overview• Defines a constrained version of PDF that

should remain viable for many years:– Audio and video content are forbidden – Javascript and executable file launches are prohibited – All fonts must be embedded and also must be legally

embeddable for unlimited, universal rendering – Colorspaces specified in a device-independent manner – Encryption is disallowed

• Requires a standards-based approach to metadata

MPEG-21 and PDF-A Metadata 14NISO Workshop, May 2004

PDF-A Metadata Features• Based on Adobe Extensible Metadata

Platform (XMP)• XMP is based on Resource Description

Framework (RDF) and XML specifications• Adobe has defined core schemas

(descriptive, media management, rights management, etc.)

• XMP permits broad extensibility for defining customized schemas

• Provides for embedding XML text packets in binary PDF files

MPEG-21 and PDF-A Metadata 15NISO Workshop, May 2004

Digital Repositories and PDF-A

• Not yet tested in a repository context…• …But repositories are (or will soon be)

rife with PDF files from e-government, online publishing, and business workflow activities

• PDF-A addresses conceptual repository needs for a stable file format with rich descriptive and administrative metadata

MPEG-21 and PDF-A Metadata 16NISO Workshop, May 2004

PDF-A Pros and Cons• Pro

– International standard (pending)– Broad commercial support for Adobe PDF spec– Meets many user needs across the digital life cycle – Powerful and flexible means to express descriptive and

administrative metadata

• Con– Complex and highly technical– Few tools currently exist– Unclear now the extent to which spec will be adopted– XMP limitations for validating metadata– Requires another standard to package items per OAIS

MPEG-21 and PDF-A Metadata 17NISO Workshop, May 2004

Further Information: MPEG-21• MPEG-21 Overview v.5: http://www.chiariglione.org/mpeg/standards/mpeg-21/mpeg-21.htm

• Using MPEG-21 DIDL to Represent Complex Digital Objects in the Los Alamos National Laboratory Digital Library

http://tinyurl.com/22e65

• Using MPEG-21 DIP and NISO OpenURL for the Dynamic Dissemination of Complex Digital Objects in the Los Alamos National Laboratory Digital Library http://tinyurl.com/2klwz

• From MPEG-1 to MPEG-21 http://tinyurl.com/3brc8

MPEG-21 and PDF-A Metadata 18NISO Workshop, May 2004

Further Information: PDF-A• PDF-A Committee web site:

http://www.aiim.org/standards.asp?ID=25013

• PDF-A: Developing a File Format for Long-Term Preservation

http://www.rlg.org/preserv/diginews/diginews7-6.html#feature1

• Adobe XMP web site http://www.adobe.com/products/xmp/main.html