click to edit master subtitle style 4/14/12 future of newspaper digitization and preservation...
TRANSCRIPT
Click to edit Master subtitle style
4/14/12
Future of Newspaper Digitization and PreservationPhilippine Association of Academic and Research Librarians, Inc (PAARL) National Summer Conference on “Planning, Developing and Managing Digitization & Research Projects for Libraries and Information Centers, April 18-20, 2012, Coron, Palawan
4/14/12
Why are Newspapers Important?The desire to be informed is a
basic human need. There is no other medium that does this best in written form except newspapers which come periodically and in most cases on a daily basis.
Newspapers are primary sources of local history—events, customs, traditions, attitudes, practices, births and deaths, etc.
4/14/12
Sustainability of Newspapers in PrintNewspaper advertising has been
declining steadily over the past decade. The drop from 2008 to 2009 was 28 % (Ingram, 2010)
Newspapers in print have been closing down since 2007.
Those that adopted hybrid approaches have made sustainability more difficult.
4/14/12
The future of Newspapers is
Digital (Murdock, 2009) accessible by PCs and mobile devices with updates (in full or alerts) every hour or two 24/7. This format will last many years
Pay for online access either by subscription or per view for sustainability.
4/14/12
Print Newspapers in Libraries
Newspapers because of their size, frequency of publication and cheap paper present huge problems in preservation and access
Reformatting into microfilm and digital formats are some solutions but users have criticized libraries for discarding the paper format.
4/14/12
Why Microfilm? Digitize? Microfilm—is used for
preservation. The format under ideal environmental conditions will last 500 years. It reduces the handling of and replaces print.
Digital—is used for access and preservation. It is searchable and available to anyone with a computer/Internet connection. It reduces the handling of and replaces print or film.
4/14/12
Problems with Microfilm
It requires a three generation process to produce a service copy. (Master, Slave, Service Copy)
It requires special storage facilities to keep well.
4/14/12
Challenges of Digitizing NewspapersLarge size of page
Variety and sizes of fonts
Requires indexing for more efficient access
Mixture of text, graphics and pictures on a single page
Layout
Expense
Technology changes
4/14/12
International Digital InitiativesOffically established in 1999, ICON:
International Coalition on Newspapers develops strategies to preserve and improve access to newspapers from around the globe.
It addresses issues about bibliographic access, copyright and information dissemination.
It addresses challenges related to the storage and preservation of international newspaper collections.
It provides access to bibliographic data to more than25,000 newspaper titles.
4/14/12
4/14/12
International InitiativesIllinois Newspaper Project
Since receiving funding in 2009, the INP has digitized four titles that are now available on the Library of Congress' Chronicling America web site:
Chicago Eagle (1892-1920)
The Broad Ax (1895-1922)
The Day Book (1911-1917)
The Cairo Bulletin (1868-1876)
4/14/12
The Rizal Library Digitization ProjectNewspapers on DVD from 2000 to
the present
Can be browsed by date
Searched by keywords
Articles can be cropped and printed, saved or emailed
4/14/12
Why We Did It
To respond to requests of subscribers who
Could not afford the equipment to read and print from microfilm
Could not afford to maintain the necessary environmental conditions for storage of films to prevent deterioration marked by the vinegar syndrome.
4/14/12
How we did it Prior to digitization, the Philippine
Library Materials Project (PLMP) conducted a feasibility and market study.
Findings indicated user preference for digital over microfilm for five newspaper titles
Scanned the market for microfilm scanners
Conducted acceptability studies on the product
Made recommended changes
Produced and marketed acceptable product
4/14/12
How we do it
Select and evaluate microfilm
Obtain permission from the publisher and execute MOA
Scan microfilm using Eclipse Rollfilm Scanner
Convert raw images to a database using DGX software
Index manually
Produce DVD copies of the database with index.
4/14/12
Eclipse Rollfilm Scanner Pre-scan film cleaning (both sides), adaptive speed control (20ppm-300 ppm) OS Windows XP Professional, Film polarities (positive and negative), film size (16 and 35 mm), film types (vesicular, blue and black diazo and silver), file formats (TIFF monochrome, TIFF uncompressed, Multipage TIFF, JPEG, CALS, PDF and JPEG 2000) nextScan Fusion™ softwareOutput grayscale and bitonal at the same time
Microfilm Scanning Equipment
4/14/12
Newspapers on DVD
Titles: 4 titles (2000- present )
1 roll microfilm = 1 DVD, 1 year of microfilm is 12- 24 rolls depending on the title
Title and name indexes and selected subjects
Can crop and print selected articles
4/14/12
Technology ConcernsWill the media last?
Will the media's hardware (reading device) and its controlling software remain available?
How are the new policy issues to be handled (copyright, fair use, etc.)?
Is the process affordable?
4/14/12
Preservation Issues
Archiving of digital data. Anybody involved in digitization processes is facing the problem of long-term archiving of digital data (Rosenthaler, 2007).
4/14/12
Preservation pointers (Rosenthaler, 2007)Longevity of digital data can be
best achieved by implementing a migration model based on the following rules:
1) Redundancy. Data must be kept with a high level of redundancy. At least three copies on a minimum of two different types of storage media (such as two copies on hard disk and one copy on magnetic tape) should be kept at geographically different locations.
4/14/12
Preservation pointers (Rosenthaler, 2007) 2) Checksums. For all data files, checksums should be calculated and archived with the data files. This allows for checking data files at any time for aging-related changes or errors
3) Proofreading. Every 12 to 24 months, the data should be proofread and the checksums compared. If errors are detected, a migration should be launched immediately.
4/14/12
Preservation pointers (Rosenthaler, 2007)4) Migration: Migrations have to be planned in advance, including financing. A bit stream migration is necessary about every 5 years. A format migration is advised if a new file format becomes standard and the conversion can be done without loss of data.
5) Documentation: Every step has to be documented in detail and all media must be labelled properly.
By following these rules, you may preserve digital data indefinitely. However, constant care is required
4/14/12
Preservation pointers Lukas Rosenthaler (2006), Proposed using
peer-to-peer file networks on the Internet--the same technology used to circulate pirated music files--as a way of letting photographers of all stripes store their work Copies of encrypted digital files would be distributed across a shared network of servers around the world, which Rosenthaler calls Distarnet. "The idea is not to give everyone access to the data, but to prevent data loss," he says.
4/14/12
Preservation ResearchRosenthaler is currently looking at ways of using
microfilm, the old standby of archivists and librarians everywhere, to preserve digital files.
Rather than preserving the image itself, Rosenthaler's proposal is to preserve the individual bits from the image file as a series of light and dark dots indicating the ones and zeros of the file. The method could also be used to preserve music or even video information for centuries in such a way that basic technology like a microscope could be used to assemble the dots into usable data.
"There are many important things stored digitally now that, if they aren't preserved, will simply vanish in time," he
says. "Our place in history is what's at stake."
4/14/12
Plans
Regularly conduct user surveys to obtain feedback
Keep abreast of technology developments specially for long term storage standards, and Information Retrieval software
Digitize and make available newspapers published below 2000
4/14/12
PlansUpload into a digital repository for
wider access
Look for and test automated indexing systems such as OCR to lower indexing expense and make indexing faster
Look at feasibility of cloud computing
Click to edit the outline text format
Second Outline Level
Third Outline LevelFourth
Outline Level Fifth
Outline Level
Sixth Outline Level
Seventh Outline Level
Eighth Outline Level
Ninth Outline LevelClick to edit Master text styles
Second level
Third level
Fourth level
Fifth level
ConclusionUsers prefer the
digital format over the microfilm because of ease of access, downloading, and printing
There are technology and preservation issues to monitor
4/14/12
References
Hesseldahl, Arik (2006) How To Preserve Photos For 500 Years http://www.forbes.com/2005/04/14/cx_ah_0414photo.html
ICON. http://icon.crl.edu/
Illinois Newspaper Project. http://www.library.illinois.edu/inp/
Ingram, Matthew (2010). What does the future hold for newspapers? http://gigaom.com/2010/03/28/what-does-the-future-hold-for-newspapers/
Kinsley, Michael (2006). Do newspapers have a future? Time Magazine. http://www.time.com/time/magazine/article/0,9171,1538652,00.html
Murdock, Rupert (2009). Future of Newspapers is digital. AFP http://www.google.com/hostednews/afp/article/ALeqM5jx9gAs5FOaV6IPy4an5gRxBQLR1g
Rosenthaler, Lukas (2007). Rosenthaler, Lukas. "Digital Archiving." Focal Encyclopedia of Photography: Digital Imaging, Theory and Applications, History, and Science. Ed. Michael R. Peres. 4th ed. Amsterdam: Elsevier, 2007. 359-364. Gale Virtual Reference Library. Gale. Ateneo de Manila University. 6 Nov. 2009 <http://go.galegroup.com/ps/start.do?p=GVRL&u=phadmu>.