processing pdf: how to go from pdf to e-text to audio

54
Processing PDF: Processing PDF: How to Go from PDF How to Go from PDF to to E-text to Audio E-text to Audio Gaeir Dietrich Director High Tech Center Training Unit of the California Community Colleges Foothill Community College District

Upload: lali

Post on 11-Jan-2016

29 views

Category:

Documents


0 download

DESCRIPTION

Processing PDF: How to Go from PDF to E-text to Audio. Gaeir Dietrich Director High Tech Center Training Unit of the California Community Colleges Foothill Community College District. PDF from Publishers. Portable document format (PDF) Reads the same on any computer Looks like the book - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Processing PDF: How to Go from PDF to E-text to Audio

Processing PDF:Processing PDF:How to Go from PDF toHow to Go from PDF toE-text to AudioE-text to Audio

Gaeir DietrichDirectorHigh Tech Center Training Unitof the California Community CollegesFoothill Community College District

Page 2: Processing PDF: How to Go from PDF to E-text to Audio

PDF from PublishersPDF from Publishers

Portable document format (PDF) Reads the same on any computer Looks like the book Smaller than TIFFs Contains all the text

Always check to make sure the book is the right one!

Easy for publishers

Page 3: Processing PDF: How to Go from PDF to E-text to Audio

Requesting through ATNRequesting through ATN

Access Text Network Now free for requesting files from ATN-

member publishers Paid membership to exchange files www.accesstext.org

Not all publishers But ATN does have the largest ones

Page 4: Processing PDF: How to Go from PDF to E-text to Audio

Other Resources at ATNOther Resources at ATN

Accessible Textbook Finder http://www.accesstext.org/atf.php

Link to Publisher Lookup http://www.publisherlookup.org/ Will have to contact non-ATN member

publishers directly

Page 5: Processing PDF: How to Go from PDF to E-text to Audio

Using Publisher PDFsUsing Publisher PDFs

Sometimes students can use files directly

Often files will need further processing for student use

At the very least, large files may need to be broken into chapters

Page 6: Processing PDF: How to Go from PDF to E-text to Audio

PDF StrengthsPDF Strengths

Good format for large print Cropping Fit to page on large pages Print sections on large pages (tiling)

Adobe Reader has some nice features Change colors Reflow Limited voicing

Works on both Mac and PC Easy for most publishers to create

Page 7: Processing PDF: How to Go from PDF to E-text to Audio

PDF WeaknessesPDF Weaknesses

Not always fully accessible Screen readers do not always like them—

even when they are text-based Reading order can be problematic

May be graphics (pictures of text) May have too much security

Page 8: Processing PDF: How to Go from PDF to E-text to Audio

As an Aside…As an Aside…

When faculty create PDFs… The PDF always started as something

else…usually a Word file Try to get the starting document if the

student prefers audio Security concerns?

Word files can be password protected Button > Prepare > Encrypt

Page 9: Processing PDF: How to Go from PDF to E-text to Audio

Types of PDF DocumentsTypes of PDF Documents

Text-based Text can be selected

Graphical Picture of text (i.e., a graphic) Text cannot be selected

Use text-select tool to tell the difference Files may be “locked”

Page 10: Processing PDF: How to Go from PDF to E-text to Audio

Processing PDFsProcessing PDFs

Adobe Acrobat Professional Check on College Buys for discount

Good OCR program Abbyy FineReader Nuance OmniPage

IF you are a Kurzweil campus, you will also need Kurzweil

Page 11: Processing PDF: How to Go from PDF to E-text to Audio

Adobe ToolsAdobe Tools

Adobe Reader Free Useful for students who need minimal

accessibility features http://www.adobe.com/products/reader/

Adobe Acrobat Professional Essential for alt media specialists Extract text, create accessible PDFs, enabled

Adobe Reader features www.uscollegebuy.com Discounted Price

Page 12: Processing PDF: How to Go from PDF to E-text to Audio

Acrobat ReaderAcrobat Reader

Reads aloud But does not highlight or track

Enlarges text Nice reflow feature

Changes text/background colors Text highlighting, sticky notes, and

comments Access for text-based PDFs

Page 13: Processing PDF: How to Go from PDF to E-text to Audio

Production Features in Reader

Really designed for reading, not reformatting

Export PDF Subscription service (about $20/year) Upload PDF file, service auto-converts to

Word, download

Page 14: Processing PDF: How to Go from PDF to E-text to Audio

Process with Acrobat ProProcess with Acrobat Pro

Cropping Enlargement for printing Tiling Extracting/deleting pages Combining/inserting pages Text extraction

Works best with text-based PDF Does have built-in OCR capability

Page 15: Processing PDF: How to Go from PDF to E-text to Audio

Customize Quick Tools

Click on the “gear”

View > Show/hide > Toolbar Items > Quick Tools

Page 16: Processing PDF: How to Go from PDF to E-text to Audio

Quick Tools Menu

Page 17: Processing PDF: How to Go from PDF to E-text to Audio

Customize

Page 18: Processing PDF: How to Go from PDF to E-text to Audio

Please Note

To enable single-key shortcuts Open Preferences dialog box Ctrl + K Under General > select Use Single-Key

Accelerators To Access Tools (first checkbox under Basic Tools)

Page 19: Processing PDF: How to Go from PDF to E-text to Audio

Cropping

Tools > Pages > Crop

Shortcut: C (Please note: This shortcut brings up the

mouse-driven cropping tool—must double click to open the dialog box!)

Page 20: Processing PDF: How to Go from PDF to E-text to Audio

Crop Tool

Page 21: Processing PDF: How to Go from PDF to E-text to Audio

Crop Toolbox

Page 22: Processing PDF: How to Go from PDF to E-text to Audio

Enlarging

Choose paper size/printer File > Print > Size…to Fit

Shortcut: Ctrl + P (tab through)

Tip: Crop document before enlarging

Page 23: Processing PDF: How to Go from PDF to E-text to Audio

Print to Fit

Page 24: Processing PDF: How to Go from PDF to E-text to Audio

Tiling

Choose paper size/printer File > Print > Poster > Tile Scale and

Overlap

Shortcut: Ctrl + P (tab through)

Tip: Crop document before tiling

Page 25: Processing PDF: How to Go from PDF to E-text to Audio

Enlarge with Tiling

Page 26: Processing PDF: How to Go from PDF to E-text to Audio

Extracting Pages

Tools > Pages > Extract

Delete Shortcut: Ctrl + Shift + D Extract Pages Shortcut: Alt V + T + P

(opens Pages pane; F6 focuses in pane and can arrow down)

Page 27: Processing PDF: How to Go from PDF to E-text to Audio

Extraction Tool

Page 28: Processing PDF: How to Go from PDF to E-text to Audio

Tips for Extracting Chapters

Crop on complete file before extracting Work on a copy!!!!! Extract from end toward front! Use table of contents to help Place focus on first page of chapter to

extract (beginning with last)

Page 29: Processing PDF: How to Go from PDF to E-text to Audio

Starting from the Back

Page 30: Processing PDF: How to Go from PDF to E-text to Audio

Combining

File > Pages > Insert

OR

Create > Combine files

Page 31: Processing PDF: How to Go from PDF to E-text to Audio

Inserting Pages

Page 32: Processing PDF: How to Go from PDF to E-text to Audio

Combining Pages

Page 33: Processing PDF: How to Go from PDF to E-text to Audio

Auto Extracting Text

File > Save As > MS Word Retains styles and paragraphs

File > Save As > More options… Text (Accessible)

Lose styles, places hard returns at end of line Text (Plain)

Lose styles, keeps paragraphs

Shortcut: Alt F + A

Page 34: Processing PDF: How to Go from PDF to E-text to Audio

Save As Options

Page 35: Processing PDF: How to Go from PDF to E-text to Audio

Better Text Extraction

OCR programs analyze text and structure Acrobat Pro has built-in OCR, but other

programs provide more control Can control which text to include

Page 36: Processing PDF: How to Go from PDF to E-text to Audio

More Control over Text

For graphical PDFs Or To maintain more control over extracting

text from text-based PDFs Use an OCR program!

Page 37: Processing PDF: How to Go from PDF to E-text to Audio

Processing Graphical PDFsProcessing Graphical PDFs

Must run optical character recognition (OCR) Computers cannot read pictures OCR programs recognize the “characters” in the

picture

How you process the file depends on the end format the student wants!

Page 38: Processing PDF: How to Go from PDF to E-text to Audio

Want to Stay in PDF?

Sometimes students do want a text-based PDF

Can OCR in Adobe Pro Tools> Recognize Text

Page 39: Processing PDF: How to Go from PDF to E-text to Audio

Under Tools

Page 40: Processing PDF: How to Go from PDF to E-text to Audio

Want Text OutWant Text Out

OmniPage or FineReader FineReader generally easier to learn Save to Word or HTML or Text based on student

preference

Use virtual printer with Kurzweil Create KESI files

R&W Save as Word

Page 41: Processing PDF: How to Go from PDF to E-text to Audio

Which One When?Which One When?

Want a Word file? Best choice is OmniPage or FineReader

Want a Kurzweil document? Use Kurzweil to process the PDF

For students to do themselves? Whichever program they prefer

Page 42: Processing PDF: How to Go from PDF to E-text to Audio

Why?Why?

OCR programs are designed to make extraction and editing easy

Document readers (R&W, Kurzweil, etc.) are designed to make reading easy…NOT editing.

Page 43: Processing PDF: How to Go from PDF to E-text to Audio

NEVER!!!NEVER!!!

Do NOT run OCR with FineReader or OmniPage…save to PDF…and then take into Kurzweil, R&W, etc.

Kurzweil, R&W, WYNN will run their own OCR on the PDF! Wastes time, adds error to do OCR twice

Page 44: Processing PDF: How to Go from PDF to E-text to Audio

OCR ProgramsOCR Programs

Treat PDFs the same as a TIFF If you OCR scanned documents, use the

same process

Load image file Select zones Create templates as needed

Page 45: Processing PDF: How to Go from PDF to E-text to Audio

OCR Process Details

Crop before loading into OCR engine Turn on multiple languages as needed

If doing math, turn on Greek Only turn on the languages you need

Edit in the OCR program Some OCR programs have font matching features

Save to Word

Page 46: Processing PDF: How to Go from PDF to E-text to Audio

Captions and Such

For students who want audio or who are using screen readers Separate the main body of the text and the

“ancillary text” (captions, sidebars, footnotes)

Create two documents 00 Chapter and 00A Chapter

Allows the student to hear main text uninterrupted

Page 47: Processing PDF: How to Go from PDF to E-text to Audio

Two Doc Workflow

Open PDF in OCR Program Analyze layout for entire document

Save a copy On one copy…delete all ancillary text

Save to Word as 00 Chapter On other copy…delete all main body text

Save as 00A Chapter Keep page numbers in both documents!

Page 48: Processing PDF: How to Go from PDF to E-text to Audio

Once in Word

Learn to use “show hidden” Ctrl + Shift + 8

Beware of the optional hyphen Search and replace to delete Search for ^- replace with nothing Run spell check

Use styles to structure files for braille program

Page 49: Processing PDF: How to Go from PDF to E-text to Audio

Converting Files

Page 50: Processing PDF: How to Go from PDF to E-text to Audio

Mobile Readers?

Check formats that device can handle Some handle PDF and DOC, some do not

All readers handle TXT Also called text, ASCII Can save from Word as plain text

Page 51: Processing PDF: How to Go from PDF to E-text to Audio

Magic Conversion Tool

Calibre Converts to and from many formats Fairly intuitive Free!

http://calibre-ebook.com/

Page 52: Processing PDF: How to Go from PDF to E-text to Audio

Another Conversion Tool

TechAdapt http://www.techadapt.com/

TechAdapt Accessible Media Center (TAMC) For converting NIMAS and DAISY

DAISY to… RTF HTML

Page 53: Processing PDF: How to Go from PDF to E-text to Audio

File Transfer

Can use DropBox or Box to transfer files for most readers

Kindle and iPad can often use e-mail

Page 54: Processing PDF: How to Go from PDF to E-text to Audio

Resource InfoResource Info

Gaeir Dietrich [email protected] 408-996-6047

www.htctu.net Alt media listserv Manuals online