How To Create A Pdf File From A Scanned Document

Converting a scanned document to PDF adds value to your important scans: it becomes easier to modify the document and save it in other output formats such as Word, Excel, PPT, and EPUB. Tools such as PDFelement can do this conversion. This article shows how to use the OCR tool built into Adobe Acrobat, the original standard program for creating, editing, and viewing PDF files, to turn your scanned documents and pictures of text into real digital text.

PDFs are to modern businesses what faxes once were to companies operating in the 1980s. They provide a convenient way to not only send documents to colleagues and customers but also facilitate easy commenting and collaboration. Yet when a document is scanned into PDF format, it can be difficult to edit—unless you first convert the text using a process known as OCR, or optical character recognition.

1. Run Adobe Acrobat X. Go to the 'File' menu, select 'Open' and choose your scanned PDF document. At this point, your scan is considered a single image by the program and the text areas are not editable.

2. Make the text editable by going to the 'Tools' menu, selecting 'Recognize Text' and choosing 'In This File.' Click the 'Edit' button to make any adjustments ('ClearScan' tends to be the most effective option for this process), then click 'OK' to begin the character recognition process.

3. Click on the cursor tool in your toolbar, then highlight the text you'd like to edit. Right-click on the highlighted copy and you'll be given a series of options including 'Highlight Text,' 'Replace Text' and 'Add Note to Text.'

4. Save your document by going to the 'File' menu and clicking 'Save.'

Tips

If you get the following error message after clicking 'OK' in Step 2 -- 'Acrobat could not perform OCR on this page because: This page contains renderable text' -- simply check the box that says 'Ignore Future Errors' and the process will continue.
To have even more control over the text in a scanned document, go to the 'Tools' menu, select 'Content' and choose 'Edit Document Text.' Then, highlight the text you'd like to change and type to replace it, or navigate to 'Edit' then 'Delete' or 'Copy' to perform those functions.

Warning

Be sure to save your document even if you haven't made any edits so that your document remains editable.

About the Author

Michael Franco has been writing professionally since 1990. Having lived in both Singapore and Prague, he now works as a writer and editor in Asheville, N.C. Franco's work has appeared in publications such as 'Discovery Channel Magazine' and 'Islands,' as well as on various websites. He holds a Bachelor of Arts in English and a Master of Arts in creative writing.

Cite this Article

Franco, Michael. 'How to Create Editable PDF Files From Scanned Documents.' Small Business - Chron.com, http://smallbusiness.chron.com/create-editable-pdf-files-scanned-documents-52826.html. Accessed 28 January 2020.

When you scan a document directly into a PDF file, Acrobat captures all the text and graphics on each page as though they were all just one big graphic image. This is fine as far as it goes, except that it doesn’t go very far because you can neither edit nor search the PDF document (because, as far as Acrobat is concerned, the document doesn’t contain any text to edit or search, just one humongous graphic). That’s where the Paper Capture plug-in in Acrobat 5 for Windows comes into play: You can use it to make a PDF that you can just search or both search and edit.

For some unknown reason, some of the first copies of Acrobat 5 for Windows shipped without the Paper Capture plug-in. If you find that your Tools menu in Acrobat 5 is missing the Paper Capture item, you need to download and install the Paper Capture plug-in from the Adobe Web site. Note that the Paper Capture plug-in has a 50-page document limit. If you need to process PDF documents over 50 pages in length, you need to look into purchasing Adobe Acrobat Capture, a full-blown version of the Paper Capture plug-in that can handle longer documents.

To use Paper Capture, all you have to do is choose Tools –> Paper Capture to open the Paper Capture Plug-In dialog box, select the page or pages to be processed (All Pages, Current Page, or From Page x to y), and then click the OK button; the Paper Capture utility does the rest. As it processes the page or pages in the document that you designated, a Paper Capture Plug-In alert dialog box keeps you informed of its progress in preparing and performing the page recognition. When Paper Capture finishes doing the page recognition, this alert dialog box disappears and you can then save the changes to your PDF document with the File –> Save command.

When doing the page recognition in a PDF document, the Paper Capture plug-in offers you a choice between the following three Output Style options:

Formatted Text & Graphics to make the text in the PDF document both editable and searchable. Select this setting if you not only want to be able to find text in the document but also possibly make editing changes to it.

Searchable Image (Exact) to make the text in the PDF document searchable but not editable (this is the default setting). Use this setting if you’re processing a document that needs to be searchable but should never be edited in any way, such as an executed contract.

Searchable Image (Compact) to make the text in the PDF document searchable but not editable and to compress its graphics. Select this setting if you’re processing a document whose text requires searching without editing and that also contains a fair number of graphic images that need compressing. When you select this setting, Paper Capture applies JPEG compression to color images and ZIP compression to black-and-white images.
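
The choice among these three output styles comes down to two questions: does the text need to be editable, and do the graphics need compressing? The short Python sketch below spells out that decision. The function name choose_output_style and its parameters are made up for illustration and are not part of Acrobat or the Paper Capture plug-in; only the three style names come from the plug-in itself.

```python
def choose_output_style(needs_editing, compress_graphics=False):
    """Suggest a Paper Capture output style (hypothetical helper, not an Adobe API).

    needs_editing     -- True if the recognized text must be editable as well as searchable
    compress_graphics -- True if the page images should also be compressed
    """
    if needs_editing:
        # Only this style makes the text both editable and searchable.
        return "Formatted Text & Graphics"
    if compress_graphics:
        # Searchable but not editable, with the graphics compressed.
        return "Searchable Image (Compact)"
    # The plug-in's default: searchable but not editable.
    return "Searchable Image (Exact)"
```

For the executed contract mentioned above, choose_output_style(needs_editing=False) returns the default, Searchable Image (Exact).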

To select a different output style setting, click the Preferences button in the Paper Capture Plug-In dialog box to open the Preferences dialog box. This dialog box not only enables you to select a new output style in the PDF Output Style pop-up menu but also to designate the primary language used in the text in the Primary OCR Language pop-up menu (OCR stands for Optical Character Recognition, which is the kind of software that Paper Capture uses to recognize and convert text captured as a graphic into text that can be searched and edited).

If your PDF document contains graphic images, you can tell Paper Capture how much to compress the images by selecting the maximum resolution in the Downsample Images pop-up menu. This menu offers you three options in addition to None (for no compression): Low (300 dpi), Medium (150 dpi), and High (72 dpi). The Low, Medium, and High options refer to the amount of compression applied to the images, and the values 300, 150, and 72 dpi (dots per inch) refer to their resolution and thus their quality. As always, the higher the amount of compression, the smaller the file size and the lower the image quality.
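
To get a feel for what these resolution caps mean, the following Python sketch works out the pixel dimensions and uncompressed size of a full-page image at each setting. It rests on assumptions that are not from Acrobat: a US Letter page (8.5 x 11 inches) and 24-bit color, measured before any JPEG or ZIP compression is applied, so treat the numbers as rough arithmetic rather than the file sizes Paper Capture will actually produce.

```python
# Rough arithmetic for the Downsample Images settings.
# Assumptions (not from Acrobat): US Letter page, uncompressed 24-bit color.
PAGE_WIDTH_IN = 8.5
PAGE_HEIGHT_IN = 11.0
BYTES_PER_PIXEL = 3  # 24-bit RGB, before any JPEG or ZIP compression

# Setting name -> maximum resolution in dpi, as listed in the text above.
DOWNSAMPLE_OPTIONS = {"Low": 300, "Medium": 150, "High": 72}


def page_pixels(dpi):
    """Return (width, height) in pixels for a full-page image at the given dpi."""
    return round(PAGE_WIDTH_IN * dpi), round(PAGE_HEIGHT_IN * dpi)


def raw_size_bytes(dpi):
    """Return the uncompressed size in bytes of a full-page image at the given dpi."""
    width, height = page_pixels(dpi)
    return width * height * BYTES_PER_PIXEL


if __name__ == "__main__":
    for name, dpi in DOWNSAMPLE_OPTIONS.items():
        width, height = page_pixels(dpi)
        megabytes = raw_size_bytes(dpi) / 1_000_000
        print(f"{name:<6} {dpi:>3} dpi  {width} x {height} px  ~{megabytes:.1f} MB raw")
```

Because the pixel count grows with the square of the resolution, going from Low (300 dpi) to Medium (150 dpi) cuts the number of pixels to a quarter, which is why the higher settings shrink the file so noticeably.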

After processing the pages of your PDF document with the Paper Capture plug-in, use the Find feature (Ctrl+F on Windows and Command key+F on the Mac) to search for words or phrases in the text to verify it can be searched. If you used the Formatted Text & Graphics output style in doing the page recognition, you can select the TouchUp Text Tool by clicking its button on the Editing toolbar or by typing T, and then click the I-beam pointer in a line of text to select the line with a bounding box to verify that you can edit the text as well. Always remember to use File –> Save to save the changes made to your document by processing with Paper Capture.
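
If you have a folder of scans and want a quick first pass before opening each one, a crude check can be scripted. The Python sketch below is only a heuristic and not an Acrobat feature: it assumes that a scan-only PDF contains no font resources, while a PDF with recognized text does, so it simply looks for the /Font marker in the raw bytes of the file. The function names are made up for illustration. The check can miss fonts in newer PDFs that store their dictionaries in compressed object streams, so the Find command described above remains the real test.

```python
def looks_searchable(pdf_bytes):
    """Crude heuristic: True if the raw PDF bytes mention a font resource.

    Text in a PDF is drawn with a font, so an image-only scan usually has
    no /Font entry. This does not parse the PDF and can be fooled.
    """
    return b"/Font" in pdf_bytes


def file_looks_searchable(path):
    """Apply the same heuristic to a PDF file on disk."""
    with open(path, "rb") as handle:
        return looks_searchable(handle.read())
```

A file that fails this check is a good candidate for processing with Paper Capture. A file that passes still needs to be confirmed with a real search.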
