What is a "Searchable PDF"?

The PDF file format can be confusing, especially when it comes to understanding what constitutes a "searchable" PDF file. To understand whether a PDF file is searchable, you have to look at its origin.

Text-Based PDF

First, a PDF file can originate with a file on your computer, like a Word document. Normally, you create the file in your software and then "print" it to a PDF printer. This converts the file to PDF format. These PDF files are text-based PDF, meaning that they retain the text and formatting of the original. Text-based PDF files are searchable because they contain real text.

Image-Based PDF

PDF files can also originate from a scan or a fax. These are image-based PDF files, meaning that they are simply a picture of the original. To your computer, these images are no different than digital photos or graphics. Your computer does not see any text in them.

To make these files searchable, it is necessary to "recognize" the text in the image using optical character recognition ("OCR"). This creates text from the "pictures" of the letters and then inserts the text invisibly behind the image. Without OCR, an image-based PDF file is not searchable.

It’s easy to do. Just select or open the PDF, then click the OCR button. Enterprise Organizer Pro will take care of the rest.

Test Whether a PDF Has Text

If you’re in doubt, there’s an easy way to see whether a PDF file is searchable or not:

  1. If you don’t have the PDF open in Enterprise Organizer Pro, select it, then click the Files button Open in Enterprise Organizer Pro
  2. With the PDF open in Enterprise Organizer Pro, right-click on it and choose Select Tool
  3. Now click and drag the mouse across text to see if it selects anything
  4. If you can’t select any text, it’s because there isn’t text and the PDF isn’t searchable

Alternatively, open the PDF in Adobe Acrobat, then select the "Edit" menu > "Select All". This will select all of the text in the file. If nothing is selected, there is no text and the file isn’t searchable.

Attached Files
There are no attachments for this article.
Comments
There are no comments for this article. Be the first to post a comment.
Name
Email
Security Code Security Code
Related Articles RSS Feed
Can’t Print to PDF from Internet Explorer
Viewed 2698 times since Fri, Oct 25, 2013
Can’t Print to PDF from Office Applications
Viewed 1589 times since Fri, Oct 25, 2013
Watermarks Still Showing Up
Viewed 1469 times since Fri, Oct 25, 2013
Using Acrobat for PDF Preview; Common Problems
Viewed 1474 times since Fri, Oct 25, 2013
Dynamic Stamps
Viewed 1547 times since Fri, Oct 25, 2013
Conversion Benchmarks
Viewed 1781 times since Fri, Oct 25, 2013
Change the Typewriter Font
Viewed 1377 times since Fri, Oct 25, 2013
Highlight in a Scan
Viewed 1431 times since Fri, Oct 25, 2013
"Save" Dialog Doesn’t Always Work with the Adobe PDF Printer
Viewed 1476 times since Fri, Oct 25, 2013
Erase in PDF
Viewed 7494 times since Fri, Oct 25, 2013
MENU