What is a "Searchable PDF"?

The PDF file format can be confusing, especially when it comes to understanding what constitutes a "searchable" PDF file. To understand whether a PDF file is searchable, you have to look at its origin.

Text-Based PDF

First, a PDF file can originate with a file on your computer, like a Word document. Normally, you create the file in your software and then "print" it to a PDF printer. This converts the file to PDF format. These PDF files are text-based PDF, meaning that they retain the text and formatting of the original. Text-based PDF files are searchable because they contain real text.

Image-Based PDF

PDF files can also originate from a scan or a fax. These are image-based PDF files, meaning that they are simply a picture of the original. To your computer, these images are no different than digital photos or graphics. Your computer does not see any text in them.

To make these files searchable, it is necessary to "recognize" the text in the image using optical character recognition ("OCR"). This creates text from the "pictures" of the letters and then inserts the text invisibly behind the image. Without OCR, an image-based PDF file is not searchable.

It’s easy to do. Just select or open the PDF, then click the OCR button. Enterprise Organizer Pro will take care of the rest.

Test Whether a PDF Has Text

If you’re in doubt, there’s an easy way to see whether a PDF file is searchable or not:

  1. If you don’t have the PDF open in Enterprise Organizer Pro, select it, then click the Files button Open in Enterprise Organizer Pro
  2. With the PDF open in Enterprise Organizer Pro, right-click on it and choose Select Tool
  3. Now click and drag the mouse across text to see if it selects anything
  4. If you can’t select any text, it’s because there isn’t text and the PDF isn’t searchable

Alternatively, open the PDF in Adobe Acrobat, then select the "Edit" menu > "Select All". This will select all of the text in the file. If nothing is selected, there is no text and the file isn’t searchable.

Attached Files
There are no attachments for this article.
Comments
There are no comments for this article. Be the first to post a comment.
Name
Email
Security Code Security Code
Related Articles RSS Feed
Using Acrobat for PDF Preview; Common Problems
Viewed 2502 times since Fri, Oct 25, 2013
Can’t Print to PDF from Internet Explorer
Viewed 3652 times since Fri, Oct 25, 2013
"Save" Dialog Doesn’t Always Work with the Adobe PDF Printer
Viewed 2510 times since Fri, Oct 25, 2013
Can’t Print to PDF from Office Applications
Viewed 2550 times since Fri, Oct 25, 2013
Conversion Benchmarks
Viewed 2815 times since Fri, Oct 25, 2013
Erase in PDF
Viewed 8486 times since Fri, Oct 25, 2013
PDF/A Warning
Viewed 2330 times since Fri, Oct 25, 2013
Give a Scanned Document Keywords
Viewed 2255 times since Fri, Oct 25, 2013
Change the Typewriter Font
Viewed 2393 times since Fri, Oct 25, 2013
Dynamic Stamps
Viewed 2474 times since Fri, Oct 25, 2013
MENU