Searchable PDF: Difference between revisions

From Simple Wiki
No edit summary
No edit summary
Line 1: Line 1:
Searchable [[PDF]] files keep the original scanned image for viewing, as well as [[OCR]] text in a hidden layer that can be used to [[full-text search]] the document or highlight text for copy and paste operations.
Searchable [[PDF]] files keep the original scanned image for viewing, as well as [[OCR]] text in a hidden layer that can be used for [[Full-Text Searching]] the document or highlight text for copy and paste operations.


Full [[OCR]] conversions to [[PDF]] that don't include the original image will never preserve the original formatting 100%, especially if there are a lot of images or complex document layout.  
Full [[OCR]] conversions to [[PDF]] that don't include the original image will never preserve the original formatting 100%, especially if there are a lot of images or complex document layout.  

Revision as of 15:53, 14 January 2022

Searchable PDF files keep the original scanned image for viewing, as well as OCR text in a hidden layer that can be used for Full-Text Searching the document or highlight text for copy and paste operations.

Full OCR conversions to PDF that don't include the original image will never preserve the original formatting 100%, especially if there are a lot of images or complex document layout.

Searchable PDF avoids this by keeping an image for viewing along with the text.

These are also known as Image+Text PDF files.