Searchable PDF: Difference between revisions

From Simple Wiki
Searchable [[PDF]] files keep the original scanned image for viewing and store the [[OCR]] text in a hidden layer, which can be used for [[Full-Text Searching]] of the document or for selecting text to copy and paste.


Full [[OCR]] conversions to [[PDF]] that discard the original image will never reproduce the original formatting exactly, especially when the document contains many images or has a complex layout.
 
Searchable PDF avoids this by keeping an image for viewing along with the text.
 
These are also known as Image+Text [[PDF]] files.
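
The Image+Text structure described above can be sketched in a few lines of Python. This is a minimal illustration, not how any particular product writes its files: it assumes the hidden layer is stored with PDF text rendering mode 3 (invisible text), which is a common way to do it. The function name <code>build_searchable_pdf</code>, the 2x2 gray placeholder that stands in for the scan, and the page size are all hypothetical, and the sample text is assumed to be plain ASCII.

```python
def build_searchable_pdf(ocr_text: str, width: int = 200, height: int = 100) -> bytes:
    """Return a one-page PDF: a placeholder 'scan' image plus invisible OCR text."""
    # Stand-in for the scanned page: a 2x2 8-bit grayscale bitmap (hypothetical data).
    pixels = bytes([200, 220, 220, 200])
    # Escape the characters that are special inside a PDF literal string.
    escaped = ocr_text.replace("\\", "\\\\").replace("(", "\\(").replace(")", "\\)")
    content = (
        f"q {width} 0 0 {height} 0 0 cm /Im0 Do Q\n"  # stretch the image over the page
        f"BT 3 Tr /F1 12 Tf 10 {height - 20} Td ({escaped}) Tj ET"  # 3 Tr = invisible text
    ).encode("latin-1")
    objects = [
        b"<< /Type /Catalog /Pages 2 0 R >>",
        b"<< /Type /Pages /Kids [3 0 R] /Count 1 >>",
        (f"<< /Type /Page /Parent 2 0 R /MediaBox [0 0 {width} {height}] "
         f"/Resources << /XObject << /Im0 4 0 R >> /Font << /F1 5 0 R >> >> "
         f"/Contents 6 0 R >>").encode("ascii"),
        b"<< /Type /XObject /Subtype /Image /Width 2 /Height 2 "
        b"/ColorSpace /DeviceGray /BitsPerComponent 8 /Length 4 >>\nstream\n"
        + pixels + b"\nendstream",
        b"<< /Type /Font /Subtype /Type1 /BaseFont /Helvetica >>",
        b"<< /Length " + str(len(content)).encode("ascii") + b" >>\nstream\n"
        + content + b"\nendstream",
    ]
    out = bytearray(b"%PDF-1.4\n")
    offsets = []
    for num, body in enumerate(objects, start=1):
        offsets.append(len(out))  # byte offset of each object, needed for the xref table
        out += f"{num} 0 obj\n".encode("ascii") + body + b"\nendobj\n"
    xref_pos = len(out)
    out += f"xref\n0 {len(objects) + 1}\n".encode("ascii")
    out += b"0000000000 65535 f \n"
    for off in offsets:
        out += f"{off:010d} 00000 n \n".encode("ascii")
    out += (f"trailer\n<< /Size {len(objects) + 1} /Root 1 0 R >>\n"
            f"startxref\n{xref_pos}\n%%EOF\n").encode("ascii")
    return bytes(out)


if __name__ == "__main__":
    with open("searchable_sketch.pdf", "wb") as fh:
        fh.write(build_searchable_pdf("Invoice 1234"))
```

A viewer shows only the image, while search and text selection operate on the invisible string. A real OCR engine would place each recognized word at the coordinates where it appears in the scan, rather than writing one line near the top of the page as this sketch does.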
 
== Related Knowledge Base Articles ==
 
* [https://www.simpleindex.com/knowledge-base/is-it-possible-to-search-for-and-retrieve-documents-with-google-desktop-search/ Is it possible to search for and retrieve documents with Windows desktop search?]
* [https://www.simpleindex.com/knowledge-base/can-simpleindex-create-searchable-pdf-imagetext-files-with-hidden-text/ Can SimpleIndex create searchable PDF Image+Text files with hidden text?]

Latest revision as of 17:26, 15 August 2023
