You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
BUG: OCR (PDFSandwich) is getting executed for Searchable Documents (PDF) as well.
Expected behavior: OCR should not process documents already containing text or searchable file.
Actual behavior: OCR is getting executed for Searchable Documents as well.
Steps to reproduce the behavior: Uploaded text containing PDF files which is also being processed for OCR.
Please help me on this.
Tell us about your environment: Linux
The text was updated successfully, but these errors were encountered:
DEEPAK-KESWANI
changed the title
How to restrict OCR (PDFSandwich) for Searchable Documents?
How to restrict OCR (PDFSandwich) for Searchable Documents (PDF)?
Oct 15, 2018
Hello,
there is one simple javascript for know if pdf containt already ocr or not 👍
var transformedPdfFolder = space.createFolder("temp_txt_folder");
var transformedPdfFile = document.transformDocument("text/plain", transformedPdfFolder)
if (transformedPdfFile.content.match(/./)) (don t do extract OCR) else do it.
I would like to do this on folder, actually, i do the javascript if not ocr move on folder then do the ocr and the doc back on the first folder.
BUG: OCR (PDFSandwich) is getting executed for Searchable Documents (PDF) as well.
Expected behavior: OCR should not process documents already containing text or searchable file.
Actual behavior: OCR is getting executed for Searchable Documents as well.
Steps to reproduce the behavior: Uploaded text containing PDF files which is also being processed for OCR.
Please help me on this.
Tell us about your environment: Linux
The text was updated successfully, but these errors were encountered: