Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Provide OCR capabilities when importing a PDF #10

Open
extracts opened this issue Apr 4, 2021 · 0 comments
Open

Provide OCR capabilities when importing a PDF #10

extracts opened this issue Apr 4, 2021 · 0 comments
Labels
enhancement New feature or request

Comments

@extracts
Copy link
Owner

extracts commented Apr 4, 2021

When importing a PDF file, it would be very useful if Keypoints could trigger optical character recognition (OCR) to ensure that the PDF contains a readable text layer.

There are various OCR libraries available, like the open source Tesseract or commercial web services such as from ABBYY. It will also depend on license & pricing if one of these could be used with Keypoints.

Note that support for OCR is rather a long-term goal.

Workaround: Users who also use some other OCR-capable app (like DEVONthink Pro) could first import the PDF into that app and let it perform the OCR, then drag the PDF over to Keypoints for further annotation & highlighting.

@extracts extracts added the enhancement New feature or request label Apr 4, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant