Skip to content

Releases: explosion/prodigy-pdf

v0.4.2

24 Dec 09:22
Compare
Choose a tag to compare

Fix handling of documents without page spans.

v0.4.1

23 Dec 14:54
Compare
Choose a tag to compare

Fix potential memory leak by closing PDFs explicitly.

v0.4.0

25 Nov 09:54
fbbe45d
Compare
Choose a tag to compare

Added pdf.layout.fetch and pdf.spans.manual recipes.

v0.3.0

18 Nov 12:33
6aafabc
Compare
Choose a tag to compare

This release adds support to pages UI introduced in Prodigy v1.17.0.

v0.2.2

24 Jul 16:01
0072da9
Compare
Choose a tag to compare

This release adds porting over the meta field to the resultant dataset so that for example the document information can be preserved.

v0.2.1

07 May 08:03
89644c5
Compare
Choose a tag to compare

Bumped Pillow dependecy to <11.0.0

v0.2.0

19 Oct 09:30
f7abd42
Compare
Choose a tag to compare

This release adds support for pdf.ocr.correct. This recipe adds OCR to annotated segments from the pdf.image.manual recipe and gives a textbox for corrections. This recipe uses pytessaract under the hood.

v0.1.0

05 Oct 11:00
66d0b28
Compare
Choose a tag to compare

This initial release introduces a pdf.image.manual recipe that adds a workflow to annotate PDF pages as if they are images. We also add CI and unit tests.