Skip to content

Latest commit

 

History

History
32 lines (25 loc) · 1.44 KB

README.md

File metadata and controls

32 lines (25 loc) · 1.44 KB

pdftoedn

A poppler-based PDF processing tool to extract document data and save it in EDN format. It supports:

  • Font and glyph remapping via user-defined font map configurations (in JSON format) to allow glyph substitutions for Type 1 or TT fonts with invalid/incorrect unicode tables and even embedded CID fonts with missing tables.
  • Path data extraction.
  • Transformed image output, written directly to disk in PNG format.
  • Annotations.
  • PDF outlines.

Usage

Process a pdf document and write its output to output_file.edn:

pdftoedn -o output_file.edn input_file.pdf

Further reading

Refer to the wiki for