- Config: make :any:`Config.cfg_path` public attribute
- Document: add support for :class:`~pathlib.Path` for loading pdf
- pyxpdf_data: add 35 base Postscript fonts from ghostscript
Bugs Fixed
- Fix #9: segfault using :meth:`~pyxpdf.xpdf.Document.text()`
- Fix #8: add checks for file in :any:`Config.add_font_file()`
- Config: add function to add missing fonts :any:`Config.add_font_file()`
- Introduce :class:`~pyxpdf.xpdf.PDFImage` to represent a PDF Image.
- PDFImageOutput: :meth:`~pyxpdf.xpdf.PDFImageOutput.get` returns :class:`~pyxpdf.xpdf.PDFImage` instead of Pillow :class:`Image`
Bugs Fixed
- fix all direct memory leaks
- Config: fix :any:`Config.text_encoding` setter, encodings with lowercase characters were not able to set.
- fix weird bytes encoding problem in python debug builds
- Python 2.7 support dropped
- 2 optional dependencies (Pillow, pyxpdf_data) introduced
New Features
Introduce (optional) package pyxpdf_data which add more encoding support.
API: add specialised classes for pdf outputs, PDFOuputDevice.
- TextOutput - For Text extraction
- RawImageOutput - Render PDF Page as Image
- PDFImageOutput - Extract images from PDF
- Config: add new global settings:
Bugs Fixed
- pdftotext: extracted text contains clipped text even when explictly discarding it.
- Config: fix loading of external xdfrc with :any:`Config.load_file()`
- FIX: default :any:`Config.text_encoding` value i.e UTF-8 does not persist :any:`Config.reset()` and changes to Latin1.
- pdftotext: remove all parameters that change global :data:`~pyxpdf.xpdf.Config` properties.
Initial stable release.