Skip to content

Releases: opendatalab/PDF-Extract-Kit

PDF-Extract-Kit-1.0.0-released

11 Oct 08:56
Compare
Choose a tag to compare

What's Changed

  • @wangbinDL refactored the code for pdf-extract-kit-0.1.1 to support modular operations, allowing users to more conveniently and flexibly select and combine the models they need.
  • @wangbinDL added demos for formula recognition, formula detection, and layout detection.
  • @wangbinDL added documentation for PDF-Extract-Kit-1.0.
  • @JulioZhao97 introduced a new layout detection model (LayoutLMv3).
  • @wufan-tb added OCR support.

New Contributors

  • @JulioZhao97 made their first contribution with the addition of the LayoutLMv3 model.

PDF-Extract-Kit-0.1.1-released

09 Oct 03:11
Compare
Choose a tag to compare

What's Changed

  • Update license from Apache 2.0 to AGPL-3.0 by @wangbinDL
  • Add MinerU technical report bibtex by @wangbinDL

Version 0.1.1 is the stable release preceding the major architectural changes in PDF-Extract-Kit 1.0.0. While the upcoming 1.0.0 version introduces a more streamlined and intuitive user experience, it involves substantial modifications. Users who prefer the stability and familiarity of the previous version are encouraged to continue using 0.1.1.

PDF-Extract-Kit-0.1.0-released

11 Sep 08:30
2794fa3
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: https://github.com/opendatalab/PDF-Extract-Kit/commits/PDF-Extract-Kit-0.1.0-released