v0.0.2: make binarize process PAGE-XML (with AlternativeImage)

Pre-release
@kba kba released this 20 Dec 13:56
· 231 commits to master since this release

First release

  • detect whether input file was image/* or PAGE-XML
  • for image file, also generate a skeleton PAGE-XML
    (with AlternativeImage for the binarized result)
  • for PAGE-XML file, detect namespace (and prefix),
    then check whether AlternativeImage already exists
    or find imageFilename (both on disk and as METS ID),
    process image and add AlternativeImage for result
  • reference all new files (image under OCR-D-IMG-BIN,
    PAGE-XML under output fileGrp) in METS
  • encapsulate, add pointers for improvement of bashlib
    in core
  • improve documentation
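
The PAGE-XML handling listed above can be illustrated with a minimal Python sketch. This is not the project's implementation (the mention of bashlib suggests the processor itself is shell-based). The function names `is_image`, `namespace_of` and `annotate_page` are hypothetical, the image check guesses from the file extension only, and picking the last existing AlternativeImage as the input is an assumption. METS lookups and the binarization itself are left out.

```python
import mimetypes
import re
import xml.etree.ElementTree as ET


def is_image(path):
    """Guess from the file extension whether path is an image/* file."""
    mime, _ = mimetypes.guess_type(path)
    return mime is not None and mime.startswith("image/")


def namespace_of(element):
    """Return the namespace URI embedded in an element tag, or ''."""
    match = re.match(r"\{([^}]*)\}", element.tag)
    return match.group(1) if match else ""


def annotate_page(xml_text, result_filename, comments="binarized"):
    """Pick the input image of a PAGE-XML document and record the result.

    Returns (root, source): root is the parsed tree with a new
    AlternativeImage for result_filename, source is the image that
    should be processed (assumption: the last AlternativeImage if any
    exists, otherwise the Page's imageFilename).
    """
    root = ET.fromstring(xml_text)
    ns = namespace_of(root)  # detected, not hard-coded to one PAGE version

    def qualify(name):
        return "{%s}%s" % (ns, name) if ns else name

    page = root.find(qualify("Page"))
    if page is None:
        raise ValueError("not a PAGE-XML document: no Page element")

    existing = page.findall(qualify("AlternativeImage"))
    source = existing[-1].get("filename") if existing else page.get("imageFilename")

    alt = ET.Element(qualify("AlternativeImage"),
                     {"filename": result_filename, "comments": comments})
    # Assumes AlternativeImage elements are the first children of Page,
    # so the new one is placed directly after the existing ones.
    page.insert(len(existing), alt)
    return root, source
```

Calling `annotate_page` on a file that has no AlternativeImage returns its `imageFilename` as the source. Calling it again on the serialized result returns the previously added binarized image instead, which mirrors the "already exists" check in the list.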