Skip to content

A Python script that runs Kraken over the pages in a IIIF manifest and outputs the results to files.

Notifications You must be signed in to change notification settings

pulibrary/ocr-manifest

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OCR-Manifest

A package to perform OCR on the pages in a IIIF Manifest.

ocr-manifest~ is a Python package that provides a script, ocr-manifest, to download OCR files from a Manifest URI and write them to a specified directory.

Installation

To install the ocr-manifest package, ensure you have uv installed to manage your project dependencies. Then, follow these steps:

  1. Clone the repository or download the source code.

  2. Install the package and its dependencies using uv:

    uv install
  3. Verify that the ocr-manifest script is installed and available in your PATH:

    ocr-manifest --help

Usage

The ocr-manifest script processes a Manifest URI and downloads the corresponding OCR files to a specified directory.

Syntax

ocr-manifest <manifest_uri> <output_directory>
  • <manifesturi>: The URI of the Manifest to process.
  • <outputdirectory>: The directory where OCR files will be written.

Example

ocr-manifest https://example.com/manifest.json output_files

After execution, all the OCR files will be available in the output_files directory.

About

A Python script that runs Kraken over the pages in a IIIF manifest and outputs the results to files.

Resources

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages