PyLaia is a device agnostic, PyTorch based, deep learning toolkit for handwritten document analysis.
It is also a successor to Laia.
Get started by having a look at our Documentation!
To install PyLaia from PyPi:
pip install pylaia
Please note that the CUDA version of nnutils (nnutils-pytorch-cuda
) is installed by default. If you do not have a GPU, you should install the CPU version (nnutils-pytorch
).
The following Python scripts will be installed in your system:
pylaia-htr-create-model
: Create a VGG-like model with BLSTMs on top for handwriting text recognition. The script has different options to customize the model. The architecture is based on the paper "Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition?" (2017) by J. Puigcerver.pylaia-htr-train-ctc
: Train a model using the CTC algorithm and a set of text-line images and their transcripts.pylaia-htr-decode-ctc
: Decode text line images using a trained model and the CTC algorithm. It can also output the char/word segmentation boundaries of the symbols recognized.pylaia-htr-netout
: Dump the output of the model for a set of text-line images in order to decode using an external language model.
If you want to contribute new feature or found a text that is incorrectly segmented using pySBD, then please head to CONTRIBUTING.md to know more and follow these steps.
- Fork it ( https://gitlab.teklia.com/atr/pylaia/-/forks/new )
- Create your feature branch (
git checkout -b my-new-feature
) - Commit your changes (
git commit -am 'Add some feature'
) - Push to the branch (
git push origin my-new-feature
) - Create a new Merge Request ( https://gitlab.teklia.com/atr/pylaia/-/merge_requests/new )
We are committed to providing a friendly, safe and welcoming environment for all. Please read and respect the PyLaia Code of Conduct.
Work in this toolkit was financially supported by the Pattern Recognition and Human Language Technology (PRHLT) Research Center.
- Article describing the latest contributions to PyLaia
@inproceedings{pylaia2024,
author = "Tarride, Solène and Schneider, Yoann and Generali, Marie and Boillet, Melodie and Abadie, Bastien and Kermorvant, Christopher",
title = "Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source Library",
booktitle = "Submitted at ICDAR",
year = "2024"
}
- Original article
@inproceedings{laia2017,
author={Puigcerver, Joan},
booktitle={2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)},
title={Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition?},
year={2017},
volume={01},
number={},
pages={67-72},
doi={10.1109/ICDAR.2017.20}}
- GitLab repository
@misc{puigcerver2018pylaia,
author = {Joan Puigcerver and Carlos Mocholí},
title = {PyLaia},
year = {2018},
publisher = {GitLab},
journal = {GitLab repository},
howpublished = {\url{https://gitlab.teklia.com/atr/pylaia/}},
commit = {commit SHA}
}