v0.1.0: The foundation version that covers four major functionalities
layoutparser
now supports the following functionalities:
-
Coordinate system:
- Supports the 3 basic coordinate system and their geometric relationships
- Supports the TextBlook and Layout system for convenient coordinate and text processing
-
OCR System:
- Supports OCR based on Google Cloud Vision and Tesseract API.
-
Layout Modeling:
- Supports using pre-trained Deep Learning models for layout object detection using Detection2
-
Visualization:
- Supports highly-customizable presentation of the box coordinates and text in the detected layout