Change the repository type filter
All
Repositories list
32 repositories
uwazi
PublicUwazi is a web-based, open-source solution for building and sharing document collectionspdf_metadata_extraction
Public- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of different parts of PDF pages, identifying the elements such as texts, titles, pictures, tables and so on.
queue-processor
Publicdummy_extractor_services
Publicpdf-labeled-data
Publicuwazi-documentation
Publicml-cloud-connector
Publicpdf_ocr_service
Publicconvert-to-pdf-service
Publicpdf-tokens-type-labeler
Public- This project aims to extract Table of Contents (TOC) information from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging the segmentation and classification capabilities of the underlying analysis tool, this project automates the process of identifying and structuring the document's TOC.
pdf-text-extraction
PublicThis project aims to extract text from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging the segmentation and classification capabilities of the underlying analysis tool, this project automates the process of text extraction from PDF files.pdf-reading-order
Publicpreserve
PublicPreserve is a tool for capturing and saving online digital content. Integrated with Uwazi, Preserve captures content from websites, social media and communication platforms, and archives them with accompanying key metadata to ensure evidentiary value by establishing and demonstrating authenticity and chain of custody.uwazi-design
Publictopic-classification
Publictwitter_crawler
Publicsemantic-search
Publicmock-semantic-ml-server
Publicclassification-utils
Publicuwazi-fixtures
Public archivepython_uwazi_API
Publiccasebox
Public archiveOpenEvSys
Public archive