Skip to content
@huridocs

HURIDOCS

HURIDOCS equips human rights defenders with tools to mobilise information for justice and accountability.

Popular repositories Loading

  1. uwazi uwazi Public

    Uwazi is a web-based, open-source solution for building and sharing document collections

    TypeScript 254 81

  2. pdf-document-layout-analysis pdf-document-layout-analysis Public

    A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…

    Python 235 28

  3. casebox casebox Public archive

    Forked from KETSE/casebox

    Casebox: Secure all your information and team communication in one place

    JavaScript 49 31

  4. pdf_paragraphs_extraction pdf_paragraphs_extraction Public

    Python 49 7

  5. OpenEvSys OpenEvSys Public archive

    OpenEvSys is free open source software designed for use by organisations who need a software tool to manage information on human rights violations

    PHP 30 20

  6. pdf-text-extraction pdf-text-extraction Public

    This project aims to extract text from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging the segmentation and classification capabilities of the under…

    Python 24 1

Repositories

Showing 10 of 33 repositories
  • uwazi Public

    Uwazi is a web-based, open-source solution for building and sharing document collections

    huridocs/uwazi’s past year of commit activity
    TypeScript 254 MIT 81 454 9 Updated Jan 23, 2025
  • NER-in-docker Public

    NER-in-docker

    huridocs/NER-in-docker’s past year of commit activity
    Python 0 0 0 4 Updated Jan 23, 2025
  • trainable-entity-extractor Public

    Trainable Entity Extractor

    huridocs/trainable-entity-extractor’s past year of commit activity
    Python 0 Apache-2.0 0 0 7 Updated Jan 23, 2025
  • pdf_metadata_extraction Public

    pdf_information_extraction

    huridocs/pdf_metadata_extraction’s past year of commit activity
    Python 4 0 0 8 Updated Jan 23, 2025
  • pdf-document-layout-analysis-async Public

    pdf-document-layout-analysis-async

    huridocs/pdf-document-layout-analysis-async’s past year of commit activity
    Python 1 0 0 5 Updated Jan 23, 2025
  • pdf-document-layout-analysis Public

    A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of different parts of PDF pages, identifying the elements such as texts, titles, pictures, tables and so on.

    huridocs/pdf-document-layout-analysis’s past year of commit activity
    Python 235 Apache-2.0 28 2 6 Updated Jan 23, 2025
  • preserve Public

    Preserve is a tool for capturing and saving online digital content. Integrated with Uwazi, Preserve captures content from websites, social media and communication platforms, and archives them with accompanying key metadata to ensure evidentiary value by establishing and demonstrating authenticity and chain of custody.

    huridocs/preserve’s past year of commit activity
    TypeScript 6 MIT 1 12 7 Updated Jan 22, 2025
  • pdf_ocr_service Public

    An http service to OCR PDFs based on a redis queue.

    huridocs/pdf_ocr_service’s past year of commit activity
    Python 1 MIT 0 3 0 Updated Dec 13, 2024
  • queue-processor Public

    queue-processor

    huridocs/queue-processor’s past year of commit activity
    Python 0 Apache-2.0 0 0 0 Updated Dec 13, 2024
  • docker-translation-service Public

    docker-translation-service

    huridocs/docker-translation-service’s past year of commit activity
    Python 0 Apache-2.0 0 0 6 Updated Dec 12, 2024