Skip to content

This is a notebook that provides access to the linguistic and socio-linguistic annotations that were added to the letters by P.C van Hooft in an annotation project in 2017 by Marjo van Koppen and Marijn Schraagen.

License

Notifications You must be signed in to change notification settings

knaw-huc/brieven-van-hooft-notebook

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Brieven van Hooft - Notebook

This is a notebook that provides access to the linguistic and socio-linguistic annotations that were added to the letters by P.C van Hooft in an annotation project in 2017 by Marjo van Koppen and Marijn Schraagen. The letters come from "De briefwisseling van Pieter Corneliszoon Hooft, edited by H.W van Tricht e.a.,", as published by the DBNL in the following three parts:

License information for these works can be found here. We did not receive the rights to publish the editorial parts of the texts that are not from the 17th century. They will still be available in this notebook as they can be downloaded from DBNL directly, but republishing them is not permitted unfortunately. The notebook's code itself is under the GNU General Public License v3.

The annotations were initially published in a combination of FoLiA XML and other stand-off formats. In 2024, they have been re-aligned with the original DBNL sources and published as a STAM model. You can also inspect the full pipeline that produced this model.

This Marimo notebook provides search and visualisation functionality on this STAM model. We will guide you through several examples. All code in this notebook can be executed, and if needed, modified to your liking.

Installation & Usage

Using Docker

  1. In a terminal run: docker pull proycon/brieven-van-hooft-notebook
    • Alternatively, if you don't want to use the command line, you can install Docker Desktop and search for proycon/brieven-van-hooft-notebook.
  2. Then run: docker run -p 8080:8080 proycon/brieven-van-hooft-notebook
    • (Or start it from Docker Desktop)
  3. Point your browser to http://localhost:8080, initial loading may take a while

Note that the notebook runs locally on your system in a container, only the models and texts will be automatically downloaded once on first run. This may take a while.

Steps 2 and 3 need to be repeated whenever you want to open the notebook again. The rest only needs to be done once.

Local installation

First make sure you have an up-to-date Python installation on your system. Windows users can [https://www.python.org/downloads/windows/](obtain Python from the official website), make sure to select the option Add to PATH during installation.

Then from the command line, do:

  1. Clone this git repository: git clone https://github.com/knaw-huc/brieven-van-hooft-notebook
  2. Make a virtual environment: python3 -m venv env
  3. Activate the virtual environment: source env/bin/activate (on UNIX systems), on Windows PowerShell do env\scripts\Activate.ps1 instead).
  4. Install all dependencies: python3 -m pip install -r requirements.txt.
    • On Windows this may give a warning that you need to add a path to your PATH this can be done with something like $env:PATH += ";C:\Users\yourusername\AppData\Local\Packages\PythonSoftwareFoundation.Python.3.12_qbz5n2kfra8p0\LocalCache\local-packages\Python312\Scripts (copy the actual path from the warning).
  5. Run the notebook: marimo run brieven-van-hooft-notebook.py

The notebook will open in your default web-browser and automatically run. Note that it runs locally, only the models and texts will be automatically downloaded once on first run. This may take a while.

Steps 3 and 5 need to be repeated whenever you want to open the notebook again. The rest only needs to be done once.

Troubleshooting

  • Downloading and loading the data may take a while! You will see an hourglass symbol in the top-left corner.
  • If you visualise a lot of letters or an entire edition, you may run into an error Your output is too large. You can set a higher output limit as follows:
    • If you use docker, pass --env MARIMO_OUTPUT_MAX_BYTES=80_000_000 (80MB, the default is 40) when doing docker run.
    • If you use a local installation, do export MARIMO_OUTPUT_MAX_BYTES=80_000_000 prior to marimo run.

About

This is a notebook that provides access to the linguistic and socio-linguistic annotations that were added to the letters by P.C van Hooft in an annotation project in 2017 by Marjo van Koppen and Marijn Schraagen.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published