init

kazcw · Oct 20, 2021 · 153540a
commit 153540a

Showing 6 changed files with 659 additions and 0 deletions.
diff --git a/.gitignore b/.gitignore
@@ -0,0 +1 @@
+/target
diff --git a/Cargo.lock b/Cargo.lock
diff --git a/Cargo.toml b/Cargo.toml
@@ -0,0 +1,14 @@
+[package]
+name = "jerbs"
+version = "0.1.0"
+edition = "2018"
+authors = ["Kaz Wesley <[email protected]>"]
+description = "Command-line work-stealing scheduler."
+repository = "https://github.com/kazcw/jerbs"
+license = "GPL-3.0"
+categories = ["command-line-utilities"]
+exclude = [".gitignore"]
+
+[dependencies]
+clap = "2.33"
+rusqlite = "0.26"
diff --git a/README.md b/README.md
@@ -0,0 +1,59 @@
+# jerbs
+
+Command-line work-stealing scheduler.
+
+## Operation
+
+Create a job database:
+```
+$ jerbs work.db new
+```
+
+Define a job and enqueue some repetitions:
+```
+$ jerbs work.db new-job --count 17 <<< "info for thing to do 17 times"
+1
+```
+The output is the job id, which you can use to edit the job later.
+
+See what's scheduled:
+```
+$ jerbs work.db list-jobs -v
+1       17      "info for thing to do 17 times"
+```
+(Note: do not use verbose output (`-v`) for scripting. It is intended to be
+human-readable and the format is unstable.)
+
+Run a worker:
+```
+$ while read -r JOB < <(jerbs work.db take $$); do echo "$JOB"; done
+```
+Now start some more!
+
+## Typical Usage
+
+I made this so I could have a tmux with a worker process in each pane, all
+taking jobs from the same queue. The worker processes run a shell script that
+uses this utility to pick the next job.
+
+A job's payload is a blob of data. What's in the blob is up to you. If a job
+needs multiple parameters, the blob could be a filename indicating where to
+find the job data; or you might pack the data directly into the blob using a
+delimiter-based format or JSON (built with `jq`, for example).
+
+Worker IDs can be any UTF-8 string. If your worker is a bash script, you can
+pass `$$` to use your worker's PID.
+
+Because the data blob for your task may contain characters that the shell
+would expand or split (quotes, `$`, whitespace), any command that requires a
+blob reads it from standard input by default. If your blobs are shell-safe,
+you can instead use `--data` to pass the blob as an argument.
+
+## Comparison to alternatives
+
+Other work-stealing schedulers (like GNU Parallel) are frameworks; they own the
+worker processes, so you can only configure workers through the framework.
+`jerbs` inverts this paradigm: `jerbs` is a utility to be used from your worker
+script. With `jerbs` you can easily assign unique resources to the workers, pin
+workers to CPUs/NUMA nodes, or dynamically vary the number of simultaneous
+jobs. At last, the workers control the means of production.
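
The README suggests packing several parameters into one blob with a delimiter-based format. A minimal sketch of that idea in POSIX shell follows. The helper names `pack_blob` and `unpack_blob` and the choice of a tab as delimiter are assumptions made for illustration; they are not part of `jerbs`.

```shell
#!/bin/sh
# Hypothetical helpers (not part of jerbs): join fields into one
# tab-delimited blob, and split such a blob back into variables.
# Fields must be non-empty and must not contain tabs or newlines.

TAB=$(printf '\t')

# pack_blob FIELD...  -> prints the fields joined by tabs on one line.
pack_blob() {
    # Subshell keeps the IFS change local; "$*" joins with IFS's first char.
    ( IFS=$TAB; printf '%s\n' "$*" )
}

# unpack_blob VAR...  -> reads one blob from stdin into the named variables.
unpack_blob() {
    IFS=$TAB read -r "$@"
}

# Producer side: build a blob from three parameters.
blob=$(pack_blob "input.csv" "out dir/result.json" 42)

# Worker side: recover the parameters (spaces inside a field survive).
unpack_blob infile outfile seed <<EOF
$blob
EOF
printf 'infile=%s outfile=%s seed=%s\n' "$infile" "$outfile" "$seed"
```

On the producer side the packed blob could then be piped to `jerbs work.db new-job --count 1`, since blobs are read from standard input by default.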
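
Because the worker script owns its own process, per-worker resources can simply live in the script's environment. The sketch below shows one way a worker loop might be structured. `run_worker`, `handle_job`, and `WORKER_RESOURCE` are hypothetical names, and the loop assumes that the take command prints nothing (or exits non-zero) when no work is left, which the README does not spell out.

```shell
#!/bin/sh
# Hypothetical worker skeleton (names are illustrative, not part of jerbs).

# run_worker HANDLER CMD [ARGS...]
# Runs CMD repeatedly to fetch one job payload at a time and passes each
# payload to HANDLER. Stops when CMD fails or prints an empty payload
# (assumed to mean "queue empty").
run_worker() {
    handler=$1
    shift
    while job=$("$@") && [ -n "$job" ]; do
        "$handler" "$job"
    done
}

# Example handler: reports which worker and which resource ran the job.
# WORKER_RESOURCE is an assumed variable the caller sets per worker,
# e.g. a CPU list or a device name.
handle_job() {
    printf 'worker %s (resource %s): %s\n' \
        "$$" "${WORKER_RESOURCE:-none}" "$1"
}

# Intended use in one tmux pane, with the take command from the README:
#   WORKER_RESOURCE=cpu0
#   run_worker handle_job jerbs work.db take "$$"
```

Passing the fetch command as arguments keeps the loop independent of how jobs are obtained, so the same skeleton can be exercised with a stand-in command before pointing it at a real job database.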