Merge pull request #14 from cceckman/5min

Content for RC presentation
cceckman · Jun 28, 2024 · b5d7cb3
2 parents 7c1230e + d12d9ac

Showing 4 changed files with 550 additions and 0 deletions.
diff --git a/talk/.gitignore b/talk/.gitignore
@@ -2,4 +2,5 @@
 *.aac
 *.wav
 *.mp4
+*.jpg
 
diff --git a/talk/5min.md b/talk/5min.md
@@ -0,0 +1,293 @@
+# Solving the Maybe-Halting problem
+
+...in 5 minutes.
+
+https://github.com/cceckman/sarah-connor
+
+???
+
+Press P to enter presenter mode.
+
+From there, press C to open a cloned window to present from.
+
+---
+
+## Motivation
+
+Charles used to work on embedded software.
+
+- **Interrupt-service routines**: Stop regular processing while running.
+- **Preemption-disabled regions**: Disable interrupts while running.
+- **Mutexes**: Only one thread can hold the lock at a time.
+
+As a design principle, we always want these to run in **a bounded amount of time**.
+
+**Can we check it automatically?**
+
+---
+
+### The Halting Problem
+
+*Isn't that the halting problem?*
+
+<!-- ...is what my coworkers said. Quick refresher: -->
+
+> Is it possible to write a program `A` that,  
+> 
+> given any program `B` and input `I`,
+>
+> reports whether or not `B(I)` terminates?
+
+Answer:
+
+<!-- Alan Turing's 1936 result. Conclusively established: -->
+
+> **No, you cannot write such a program.**
+
+<!-- ...but we're not actually trying to solve that problem. -->
+
+Rice's theorem generalizes this to every non-trivial semantic property of programs.
+
+---
+
+### The Maybe Halting Problem
+
+<!-- We just want something that can _sometimes_ give an answer. -->
+
+> Is it possible to write a program `A` that,
+>
+> given any program `B` and input `I`,
+>
+> reports one of:
+>
+> - `B(I)` terminates
+> - `B(I)` does not terminate
+> - `A` cannot determine whether `B(I)` terminates
+
+Can we do that?
+
+
+> **Yes.**
+
+```python
+def analyze(b, i):
+  return "I don't know"
+```
+
+...can we do something _useful_ though?
+
+---
+
+## It works!
+
+```c
+volatile int x = 0;
+
+int bounded_loop() {
+    for(int i = 0; i < 10000; i++) {
+        x += i;
+    }
+    return x;
+}
+
+int unbounded_loop() {
+    while(1) {
+        x += 1;
+    }
+    return x;
+}
+
+int main() {
+    int x;
+    [[clang::noinline]] x = bounded_loop();
+    [[clang::noinline]] x = unbounded_loop();
+    x += 1;
+    return x;
+}
+```
+
+---
+
+## It works!
+
+```
+Function name: bounded_loop
+Result: Bounded
+Explanation: includes a loop, but it has a fixed bound
+
+Function name: unbounded_loop
+Result: Unknown
+Explanation: includes loop with indeterminate bounds
+
+Function name: main
+Result: Unknown
+Explanation: via call to unbounded_loop: includes loop with indeterminate bounds
+```
+
+---
+
+## Sarah Connor
+
+LLVM analysis pass (C, C++, Rust, Zig if you want)
+
+If:
+
+- The call graph terminates
+- Each function terminates
+- Each instruction terminates
+
+The program terminates!
+
+---
+
+### Call graph (good)
+
+```c
+int add(int a, int b) {
+  return a + b;
+}
+
+int mult(int a, int b) {
+  int x = 0;
+  for(int i = 0; i < a; i++) {
+    x = add(x, b);
+  }
+  return x;
+}
+```
+
+![Call graph of the above code](mult.svg)
+
+---
+
+### Call graph (bad: recursion)
+
+```c
+int add(int a, int b) {
+  return a + b;
+}
+
+int fib(int n) {
+  return n < 2 ? n : add(fib(n-1), fib(n-2));
+}
+```
+
+![](fib.svg)
+
+**"I don't know"**
+
+???
+
+If the call graph is a DAG, and each function completes, eventually the program completes.
+
+If the call graph has a cycle, there's recursion. The program still completes if the recursion is bounded.
+
+We haven't implemented any reasoning about whether recursion is bounded,
+so we classify this as "I don't know."
+
+(This is also the conservative choice for embedded code because of limited stack size; recursion needs to be bounded there for that reason anyway.)
+
+---
+
+### Control flow
+
+```c
+int mult(int a, int b) {
+  int x = 0;
+  for(int i = 0; i < a; i++) {
+    x = add(x, b);
+  }
+  return x;
+}
+```
+
+![](mult-cfg2.svg)
+
+**Is the loop bounded?**
+
+"I don't know"
+
+---
+
+### Control flow (better)
+
+```c
+int mult(int a, int b) {
+  int x = 0;
+  for(int i = 0; i < a; i++) {
+    x = add(x, b);
+  }
+  return x;
+}
+```
+
+![](mult-cfg2.svg)
+
+**Is the loop bounded?**
+
+Use LLVM analysis!
+
+"Loop is bounded" --> "Function terminates"
+
+???
+
+There's a decent amount of literature on loop unrolling and/or bounding.
+It winds up being an important optimization that compilers can perform.
+
+LLVM has built-in analysis passes; we can request their results.
+Including "is this loop bounded"!
+
+LLVM's analysis is, like ours, conservative; it sometimes can't find a bound
+when one in principle exists. That's OK; we treat "no bound found" as
+"I don't know".
+
+---
+
+### Instructions terminate
+
+**Assumed.**
+
+(Not true. Ask Charles for fun stories.)
+
+???
+
+Not necessarily true, in an embedded context.
+
+But out of scope of this checker.
+
+---
+
+### It doesn't work!
+
+- Escape hatches (own code, LLVM intrinsics)
+
+  We couldn't figure out the LLVM infrastructure for this
+
+- LLVM result invalidation (bookkeeping)
+
+???
+
+Some things that are necessary for it to be really usable, but that we haven't worked out yet.
+
+---
+
+### It could be better!
+
+- Cross-module analysis
+- Recursion
+- Indirect calls
+- Test it on Rust (in principle "just works"?)
+
+???
+
+A bunch of ways that it could be better.
+
+---
+
+## Thanks!
+
+https://github.com/cceckman/sarah-connor
+
+
+.bigimg[![](terminator.jpg)]
+
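The rule from the "Sarah Connor" and "Call graph" slides (a function is bounded only if its own loops are bounded and everything it calls is bounded, with any recursion reported as "I don't know") can be sketched in a few lines of Python. This is only an illustration of the idea, not the LLVM pass from the repository: `Verdict`, `analyze_call_graph`, and the dictionary inputs are hypothetical names, and the per-function loop verdicts are supplied by hand here, where the real tool would ask LLVM for them.

```python
from enum import Enum


class Verdict(Enum):
    BOUNDED = "Bounded"
    UNKNOWN = "Unknown"


def analyze_call_graph(call_graph, local_verdict):
    """Combine per-function verdicts along the call graph.

    call_graph:    function name -> list of callee names (hypothetical input).
    local_verdict: function name -> Verdict for the function's own loops,
                   ignoring calls. Missing entries count as UNKNOWN.
    """
    results = {}
    in_progress = set()  # functions currently on the DFS stack

    def visit(fn):
        if fn in results:
            return results[fn]
        if fn in in_progress:
            # Back edge in the call graph: recursion, so "I don't know".
            return Verdict.UNKNOWN
        in_progress.add(fn)
        verdict = local_verdict.get(fn, Verdict.UNKNOWN)
        for callee in call_graph.get(fn, []):
            if visit(callee) is Verdict.UNKNOWN:
                verdict = Verdict.UNKNOWN
        in_progress.discard(fn)
        results[fn] = verdict
        return verdict

    for fn in call_graph:
        visit(fn)
    return results


# The example from the "It works!" slides, with loop verdicts filled in by hand.
results = analyze_call_graph(
    {"main": ["bounded_loop", "unbounded_loop"],
     "bounded_loop": [],
     "unbounded_loop": []},
    {"main": Verdict.BOUNDED,
     "bounded_loop": Verdict.BOUNDED,
     "unbounded_loop": Verdict.UNKNOWN},
)

# The recursive example: fib calls itself, so it is never reported as bounded.
recursive = analyze_call_graph(
    {"fib": ["add", "fib"], "add": []},
    {"fib": Verdict.BOUNDED, "add": Verdict.BOUNDED},
)
```

The sketch never reports "does not terminate". Like the output shown on the slides, it only distinguishes "Bounded" from "Unknown", which keeps it conservative.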