Hey thank you! OP here, yes I was struggling to find large enough documents to run the benchmarks on, the range currently on the benchmark data is ~106 B - ~190MB, which I think covers the majority of quick task workloads, but would love to have large documents, if there's an public ones you can thinking of I'd like to know!
Hey! This is the jsongrep author, just wanted to give a little intro to the jsongrep tool. I started this project as part of my undergrad research and kept iterating on it. I'm super happy with how performant it is and I have found it incredibly useful for my day-to-day work, for example combing through long OpenAPI specs and combining with other tools like `fzf` and `rg`. Happy to answer any questions about the tool!
I just published a deep dive on Ken Thompson’s Reflections on Trusting Trust, where he talks about the compiler-level backdoor he described in his 1984 Turing Award lecture that would let him login on compromised systems.
The post walks through how a compiler can teach itself to hide a Trojan that reproduces even after the source is “clean” and annotated snippets from Thompson’s actual code (shared by Russ Cox).
Would love feedback or corrections, especially from folks who’ve studied or implemented compiler bootstrapping or reproducible-build systems.
[1]: https://github.com/micahkepe/jsongrep/releases/tag/v0.8.0