More

bobkb · 2026-06-16T17:00:45 1781629245

Will cursor launch a CLI tool like Claude/codex/opencode/pi ?

thisisit · 2026-06-16T17:01:59 1781629319

Grok has its own CLI tool called Grok Build: https://x.ai/news/grok-build-cli

mattnewton · 2026-06-16T17:03:36 1781629416

they have had one for a while now. https://cursor.com/cli

agluszak · 2026-06-16T17:02:18 1781629338

They already have. https://cursor.com/cli

bobkb · 2026-06-15T15:34:00 1781537640

I assume your idea is, if the spec and the proof is verified the code generated is good enough as well ?

addaon · 2026-06-15T18:29:00 1781548140

Today, I write the code. It’s trivial and takes a lot less time than writing the spec, and since I’m using conventional tooling for WCET and stack sizing it’s nice to get those right up front. The LLMs sometimes tweak the code slightly for provability, but this is usually either direct operator replacement (shift with multiplication, and with modulus, etc) or factoring out a block to a function to tie a contract onto it, both of which I trust my compiler to undo (simple arithmetic operations and inlining, respectively) with zero to minimal impact on the generated binary.

bobkb · 2026-06-14T15:08:52 1781449732

I have been testing formal verification methods with multiple products. It will be great to also understand more about what’s tried and how it was done. For example attempting to verify the spec is what I have been trying to implement.

bobkb · 2026-06-09T18:25:13 1781029513

In an interesting coincidence I ended up watching Person of Interest S4 E5 while reading the announcement. The series showed some code supposedly belonging to to an AI.

Fable 5 said the first screen shot is from “ IDA Pro’s Hex-Rays decompiler” and a windows driver. The second screenshot triggered the safety guard rails and pushed me into Haiku.

Apparently the code is Windows driver code.

bobkb · 2026-06-07T16:18:34 1780849114

It’s impossible to write a spec that’s not ambiguous , complete and correct in natural languages. Thus prompts will always generate unreliable software.

bobkb · 2026-06-07T16:11:19 1780848679

IMHO even if we are using auditing tools I believe we must use deterministic tools for critical analysis like this. Such rule and pattern based systems may not scale beyond certain point but they can be accurate.

bobkb · 2026-06-07T10:20:46 1780827646

At work we are now in the process of migrating away from Figma. We had spend years perfecting our Figma based design workflow. Currently we are moving all the designs into the code itself using Storybook. The gap currently is reviews and feedback which is addressed by Chromatic now.

bobkb · 2026-06-06T14:26:13 1780755973

I tried building a deliberately vague project around managing MCP servers [0]. The purpose was to find what LLMs and agents can do. While the project didn’t reach anywhere I was amazed by how it’s possible to navigate even with no clear direction. The ability of the “glorified auto-complete” system to pull off something this sort was an eye opener for me.

0. https://github.com/bobinson/aop1

bobkb · 2026-06-05T11:02:51 1780657371

False positives from the deterministic audits a very difficult problem to address. Comparing and deduplicating across different methods or LLM audits seems to the only way.

bobkb · 2026-06-04T22:08:09 1780610889

I think these audit tools can look beyond just security and can look for compliance audits as well. The ability to audit real targets in staging environments makes it easy to identify issues.