How four production bugs transformed a three-step prototype into a production-ready LLM pipeline. I expected to finish it in about a week. Instead, every production failure forced another ...
This is the third post in a series about AI agent harness design and token efficiency. The first post looked at what’s inside those 14,500 tokens opencode sends on every “hello”. The second covered ...