Harness Engineering: After 60 Days Running an AI Coding Pipeline, the Model Was the Least Important Part

Mon, 30 Mar 2026 10:00:00 +0800

This is Part 1 of the Harness Engineering series. Part 2 goes deep on CLAUDE.md best practices. Part 3 covers sub-agent architecture and model routing.

I have been running an AI coding harness in production for 60 days — the pipeline that writes, illustrates, and distributes posts for this blog. The single most surprising finding from that experiment: the model is the least important part of the agent. Not irrelevant, but far from dominant. Upgrading the writer from Sonnet to Opus lifted blind-rated output quality by about 5%. Rewiring the harness — routing, context boundaries, sensors — cut end-to-end cost by roughly 60% while quality went up, not down.

Sub-Agent Architecture on Bruce on AI Engineering

Harness Engineering: After 60 Days Running an AI Coding Pipeline, the Model Was the Least Important Part