<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Sub-Agent Architecture on Bruce on AI Engineering</title><link>http://www.heyuan110.com/tags/sub-agent-architecture/</link><description>Recent content in Sub-Agent Architecture on Bruce on AI Engineering</description><generator>Hugo</generator><language>en</language><lastBuildDate>Mon, 30 Mar 2026 10:00:00 +0800</lastBuildDate><atom:link href="http://www.heyuan110.com/tags/sub-agent-architecture/index.xml" rel="self" type="application/rss+xml"/><item><title>Harness Engineering: After 60 Days Running an AI Coding Pipeline, the Model Was the Least Important Part</title><link>http://www.heyuan110.com/posts/ai/2026-03-30-harness-engineering-guide/</link><pubDate>Mon, 30 Mar 2026 10:00:00 +0800</pubDate><guid>http://www.heyuan110.com/posts/ai/2026-03-30-harness-engineering-guide/</guid><description>&lt;p&gt;&lt;img src="http://www.heyuan110.com/posts/ai/2026-03-30-harness-engineering-guide/cover.webp"
 alt="Harness engineering architecture — layered systems wrapping an AI agent core with guardrails, feedback loops, and monitoring"
 
 loading="lazy"
 decoding="async"
 fetchpriority="low"
 width="1200"
 height="630"
/&gt;
&lt;/p&gt;
&lt;p&gt;This is &lt;strong&gt;Part 1&lt;/strong&gt; of the Harness Engineering series. Part 2 goes deep on &lt;a href="http://www.heyuan110.com/posts/ai/2026-03-31-harness-claudemd-guide/"&gt;CLAUDE.md best practices&lt;/a&gt;. Part 3 covers &lt;a href="http://www.heyuan110.com/posts/ai/2026-04-13-harness-subagent-architecture/"&gt;sub-agent architecture and model routing&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;I have been running an AI coding harness in production for 60 days — the pipeline that writes, illustrates, and distributes posts for this blog. The single most surprising finding from that experiment: &lt;strong&gt;the model is the least important part of the agent.&lt;/strong&gt; Not irrelevant, but far from dominant. Upgrading the writer from Sonnet to Opus lifted blind-rated output quality by about 5%. Rewiring the harness — routing, context boundaries, sensors — cut end-to-end cost by roughly 60% while quality went up, not down.&lt;/p&gt;</description></item></channel></rss>