Notes fromthe lab.

We write up what we learn shipping agents into production — evals, integration gotchas, cost traps, post-mortems. No thought leadership, no roadmap teasers. Just the parts that were hard, and what we did about them.

+
+

Six threads we keep pulling on

The archive is short on purpose — one essay published, more in progress. These are the threads each piece comes from. We post when there is something worth saying, not on a schedule.

01

Agent harnesses

How we wire Claude Code around business logic — tools, context windows, and the guardrails that keep an agent from improvising past its remit.

02

Pipelines

Ingestion, OCR, classification, reconciliation. Moving messy real-world input through deterministic steps where a wrong answer would cost money.

03

Workflow tools

When n8n, Zapier and visual builders earn their keep — and the point at which a code harness becomes the simpler, more honest thing to maintain.

04

Reliability

Evals, fingerprinting, idempotency, human-in-the-loop approval. Making failure modes mechanically impossible instead of "we will watch for it".

05

Dev infra

Cron, queues, Postgres as the source of truth, audit trails. The unglamorous plumbing that decides whether a system survives its first month in production.

06

Field notes

Shorter observations from active builds — integration gotchas, cost traps, and the small decisions that turn out to matter more than the architecture diagram.

No newsletter.Just follow along.

We have not built an email list, so there is nothing to subscribe to. New essays land on GitHub and LinkedIn first. If you want a heads-up when the next one ships, the surest way is to reach out directly.