Blog

Field notes on agent-ready software.

Essays on the MCP-first pattern: building, securing and auditing software that agents can drive.

Jun 11, 2026 · MCP-first · 2 min read

Introducing MCP-first

Why the next generation of software should be built as a secure, fully controllable capability layer first, and why screens are no longer the product.

MCP-first Architecture

Jun 10, 2026 · MCP-first · 5 min read

Cut your LLM bill by 50 to 90%: caching, routing, right-sizing

Practical levers to shrink inference spend without hurting quality: prompt caching, model routing, context discipline, and capability-level budgets.

LLM Ops Cost

Jun 9, 2026 · MCP-first · 1 min read

API-first was for developers. MCP-first is for agents.

API-first was a real step forward, but it answers a different question. Here is what MCP-first adds on top, and why it matters for agent-ready software.

API-first Agents Architecture

Jun 8, 2026 · MCP-first · 5 min read

Choosing and upgrading LLM models without the hype

A vendor-neutral framework for picking and switching models: define your eval tasks, weigh cost/latency/quality, run your own evals, and keep models swappable.

Models LLM Ops

Jun 7, 2026 · MCP-first · 5 min read

On-device or cloud? Choosing where your models run

A decision framework for splitting AI workloads between local and cloud models: privacy, latency, cost, and capability, plus how data sensitivity should decide where each request runs.

Models Architecture

Jun 6, 2026 · MCP-first · 5 min read

Agents, MCP, and the small-model cost crash: where 2026 is heading

Three durable shifts reshaping how software gets built: agents moving into production, tool/context protocols standardizing, and small models getting good and cheap.

Trends Agents MCP

Jun 6, 2026 · MCP-first · 2 min read

Audit your MCP server with /manifest.ai

There is a machine-readable, normative edition of the MCP-first manifest. Point an LLM at it and get a 40-rule conformance audit of any MCP server in minutes.

Security Audit Tools

Jun 5, 2026 · MCP-first · 7 min read

Design patterns for long-horizon agents

Patterns that keep multi-step, long-running agents reliable: task decomposition, sub-agent delegation, checkpoints and recovery, self-verification, budgets, and human gates.

Agents Architecture

Jun 4, 2026 · MCP-first · 4 min read

Prompting for long-horizon reasoning: effort and self-checks

Techniques that make models reliable on long, multi-step tasks: decomposition, explicit effort budgets, scratchpads, and self-verification passes.

Prompting Agents

Jun 3, 2026 · MCP-first · 5 min read

Loop engineering: designing the system that drives the agent

The model is only half the system. The other half is the loop around it (observe, plan, act, verify, retry, stop) and the guardrails that keep it honest.

Agents Engineering

Jun 2, 2026 · MCP-first · 5 min read

Using LLM agents for large-scale code migrations

How to run a framework upgrade or codebase-wide refactor with agents: discovery, per-file transforms, verification, isolation, and review gates that keep it safe.

Agents Engineering

May 31, 2026 · MCP-first · 7 min read

Self-hosting open models without a GPU farm

When local inference makes sense, and how quantization and right-sizing let capable open-weight models run on modest hardware, with the tradeoffs spelled out.

Models Self-hosting

May 29, 2026 · MCP-first · 6 min read

AI data retention and compliance: what to get right

A practical guide to handling personal and sensitive data in AI systems: minimization, retention, redaction, audit trails, and the data-subject rights you must honor.

Security Compliance

Mar 17, 2026 · MCP-first · 5 min read

Browser-using agents: power and peril

Agents that drive a real browser can do almost anything a user can, which is exactly why they need capabilities, confirmation, and audit, not a free hand.

Agents Security

Feb 24, 2026 · MCP-first · 6 min read

Multi-agent systems: when one agent isn't enough

Orchestrator/worker patterns, specialization, and shared context, plus the failure modes (cost blowups, loops, compounding errors) and how to contain them.

Agents Architecture

Feb 3, 2026 · MCP-first · 5 min read

Guardrails: building AI that's safe by design

Model-level alignment is not enough. Real safety comes from the system around the model: input/output checks, permissions, confirmation, and audit.

Security Safety

Jan 15, 2026 · MCP-first · 4 min read

Vector databases and embeddings: a practical primer

What embeddings are, how similarity search works, and how to choose and operate a vector store without over-engineering it.

Infra RAG

Dec 9, 2025 · MCP-first · 5 min read

RAG: giving models the right context, not all of it

Retrieval-augmented generation explained: chunking, embedding, retrieval, and the discipline of feeding a model only the context a task actually needs.

Context RAG

Nov 18, 2025 · MCP-first · 5 min read

Function calling and tool use: how agents actually act

How models go from text to action through typed tool definitions, and why the quality of your tool schemas decides how reliable your agent is.

Agents Tools