Introducing MCP-first
Why the next generation of software should be built as a secure, fully controllable capability layer first, and why screens are no longer the product.
Keine Treffer.
Blog
Essays on the MCP-first pattern — building, securing and auditing software that agents can drive. RSS.
Practical levers to shrink inference spend without hurting quality, prompt caching, model routing, context discipline, and capability-level budgets.
API-first was a real step forward, but it answers a different question. Here is what MCP-first adds on top, and why it matters for agent-ready software.
A vendor-neutral framework for picking and switching models, define your eval tasks, weigh cost/latency/quality, run your own evals, and keep models swappable.
A decision framework for splitting AI workloads between local and cloud models, privacy, latency, cost, and capability, plus how sensitivity should route the data.
Three durable shifts reshaping how software gets built, agents moving into production, tool/context protocols standardizing, and small models getting good and cheap.
There is a machine-readable, normative edition of the MCP-first manifest. Point an LLM at it and get a 40-rule conformance audit of any MCP server in minutes.
Patterns that keep multi-step, long-running agents reliable, task decomposition, sub-agent delegation, checkpoints and recovery, self-verification, budgets, and human gates.
Techniques that make models reliable on long, multi-step tasks, decomposition, explicit effort budgets, scratchpads, and self-verification passes.
The model is only half the system. The other half is the loop around it, observe, plan, act, verify, retry, stop, and the guardrails that keep it honest.
How to run a framework upgrade or codebase-wide refactor with agents, discovery, per-file transforms, verification, isolation, and review gates that keep it safe.
When local inference makes sense, and how quantization and right-sizing let capable open-weight models run on modest hardware, with the tradeoffs spelled out.
A practical guide to handling personal and sensitive data in AI systems, minimization, retention, redaction, audit trails, and the data-subject rights you must honor.
Agents that drive a real browser can do almost anything a user can, which is exactly why they need capabilities, confirmation, and audit, not a free hand.
Orchestrator/worker patterns, specialization, and shared context, plus the failure modes (cost blowups, loops, compounding errors) and how to contain them.
Model-level alignment is not enough. Real safety comes from the system around the model, input/output checks, permissions, confirmation, and audit.
What embeddings are, how similarity search works, and how to choose and operate a vector store without over-engineering it.
Retrieval-augmented generation explained, chunking, embedding, retrieval, and the discipline of feeding a model only the context a task actually needs.
How models go from text to action through typed tool definitions, and why the quality of your tool schemas decides how reliable your agent is.