Manzia AITrusted Agents

Articles

Long-form on trusted agents.

Research, design notes, deployment field reports, evaluation methodology, and benchmarks. Cross-referenced with the Agent Trust Framework.

2026

July 14, 2026 · Research
The AI Agentic Landscape 2026
The 3,020 highest-traffic AI agents on the web, discovered from the Tranco Top-1M and organized by the job people hire AI to do — not by industry.
May 30, 2026 · Design
The Year We Taught a Machine to Tutor
A team built an ambitious AI tutor, watched it slowly degrade under its own safeguards, paused the pilot, and rebuilt it twice — the stubbornly simple rebuild won, decisively.
May 10, 2026 · Evaluate
Measuring agent reliability in production
Offline eval suites tell you whether your agent is good on the problems you thought to write down. Production telemetry tells you whether it's good on the problems you didn't.
May 10, 2026 · Develop
SDK patterns for trusted agents
Where the SDK ends and the trust layer begins — and why putting the guardrails inside the SDK is usually the wrong default.
May 10, 2026 · Deploy
Staged rollouts for agent deployments
Why "ship to 1% of traffic" doesn't map cleanly onto agents, and a four-stage rollout — shadow, sandbox, gated, general — that does.
May 10, 2026 · Benchmark
The state of agent benchmarks, 2026
A field guide to the benchmarks people cite, the benchmarks people ought to cite, and the gap between what they measure and what matters in production.
May 8, 2026 · Research
Welcome to Manzia Research
An introduction to Manzia's editorial line on Trusted Agents and the Agent Trust Framework family.