FIELD GUIDE · AGENTIC TOOL LANDSCAPE
What agentic tools actually do — not what their docs claim.
Empirical intelligence for builders. We test what tools claim against what they actually do, then publish it — every claim traced to a primary source. Read it here; your agents read it via MCP. No vendor influence. No paywalled CVEs.
WHAT ARE YOU ABOUT TO DO?
FEATURED FINDING · APR 2026
All 49 findings →
The benchmark everyone cited was retired for being wrong.
WHAT THIS IS
We test, we read the issue trackers, we run the tools. Then we publish what we found. Every claim is traced to a primary source or labelled as Theory Delta's own analysis. If a number doesn't come from a primary source, it doesn't appear.
ENGINE PROVENANCE SURFACES
Public, checkable, and linked from the field guide.
Start with what you're about to do, then trace to findings mapped to each phase.
Browse task hubs →
Each finding ships with publication metadata, evidence type, and linked receipt sections.
Browse findings →
The featured finding exposes source-linked receipts so claims can be checked line by line.
Open featured receipts ↗
Fact-check sessions publish corrections and open questions so updates stay auditable.
Open latest readout →
RECENT FINDINGS
FOR AGENTS
Findings ship as structured JSON with confidence, evidence type, and source URLs. llms.txt and /.well-known/mcp.json are live for agent discovery.
{
  "mcpServers": {
    "theorydelta": {
      "type": "http",
      "url": "https://api.theorydelta.com/mcp"
    }
  }
}
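
As a rough illustration of how an agent host might consume the configuration above, the following Python sketch parses that JSON and collects the URL of each HTTP-type server. The function name http_server_urls is hypothetical, and the sketch assumes only the keys visible in the snippet ("mcpServers", "type", "url"); it makes no network calls and does not describe how any particular MCP client actually loads its configuration.

```python
import json

# The client configuration shown above, reproduced verbatim as a string.
CONFIG_TEXT = """
{
  "mcpServers": {
    "theorydelta": {
      "type": "http",
      "url": "https://api.theorydelta.com/mcp"
    }
  }
}
"""


def http_server_urls(config_text: str) -> dict:
    """Return {server_name: url} for each HTTP-type entry under "mcpServers".

    Hypothetical helper for illustration: it relies only on the keys that
    appear in the snippet above and skips entries missing a "url".
    """
    config = json.loads(config_text)
    servers = config.get("mcpServers", {})
    return {
        name: entry["url"]
        for name, entry in servers.items()
        if entry.get("type") == "http" and "url" in entry
    }


if __name__ == "__main__":
    for name, url in http_server_urls(CONFIG_TEXT).items():
        print(f"{name}: {url}")
```

Keeping the parsing step separate from any connection logic means a malformed or partial config can be inspected before a client tries to reach the endpoint.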