Directory

Every agent. Every verdict.

Sorted by Verdict Score. Click any agent for the full breakdown — sub-scores, failure modes, and what we tested.

1 Verified, 11 Provisional, 0 Needs retest, 0 Retired

All categories, Coding Agents, Browser Agents, Business Automation Agents, Research Agents, Sales Agents, Customer Support Agents, Data Analysis Agents, Content Agents, No-Code Agent Builders

Provisional scores in effect

AgentVerdict is in public build mode. Initial agent scores are research-based provisional estimates, not the result of completed benchmark runs. They cannot be used as final proof and they are not eligible for "Verdict Certified" status. Verified scores will replace provisional scores as controlled test runs are published on the runs page.

Scores marked with * in the directory are provisional. Look for the Verified status badge to find evidence-backed scores.

Anthropic's terminal-native coding agent. Reads, edits, and runs code across full repositories with explicit user approval for destructive actions.

AI-first IDE forked from VS Code with inline completions, chat, and an agent mode that executes multi-file edits.

AI-first IDE from the team behind Codeium. Cascade agent flow blends inline edits with multi-step actions.

Useful but limited

Open-source command-line coding assistant that pairs with you in your terminal and commits changes via git.

Useful but limited

OpenAI Codex CLI

OpenAI's open-source terminal coding agent. Runs locally, executes code, and edits files with user approval.

Useful but limited

Open-source VS Code extension that runs an agentic coding loop with your choice of model.

Useful but limited

GitHub Copilot Coding Agent

GitHub's coding agent that picks up assigned issues and opens PRs from the GitHub UI, in addition to in-IDE Copilot features.

Useful but limited

Replit's in-browser agent that scaffolds and ships apps end-to-end inside the Replit workspace.

Coding Agents, No-Code Agent Builders

Useful but limited

Zapier's AI agents layer sits on top of its 7,000+ app integrations, letting non-engineers build agents that act across SaaS tools.

Business Automation Agents

Useful but limited

Open-source coding agent (formerly OpenDevin) from All Hands AI. Runs locally or self-hosted, model-agnostic.

Risky / inconsistent

Cognition's autonomous AI software engineer. Runs in its own cloud sandbox and aims to complete tickets end-to-end.

Risky / inconsistent

General-purpose AI agent that runs in a managed cloud sandbox to browse the web and complete multi-step tasks.

Browser Agents, Research Agents

Risky / inconsistent