How It Works

This page walks through the analysis pipeline end to end. For positioning, benchmarks, and what to expect in the report, read What is AuditAgent.

What the agent receives

Before each scan you select three things.

The codebase. A repository, a branch, and the contract files you want analyzed. See Which ecosystems we support for the supported languages. Up to 100 contracts and 12,000 BLoC per scan.

Documentation. READMEs from the repo, public URLs, PDF/MD/MDX uploads, and free-form notes. The agent reads all of it and folds it into its working context.

Clarifying answers. Optional answers to six structured prompts about deployment chains, trust assumptions, and design choices. See Additional Questions for the full list.

The richer the context, the better the output. Documentation is how the agent disambiguates intent. A function called transfer could be ERC-20, ERC-721, or a custom escrow withdrawal. Tell the agent which one it is and you get fewer false positives.

What runs during a scan

Multiple large language models analyze your contracts in coordination. The exact pipeline depends on the tier you pick.

Developer Scan. 1x compute, lightweight models, up to 1 hour. Designed to fit a code review or pre-PR loop.

Auditor Scan. 5x compute, the most expensive models, up to 5 hours. It adds three things the Developer Scan does not have.

Multi-Agent System. Specialised agents working in coordination.
Attacker Model. The same code re-read from a hostile perspective.
Internet Search. Live context on known vulnerabilities and protocol patterns.

These three extras are why the Auditor Scan has materially higher recall on harder findings. The Difficulty Benchmark page has the per-tier numbers.

Persistent Memory in the pipeline

If you have initialised Persistent Memory for a project, every scan after that runs against the accumulated context: architecture, invariants, hypotheses, and contradictions from prior scans. Subsequent scans propose updates to the memory, which you review before they persist. Without memory, every scan starts from a blank slate.

What the agent does not see

Files outside your selected scope are not considered by the agent in full depth. Unselected code is still used to some capacity as context for our analysis, but the agent specifically focuses on your selected contracts and has less context on the rest.
Production state, balances, or on-chain context, unless you describe it in documentation.
External dependencies in Developer scan, or if we can't install them in Auditor scan.

This matters when you are triaging false positives. If a finding looks wrong, the first question to ask is whether the agent actually understood what the function does in your protocol. If it did not, add better documentation and rescan.

What you get back

The scan produces findings, a code summary and architecture diagram, generated invariants, a 0 to 100 security score, an AI Chat surface per finding, and an email with the audit report PDF attached. See What you get back for the full breakdown.

What the agent receives​

What runs during a scan​

Persistent Memory in the pipeline​

What the agent does not see​

What you get back​

What the agent receives

What runs during a scan

Persistent Memory in the pipeline

What the agent does not see

What you get back