The Agent Commerce Stack: Standards Are Ready, the Economy Isn't
Three Ethereum standards now cover payments, identity, and escrow for autonomous agents. Google and OpenAI are building competing stacks. The specs exist. Zero production commerce has happened on them.
Three standards, one stack
Six months ago, the infrastructure for autonomous agents to pay each other didn’t exist. Now there are three Ethereum standards that, together, cover the full lifecycle of an agent commercial transaction.
x402 handles payments. An agent requests a resource, the server responds with a price, the agent pays in stablecoins on-chain, and the resource is delivered. HTTP-native, synchronous, no accounts or APIAPIApplication Programming Interface. A structured way for one piece of software to talk to another. In DeAI, APIs let applications request inference from a model without running the model themselves.Like a waiter in a restaurant. You don't walk into the kitchen and cook your own meal. You tell the waiter what you want, they tell the kitchen, the kitchen cooks it, and the waiter brings it back. The API is the waiter.Read more → keys. Our analysis found $1.6 million in verified monthly volume after Artemis Analytics applied a wash trade filter. That’s not a big number. But it’s $1.6 million more than existed a year ago.
ERC-8004 handles identity. Each agent gets an on-chain NFTNFTNon-Fungible Token. A unique blockchain-tracked asset where each token is distinguishable from every other. Where regular tokens are interchangeable, NFTs represent unique items like art, collectibles, in-game assets, or domain names.Like the difference between a $20 note and a signed first-edition novel. The notes are interchangeable, any $20 buys the same thing as any other. The book is one of a kind, and its value depends entirely on which specific book it is.Read more → identifier with a capability card, reputation scores from completed work, and a validation registry for independent quality checks. Authors from MetaMask, Google, Coinbase, and the Ethereum Foundation. Deployed to mainnet on 29 January 2026 across 20+ EVM chains. Registration counts vary wildly by source: Chainstack reported 49,283 by mid-February; other estimates reach 130,000 by mid-March. The registrations are trivially cheap to create, so the raw count tells you more about gasGasThe fee paid to a blockchain to process a transaction. Gas is denominated in the chain's native token and varies with network demand. Sending a transaction without enough gas means the transaction fails and the gas is still consumed.Like the petrol that powers a car. You need to put petrol in to make the engine run. The amount of petrol you need depends on how far you're driving and how much you're carrying. If you run out, the car stops.Read more → costs than adoption.
ERC-8183 handles commerce. This is the newest piece, submitted 25 February 2026. It defines a “Job” primitive: an on-chain escrow wrapper around a unit of work, with three roles (client, provider, evaluator) and a four-state lifecycle (Open, Funded, Submitted, Terminal). The evaluator decides whether the work is good enough to release payment. If not, the client gets a refund. No appeals. No arbitration. Reject is final.
The Crypto-Native Agent Commerce Stack
The stack is architecturally complete. Payments, identity, and commerce are covered. The problem is that almost none of it is in production.
What ERC-8183 actually does
An agent needs a dataset cleaned. It creates a Job on-chain, naming a provider (the agent doing the work), an evaluator (the independent party judging the output), a budget, and an expiry time. Client funds the escrow. Provider submits a deliverable: a hash pointing to off-chain content on IPFS or Arweave. Evaluator inspects and either calls complete() to release payment, or reject() to refund the client.
That’s it. Four transactions. One escrow. One decision-maker.
ERC-8183 Job Lifecycle
No dispute resolution. No appeal. Reject is final. The evaluator is the sole decision-maker once work is submitted.
The evaluator is where the design gets interesting and where it breaks down. Deliberately left as “just an address,” it can be the client themselves, a smart contractSmart ContractA program stored on a blockchain that runs automatically when its conditions are met. Smart contracts are how blockchains do anything beyond just transferring tokens — DeFi, NFTs, DAOs, and DeAI infrastructure all run on smart contracts.Like a vending machine. You put in the right input and it produces the expected output, no human operator required. The rules are fixed in the machine itself, anyone can use it, and nobody can stop a transaction in the middle.Read more →, an AI agent, a multisig, a DAODAODecentralised Autonomous Organisation. A way to coordinate decisions and manage a treasury using token-weighted voting instead of a traditional company structure. Token holders propose and vote on changes directly.Like a shareholder-run company where every shareholder can vote on every decision, the votes are public, and the company can't do anything the shareholders don't approve. The coordination is messier than a normal company but nobody has unilateral control.Read more →, or a ZKZKZero Knowledge. A class of cryptographic proofs that let you prove something is true without revealing any of the underlying information. ZK lets a network verify a transaction without seeing the transaction's contents.Like proving you know the password to a safe by demonstrating you can open it, without ever saying the password out loud. The verifier learns that you know the password and nothing more.Read more → verifier. This flexibility is the point: the spec doesn’t prescribe trust, it provides a slotSlotA fixed time interval during which a single block can be produced on a Proof of Stake blockchain. Slots are the smallest unit of consensus time and group together to form epochs.Like a 12-second window in a relay race during which exactly one runner is supposed to be on the track. If the runner shows up and completes their leg, the race continues. If they're absent, the slot is empty and the next runner starts the next slot.Read more → for trust mechanisms to plug into.
One developer building a production evaluation system noted on the Ethereum Magicians forum: “The Evaluator is where the real complexity lives.” For objective tasks (did the code compile, did the API return the right data), programmatic evaluation works. For subjective tasks (is this report good enough, does this design meet the brief), building a reliable evaluator “required significant iteration.”
Security considerations in the spec are blunt: “A malicious evaluator can complete or reject arbitrarily.” There’s no dispute resolution. No appeal. No escrow hold period. If the evaluator is compromised, corrupt, or simply wrong, the transaction is final.
Where it came from
ERC-8183 didn’t emerge from an academic exercise. It came from Virtuals Protocol’s production system.
Virtuals built an internal Agent Commerce Protocol (ACP) that went into public beta in July 2025. By February 2026, Virtuals reports 1.77 million completed jobs and $479 million in “Agentic GDP” through ACP. Those figures are self-reported and haven’t been independently verified on-chain, but the system is live and processing transactions on Base.
When Virtuals approached Davide Crapis, the AI lead at the Ethereum Foundation’s dAI team (announced September 2025), they didn’t just wrap ACP in an EIP. Both teams redesigned the protocol from first principles: strip everything non-essential, make the core composable, delegate complexity to optional extensions. The internal ACP uses five states; ERC-8183 uses four. Different state machines, same thesis.
Here’s what’s notable: Virtuals hasn’t migrated its own ACP to ERC-8183. The internal system and the EIP are running in parallel. No migration date has been announced. The ACP changelogs contain no reference to ERC-8183. The standard was co-authored by three Virtuals engineers, but the production system they run doesn’t use it yet.
BNB Chain announced “the first live implementation of ERC-8183” via their BNBAgent SDKSDKSoftware Development Kit. A collection of code libraries, documentation, and tools that lets developers integrate a service into their applications without writing everything from scratch. SDKs are how projects become easy to build with.Like a plug-and-play kit for building furniture. You don't have to mill your own wood, forge your own screws, or design the joinery from scratch. The kit gives you pre-cut parts and instructions so you can assemble the thing in an afternoon.Read more → on 18 March 2026. They added UMA’s Optimistic Oracle as a dispute layer on top, which is an admission that the base spec’s “reject is final” modelModelA trained neural network that takes inputs (text, images, audio) and produces outputs (more text, classifications, generated content). In DeAI the model is the thing that actually does the work.Like a very experienced apprentice who has spent years watching thousands of masters make furniture. They can't explain how they know when a joint is right, but they can make a chair that looks and functions like a Chippendale. The training is invisible. The output is what matters.Read more → isn’t sufficient for production. No deployment addresses or job metrics were provided.
As of today, zero confirmed jobs have been created under ERC-8183 on any mainnet.
The competing stacks
So we have the crypto-native stack. But crypto isn’t building this alone. Google, OpenAI, Anthropic, and Stripe are all shipping agent commerce protocols at the same time.
The important thing to notice: these aren’t all competing. They operate at different layers. MCP (Anthropic) makes agents informed. A2A (Google) lets agents coordinate. These are plumbing. Neither handles money.
The real competition is at the transaction layer, and it splits into two markets:
Consumer checkout (agent buys from merchant on behalf of a human). OpenAI’s ACP with Stripe versus Google’s UCP with Shopify. ACP is platform-controlled: transactions flow through OpenAI’s catalogue with Stripe as the sole payment processor, at roughly 7.2% combined fees. UCP is merchant-controlled: merchants publish a JSON profile at /.well-known/ucp on their domain, choose their own payment provider, and pay roughly 3.2%. ChatGPT processes approximately 50 million shopping queries per day, which gives ACP distribution. Shopify gives UCP the merchant base. Both Etsy and Shopify support both protocols.
Machine-to-machine payments (agent pays agent, or agent pays API). This is where crypto-native infrastructure has structural advantages. x402 settles in stablecoins with sub-cent fees and two-second finality on Base. No accounts, no KYC, no Stripe. For micropayments under a dollar, the fiat rails don’t work: card processing minimums eat the entire transaction. Stablecoins don’t have minimums.
One analysis frames this as the “mullet economy”: fiat in the front, crypto in the back. The two layers aren’t competing. They’re serving different use cases with different economics.
The Mullet Economy: Fiat Front, Crypto Back
Google’s AP2 protocol is the bridgeBridgeA protocol that lets you move assets from one blockchain to another. Bridges typically lock the asset on the source chain and mint a wrapped version on the destination chain. Bridges are notoriously the most-attacked component in crypto.Like a coat check at a club. You hand over your coat, get a numbered ticket, and the club promises to return the coat when you bring back the ticket. The trust assumption is that the coat check doesn't lose your coat or run away with it.Read more →. It defines “Mandates” (cryptographically signed spending authorisations) that work with both card rails and crypto rails. The x402 extension, built with Coinbase and MetaMask, adds stablecoin payments on top of AP2’s compliance framework. This is the clearest signal that Big Tech sees crypto micropayments as complementary, not threatening.
What’s actually live vs what’s announced
This is where most coverage falls apart. Let me be specific about what’s verified and what’s narrative.
Verified (independently checkable):
- x402: $1.6 million monthly volume after wash filtering (Artemis Analytics). Average daily volume approximately $28,000 (CoinDesk, 11 March 2026). Roughly half of observed transactions are artificial activity.
- ERC-8004: deployed to mainnet 29 January 2026. Chainstack reported 49,283 registrations by mid-February. Registrations are trivially cheap, so the count is a floor on interest, not a measure of active agents.
- ERC-8183: zero confirmed mainnet deployments. Zero jobs created under the standard.
- OpenAI ACP: live in ChatGPT for US users. Transaction volumes not disclosed.
Self-reported (project claims, not independently verified):
- Virtuals ACP: $479 million “Agentic GDP”, 1.77 million completed jobs (Virtuals’ own reporting, cited by Messari)
- x402 ecosystem: 200+ participants listed on x402.org (self-reported directory)
- ERC-8004: up to 130,000 registrations (UEEx estimate, methodology not disclosed)
- An annualised “$600 million” x402 volume figure that circulated from a single Twitter estimate with no disclosed methodology. CoinDesk’s independently sourced $28K daily figure annualises to roughly $10 million, a 60x discrepancy.
Infrastructure before demand. Every build cycle does this. TCP/IP was a spec before the web existed. But if someone tells you the agent commerce economy is here today, ask them for a transaction hash. The gap between announced standards and production usage is wider than the coverage suggests.
Where DeAI projects fit
Most decentralised AIDeAIDecentralised AI. An umbrella term for blockchain-based projects that build AI infrastructure (compute, data, inference, models, agents) without a single central provider controlling the system.Like the difference between streaming a movie from Netflix and sharing it via BitTorrent. Netflix is fast and polished but one company controls what you can watch and what you pay. BitTorrent is messier but no single operator can shut you out.Read more → projects haven’t engaged with this stack at all. That’s the gap.
Phala Network is the only OYM-covered project with a confirmed implementation. Phala launched an ERC-8004 TEETEETrusted Execution Environment. A hardware-secured region of a CPU or GPU where code runs in isolation, so even the machine's operator can't read what's happening inside. TEEs give decentralised AI inference privacy guarantees.Like a bank vault inside a bank. The bank owns the building, staffs the lobby, and runs the security cameras. But what's inside the vault is invisible to everyone, including the bank staff, unless the customer opens it.Read more → Agent on the same day the standard deployed to mainnet (29 January 2026). Their implementation runs AI agents inside Intel TDX enclaves with on-chain identity registered to the ERC-8004 registries. The TEE attestationAttestationA cryptographic proof that a piece of code is running on a specific hardware enclave in an unmodified state. Attestation lets remote users verify that a service is genuinely running what it claims to be running.Like a tamper-evident seal on a medicine bottle. The seal itself doesn't make the medicine safe, but it gives you a way to verify that nobody opened the bottle and swapped the contents before you bought it.Read more → provides hardware-backed proof that the agent is running the code it claims to be running. On 19 March 2026, Phala announced a partnership with Venice AI to provide confidential computingConfidential ComputeHardware-enforced computation where data and code are encrypted in memory and only the authorised application can access them. The machine's operator cannot read what the application is doing even though they own the machine.Like renting space in a bank vault. The bank owns the building and runs the security, but what you put in the vault is invisible even to the bank staff. Only you have the key.Read more → infrastructure for Venice’s inferenceInferenceRunning a trained AI model to produce an answer. Inference is what happens when you type a prompt into ChatGPT and get a response. The model takes your input, computes a best guess, and returns it.Like asking an expert for their opinion. The training was the decades they spent becoming an expert. The inference is the 30 seconds it takes them to answer your specific question.Read more → modes.
The evaluator role in ERC-8183 is the most interesting design space for DeAI. The spec says the evaluator can be any Ethereum address. That’s an open invitation for:
- TEE-attested evaluators (Phala’s territory): evaluate work inside a hardware enclaveEnclaveAn isolated region of CPU or GPU memory protected by hardware. Code and data inside the enclave are inaccessible to the operating system, the hypervisor, or even the machine's physical owner.Like a secure room inside a much larger office building. The building's caretakers have keys to every other room but not this one. What happens inside is invisible to them by design.Read more →, produce an attestation that neither party can forge
- Subnet-based quality scoring (Bittensor’s architecture): validators already score miners’ outputs using stake-weighted consensus. The Yuma Consensus mechanism is structurally similar to what ERC-8183 needs in an evaluator, but no bridge exists
- Decentralised compute verification (Morpheus, Akash): agents that can verify computational work was performed correctly on decentralised infrastructure
None of these integrations exist today. Morpheus, Bittensor, Autonolas, Venice, and ElizaOS have no confirmed ERC-8183 or ERC-8004 implementations. ElizaOS has a community-built x402 plugin and a small API gateway (2 agents, 7 endpoints). That’s it.
This is the gap that should worry DeAI builders. The evaluator role is the one part of the agent commerce stack where decentralised infrastructure has a structural advantage over centralised alternatives. TEEs, stake-weighted consensus, and verifiable compute are exactly what the spec needs. If DeAI projects don’t fill this slot, someone centralised will.
The construction PM’s take
I look at this the way I look at a construction programme. We have the architectural drawings (the specs). We have the material suppliers (Coinbase for x402, Stripe for fiat rails, the EF for standards). We have the site (Ethereum, Base, and the other L2s the contracts are deployed on).
What we don’t have is a building.
Three questions I’d ask if this were a construction project.
Which layer is load-bearing?
Identity (ERC-8004) is the foundation. You can’t have commerce without knowing who you’re dealing with. The validation registry, the most important part of ERC-8004, is explicitly described as “not finished” by the developers building on it. That’s like pouring a foundation without the rebar.
Where’s the critical path?
The evaluator problem in ERC-8183. Escrow is solved. Payments are solved. Identity is mostly solved. But the thing that determines whether the whole system works (can you trust the evaluator?) has no standardised answer. The spec punts it to “application layer.” Every production implementation so far has had to build its own trust mechanism. BNB Chain bolted on UMA’s oracle. Phala uses TEE attestation. Neither is part of the standard. Until a credible, composable evaluator pattern emerges, ERC-8183 is a well-designed escrow system waiting for a trust layer.
Is the timeline realistic?
Google, OpenAI, and the Ethereum Foundation are all building at the same time. That’s unusual. In construction, when three contractors show up at the same site simultaneously, you either get fast progress or chaos. All major protocols are open source, and the crypto standards are designed to compose with the fiat ones: AP2’s x402 extension is the clearest example of that coordination, not collision. But the standards aren’t yet interoperable in practice. An agent with an ERC-8004 identity can’t automatically plug into AP2’s mandate system. An x402 payment can’t settle an ERC-8183 job without custom integration. The plumbing connections between these systems are still being designed.
The agent commerce stack is the most ambitious infrastructure build in crypto since DeFiDeFiDecentralised Finance. Financial services like lending, trading, and yield farming built on smart contracts instead of traditional banks or brokerages. DeFi protocols are usually permissionless and global.Like a vending machine that can give you a loan, swap your currencies, or invest your savings. Nobody is behind the counter, the rules are written into the machine itself, and anyone with money in the right format can use it.Read more →. The difference is that DeFi had immediate product-market fit: people wanted to trade, borrow, and earn yield on-chain from day one. Agent commerce is building infrastructure for a demand curve that doesn’t exist yet. Agents that autonomously hire, evaluate, and pay each other at scale are still speculative.
Fact: Three Ethereum standards now cover payments, identity, and commerce for agents. Google, OpenAI, and the Ethereum Foundation are all shipping at the same time. None of the crypto-native standards have meaningful production usage.
Take: The stack isn’t waiting on engineering. It’s waiting on demand. Autonomous agents that actually need to hire each other at scale don’t exist at the volume required to stress-test any of this. The standards will be ready long before the agents are.
The specs are good. The teams are serious. The demand isn’t here yet.