
OpenAI has launched a new benchmark that evaluates how well different AI models detect, patch, and even exploit security vulnerabilities found in crypto smart contracts.
The company said it is becoming increasingly important to evaluate the performance of AI agents in “economically meaningful environments” as their adoption grows.
OpenAI released the paper, “EVMbench: Evaluating AI Agents on Smart Contract Security,” on Wednesday in collaboration with crypto investment firm Paradigm and crypto security firm OtterSec. The benchmark measures how much value AI agents could theoretically extract by exploiting 120 smart contract vulnerabilities.
Anthropic’s Claude Opus 4.6 came out on top with an average “detect reward” of $37,824, followed by OpenAI’s GPT-5.2 and Google’s Gemini 3 Pro at $31,623 and $25,112, respectively.
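A dollar-denominated score like this can be read as an average over per-vulnerability payouts. The sketch below is a hypothetical illustration of that kind of aggregation, not EVMbench’s actual scoring code; the function, field names, and figures are assumptions made for the example.

```python
from statistics import mean

# Hypothetical per-vulnerability results: each entry records the dollar value
# an agent captured on one of the benchmark's vulnerabilities.
# Names and numbers are illustrative, not taken from the EVMbench paper.
results = [
    {"vulnerability": "reentrancy-001", "detect_reward_usd": 52_000.0},
    {"vulnerability": "overflow-017", "detect_reward_usd": 0.0},  # missed entirely
    {"vulnerability": "access-control-042", "detect_reward_usd": 61_472.0},
]

def average_detect_reward(runs: list[dict]) -> float:
    """Average dollar reward across all attempted vulnerabilities,
    counting misses as $0 so harder tasks still weigh on the score."""
    return mean(run["detect_reward_usd"] for run in runs)

print(f"Average detect reward: ${average_detect_reward(results):,.2f}")
```

Averaging over every task, including misses, is what lets a single dollar figure compare models that succeed on different subsets of the vulnerabilities.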

