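PANews, February 27: According to Cointelegraph, open-source AI lab Sentient has announced the launch of Arena, a production-grade testing environment for evaluating how AI agents perform in enterprise workflows. Pantera Capital and Franklin Templeton's digital assets division have joined Arena's first testing cohort.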
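Sentient says Arena is not a static model benchmark. Instead, it runs AI agents through standardized tasks under simulated enterprise conditions that include long documents, incomplete information, and conflicting sources. The platform tracks failure categories such as hallucinations, missing evidence, citation errors, and reasoning gaps to help developers diagnose problems. Arena plans to publish comparative performance metrics through a public leaderboard, along with test reports summarizing common failure modes and their fixes.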