Reading Group (+๐ง): Agents' Last Exam
About This Event
Join the Snorkel AI Reading Group, a recurring forum to explore the latest frontier developments in AI while building meaningful connections within the community.
In this afternoon session, Yiyou Sun and Xinyang Han, Postdoctoral Researchers at UC Berkeley, will cover their recent paper: Agents' Last Exam.
Agenda:
4 pm - doors open
4:30 pm - talk begins
๐ง๐ง๐ง Boba tea and other refreshments will be provided ! ๐ง๐ง๐ง
Among other things, you'll learn:
ALE is a benchmark designed to evaluate AI agents on long-horizon, economically valuable, real-world tasks with verifiable outcomesโdeveloped in collaboration with 250+ industry experts and covering 1,000+ tasks across 55 subfields in 13 industry clusters.
Widely-used benchmarks lack sustained performance measurement on real, economically valuable workflows, creating a systematic gap between benchmark success and meaningful deployment across professional domains.
ALE grounds task coverage in O*NET / SOC 2018, the U.S. federal occupational taxonomy, ensuring systematic, reproducible coverage of non-physical job categories at scale.
The hardest task tier remains far from saturatedโacross mainstream harness and backbone configurations, the average full pass rate is just 2.6%, underscoring the substantial headroom that remains.
ALE's task pool grows continuously as new workflows and industries are onboarded, enabling longitudinal tracking of agent capabilities rather than one-time snapshot comparisons.
ALE is intended not merely as another leaderboard, but as an instrument for closing the gap between benchmark performance and GDP-relevant economic impact.
Agents' Last Exam is a collaboration between UC Berkeley's RDI (Center for Responsible Decentralized Intelligence), Snorkel AI, and 250+ industry experts across academia and industry.
Location
๐ 101 Second Street, San Francisco, CA 94105, USA
Get a free growth analysis for your company
See how your website, messaging, and go-to-market strategy stack up, in minutes.
Get My Free AnalysisMore SF Events You Might Like
AI Engineer World's Fair
The premier industry gathering for AI engineers, offering unparalleled access to the leading edge of...
-1 to Snowflake with Sridhar Ramaswamy
A marquee event featuring the CEO of Snowflake at a top-tier technical community, essential for seri...
Agentic AI Summit
A premier summit with an elite speaker list from top labs and funds, essential for the AI engineerin...