[Linkpost] “Where’s my ten minute AGI?” by Vasco Grilo🔸
EA Forum Podcast (All audio) - A podcast by EA Forum Team

This is a link post. This is a crosspost for Where's my ten minute AGI? by Anson Ho, which was originally published on Gradient Updates on 2 May 2025.

Recently, METR released a paper arguing that the length of tasks that AIs can do is doubling every 7 months. We can see this in the following graph, where the best AI system[1] is able to do roughly hour-long tasks at a 50% success rate on average:

METR's research finds that AIs are rapidly able to do longer and longer tasks, where length is measured by the time it takes for a human with requisite expertise to do the task. But there's a big problem here: if AIs are actually able to perform most tasks on 1-hour task horizons, why don’t we see more real-world task automation? For example, most emails take less than an hour to write, but [...]

---

Outline:

(01:59) 1. Time-horizon estimates are very domain-specific
(04:54) 2. Task reliability strongly influences task horizons
(08:12) 3. Real-world tasks are bundled together and hard to separate out
(10:57) Discussion

The original text contained 9 footnotes which were omitted from this narration.

---

First published: May 19th, 2025

Source: https://forum.effectivealtruism.org/posts/5fA7CXc4WK2nTCz3o/where-s-my-ten-minute-agi

Linkpost URL: https://epoch.ai/gradient-updates/where-is-my-ten-minute-agi

---

Narrated by TYPE III AUDIO.