
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken
Episode Description
New episode with my good friends Sholto Douglas & Trenton Bricken. Sholto focuses on scaling RL and Trenton researches mechanistic interpretability, both at Anthropic.
We talk through what’s changed in the last year of AI research; the new RL regime and how far it can scale; how to trace a model’s thoughts; and how countries, workers, and students should prepare for AGI.
See you next year for v3. Here’s last year’s episode, btw. Enjoy!
Watch on YouTube; listen on Apple Podcasts or Spotify.
----------
SPONSORS
* WorkOS ensures that AI companies like OpenAI and Anthropic don't have to spend engineering time building enterprise features like access controls or SSO. It’s not that they don't need these features; it's just that WorkOS gives them battle-tested APIs that they can use for auth, provisioning, and more. Start building today at workos.com.
* Scale is building the infrastructure for safer, smarter AI. Scale’s Data Foundry gives major AI labs access to high-quality data to fuel post-training, while their public leaderboards help assess model capabilities. They also just released Scale Evaluation, a new tool that diagnoses model limitations. If you’re an AI researcher or engineer, learn how Scale can help you push the frontier at scale.com/dwarkesh.
* Lighthouse is THE fastest immigration solution for the technology industry. They specialize in expert visas like the O-1A and EB-1A, and they’ve already helped companies like Cursor, Notion, and Replit navigate U.S. immigration. Explore which visa is right for you at lighthousehq.com/ref/Dwarkesh.
To sponsor a future episode, visit dwarkesh.com/advertise.
----------
TIMESTAMPS
(00:00:00) – How far can RL scale?
(00:16:27) – Is continual learning a key bottleneck?
(00:31:59) – Model self-awareness
(00:50:32) – Taste and slop
(01:00:51) – How soon to fully autonomous agents?
(01:15:17) – Neuralese
(01:18:55) – Inference compute will bottleneck AGI
(01:23:01) – DeepSeek algorithmic improvements
(01:37:42) – Why are LLMs ‘baby AGI’ but not AlphaZero?
(01:45:38) – Mech interp
(01:56:15) – How countries should prepare for AGI
(02:10:26) – Automating white collar work
(02:15:35) – Advice for students
Get full access to Dwarkesh Podcast at www.dwarkesh.com/subscribe