Background

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Dwarkesh Podcast22 de mayo de 20258641
Compartir episodio:Descargar

Descripción del Episodio

New episode with my good friends Sholto Douglas & Trenton Bricken. Sholto focuses on scaling RL and Trenton researches mechanistic interpretability, both at Anthropic.

We talk through what’s changed in the last year of AI research; the new RL regime and how far it can scale; how to trace a model’s thoughts; and how countries, workers, and students should prepare for AGI.

See you next year for v3. Here’s last year’s episode, btw. Enjoy!

Watch on YouTube; listen on Apple Podcasts or Spotify.

----------

SPONSORS

* WorkOS ensures that AI companies like OpenAI and Anthropic don't have to spend engineering time building enterprise features like access controls or SSO. It’s not that they don't need these features; it's just that WorkOS gives them battle-tested APIs that they can use for auth, provisioning, and more. Start building today at workos.com.

* Scale is building the infrastructure for safer, smarter AI. Scale’s Data Foundry gives major AI labs access to high-quality data to fuel post-training, while their public leaderboards help assess model capabilities. They also just released Scale Evaluation, a new tool that diagnoses model limitations. If you’re an AI researcher or engineer, learn how Scale can help you push the frontier at scale.com/dwarkesh.

* Lighthouse is THE fastest immigration solution for the technology industry. They specialize in expert visas like the O-1A and EB-1A, and they’ve already helped companies like Cursor, Notion, and Replit navigate U.S. immigration. Explore which visa is right for you at lighthousehq.com/ref/Dwarkesh.

To sponsor a future episode, visit dwarkesh.com/advertise.

----------

TIMESTAMPS

(00:00:00) – How far can RL scale?

(00:16:27) – Is continual learning a key bottleneck?

(00:31:59) – Model self-awareness

(00:50:32) – Taste and slop

(01:00:51) – How soon to fully autonomous agents?

(01:15:17) – Neuralese

(01:18:55) – Inference compute will bottleneck AGI

(01:23:01) – DeepSeek algorithmic improvements

(01:37:42) – Why are LLMs ‘baby AGI’ but not AlphaZero?

(01:45:38) – Mech interp

(01:56:15) – How countries should prepare for AGI

(02:10:26) – Automating white collar work

(02:15:35) – Advice for students



Get full access to Dwarkesh Podcast at www.dwarkesh.com/subscribe

Episodios Recientes

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Más podcasts de Sociedad y Cultura

Ver toda la categoría →
Dumb Blonde

Dumb Blonde

By shows

<p>Asking the questions others are afraid to. Bunnie XO host of the Dumb Blonde podcast – the ultimate destination for comedy, trending and lifestyle. Get ready to dive into hilarious discussions about relationships, trauma, embarrassing moments, and all the realness life throws at us. Join Bunnie every week to laugh, relate, and embrace your inner healing.</p>

The Snare

The Snare

By shows

In 1996, 18-year-old Angie Dodge is found brutally murdered in her Idaho Falls home. Police zero in on a suspect and put a man behind bars. But as the years pass, doubts emerge about whether the real killer was ever caught. Leading the fight for answers is an unlikely advocate: Angie’s own mother, who embarks on a decades-long mission to uncover the truth. A six-part series from 20/20 and ABC Audio, hosted by Maggie Rulli. New episodes Tuesdays.

Joy 101 with Hoda Kotb

Joy 101 with Hoda Kotb

By shows

<p>Joy is essential.</p> <p>And it's also elusive. You can't order it, borrow it, or simply hope it into life.</p> <p>But now, there's a new and exciting way to start your journey toward a more joyful existence: The Joy 101 Podcast with Hoda!</p> <p>Best known for her Emmy-winning work and co-anchoring&nbsp;<em>Today,</em> Hoda Kotb infuses her authenticity, curiosity, and warmth into conversations with the world&rsquo;s most fascinating people. Entertainment legends, sport icons, wellness experts, and everyday folks will share how they&nbsp;find, allow, and experience joy.&nbsp;Hoda will offer her own tips and takes on seeking a more balanced, harmonious life.&nbsp;</p> <p>If you're craving inspiration, support, and useful tools to maximize your joy, tune in to these candid, uplifting, and moving on-air chats.</p> <p>Joy after a breakup, joy as an empty-nester, joy after loss, joy as a caretaker &mdash; Hoda's new podcast will speak to you.</p> <p>Joy 101 with Hoda Kotb, an iHeartPodcast.</p>

La Silla: On The Record

La Silla: On The Record

By shows

Cada semana contamos movidas de poder en Colombia a través de la voz de sus protagonistas. Un podcast de La Silla Podcasts.

Dinero Más Inteligente

Dinero Más Inteligente

By shows

El dinero no solo se gana, se entiende. <br /> En El Dinero Más Inteligente, Valeria Ovalle presenta la economía y Juan Carlos Herrera la conecta al mundo de inversiones. Una conversación entre razón y estrategia para entender el mundo financiero sin complicaciones. <br /> By GBM <br /> Síguenos en <b><a href="https://www.instagram.com/gbmplus_?igsh=czBzOG5mazBwMmlj&amp;utm_source=qr">Instagram</a><b>.</b></b>

Lo que NO se habla con Fer Flores

Lo que NO se habla con Fer Flores

By shows

Contenidos para construir una Humanidad con H mayúscula con perspectiva psicoanalista, médica, legal, antropológica y espiritual.

After the Whistle with Brendan Hunt and Rebecca Lowe

After the Whistle with Brendan Hunt and Rebecca Lowe

By shows

<p>Rebecca Lowe (Fox Sports) and Brendan Hunt (‘Ted Lasso’) are teaming up again to take on the 2026 World Cup! They’ll ride an emotional roller coaster together as 48 teams play 104 action-packed matches across the U.S., Canada, and Mexico. They’ll bring you all the joy and the drama, the hope and the heartbreak — and help you understand the matchups and personalities that will make this the biggest sporting and cultural event of our lifetimes</p><p>‘After the Whistle With Brendan Hunt and Rebecca Lowe’ is an Apple News Original podcast presented by Verizon.</p>

熊熊翻唱记录(AI周棋洛)

熊熊翻唱记录(AI周棋洛)

By shows

手游恋与制作人-周棋洛AI翻唱,仅娱乐用/随缘更新