Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

63.404 Lượt nghe

00:00

Update Required To play the media you will need to either update your browser to a recent version or update your Flash plugin.

Tải MP3

MÔ TẢ MP3TIẾP THEO

Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

This week I welcome two of the most important technologists in any field.

Jeff Dean is Google's Chief Scientist, and through 25 years at the company, has worked on basically the most transformative systems in modern computing: from MapReduce, BigTable, Tensorflow, AlphaChip, to Gemini.

Noam Shazeer invented or co-invented all the main architectures and techniques that are used for modern LLMs: from the Transformer itself, to Mixture of Experts, to Mesh Tensorflow, to Gemini and many other things. 

We talk about their 25 years at Google, going from PageRank to MapReduce to the Transformer to MoEs to AlphaChip – and soon to ASI.

Sponsors
* Meter wants to radically improve the digital world we take for granted. They’re developing a foundation model that automates network management end-to-end. To do this, they just announced a long-term partnership with Microsoft for tens of thousands of GPUs, and they’re recruiting a world class AI research team. To learn more, go to https://meter.com/dwarkesh.

* Scale partners with major AI labs like Meta, Google Deepmind, and OpenAI. Through Scale’s Data Foundry, labs get access to high-quality data to fuel post-training, including advanced reasoning capabilities. If you’re an AI researcher or engineer, learn about how Scale’s Data Foundry and research lab, SEAL, can help you go beyond the current frontier at https://scale.com/dwarkesh.

* Curious how Jane Street teaches their new traders? They use Figgie, a rapid-fire card game that simulates the most exciting parts of markets and trading. It’s become so popular that Jane Street hosts an inter-office Figgie championship every year. Download from the app store or play on your desktop at https://www.figgie.com/.

Advertisers
To sponsor a future episode, visit: https://www.dwarkeshpatel.com/p/advertise

Timestamps
00:00 - Intro
03:29 - Joining Google in 1999
06:20 - Future of Moore's Law
11:04 - Future TPUs
13:56 - Jeff’s undergrad thesis: parallel backprop
15:54 - LLMs in 2007 
25:09 - “Holy shit” moments
27:28 - AI fulfills Google’s original mission
32:00 - Doing Search in-context
36:12 - The internal coding model
37:29 - What will 2027 models do?
43:20 - A new architecture every day?
49:10 - Automated chips and intelligence explosion
53:07 - Future of inference scaling
02:38 - Already doing multi-datacenter runs
08:15 - Debugging at scale
12:41 - Fast takeoff and superalignment
20:51 - A million evil Jeff Deans
24:22 - Fun times at Google 
27:51 - World compute demand in 2030
34:37 - Getting back to modularity
44:48 - Keeping a giga-MoE in-memory
49:35 - All of Google in one model
57:59 - What’s missing from distillation 
03:10 - Open research, pros and cons
09:58 - Going the distance					

Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

Nhạc Theo Chủ Đề

Liên kết website

Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

Những bài liên quan

Chưa có bài liên quan nào!