DeepSeek R1 Explained to your grandma

1.231.476 Lượt nghe

00:00

Update Required To play the media you will need to either update your browser to a recent version or update your Flash plugin.

Tải MP3

MÔ TẢ MP3TIẾP THEO

DeepSeek R1 Explained to your grandma

Describing the key insights from the DeepSeek R1 paper in a way even your grandma could understand. I focus on the key concepts of chain of thought reasoning, reinforcement learning, and model distillation.

Check out my other DeepSeek video on their latest model Janus Pro 7B:
https://www.youtube.com/watch?v=B0ex13QIYpA

Paper:
https://github.com/deepseek-ai/DeepSeek-V3/blob/main/DeepSeek_V3.pdf

Ollama link for local use:
https://ollama.com/library/deepseek-r1

0:00 Introduction
0:43 Chain of Thought
1:33 Reinforcement Learning
3:53 Group Relative Policy Optimization
6:26 Distillation

#deepseek #ai #largelanguagemodels					

DeepSeek R1 Explained to your grandma

Nhạc Theo Chủ Đề

Liên kết website