DeepSeek R1 Explained to your grandma

DeepSeek R1 Explained to your grandma

1.231.476 Lượt nghe
DeepSeek R1 Explained to your grandma
Describing the key insights from the DeepSeek R1 paper in a way even your grandma could understand. I focus on the key concepts of chain of thought reasoning, reinforcement learning, and model distillation. Check out my other DeepSeek video on their latest model Janus Pro 7B: https://www.youtube.com/watch?v=B0ex13QIYpA Paper: https://github.com/deepseek-ai/DeepSeek-V3/blob/main/DeepSeek_V3.pdf Ollama link for local use: https://ollama.com/library/deepseek-r1 0:00 Introduction 0:43 Chain of Thought 1:33 Reinforcement Learning 3:53 Group Relative Policy Optimization 6:26 Distillation #deepseek #ai #largelanguagemodels