Toggle navigation
Video
♫ Thôn Quê
♫ Sông Đáy
♫ Liên Khúc
♫ Nhạc Đám Cưới
♫ Nonstop Việt
♫ Không Lời
♫ Nhạc Vàng Trữ Tình
♫ Nhạc Trẻ
CMU Advanced NLP Spring 2025 (20): Advanced Post-Training
Sean Welleck
1.044 Lượt nghe
Prev
play
stop
Next
mute
max volume
00:00
00:00
repeat
Update Required
To play the media you will need to either update your browser to a recent version or update your
Flash plugin
.
Tải MP3
MÔ TẢ MP3
TIẾP THEO
CMU Advanced NLP Spring 2025 (20): Advanced Post-Training
This lecture (by Sean Welleck) for CMU CS 11-711, Advanced NLP covers: - Supervised Fine-tuning - Reward Modeling - Reinforcement Learning - Direct Preference Optimization
Những bài liên quan
59:39
CMU Advanced NLP Spring 2025 (21): Multimodal Modeling I
379
Sean Welleck
1:09:10
CMU Advanced NLP Spring 2025 (16): Parallelism and Scaling
1.3 N
Sean Welleck
1:16:31
CMU Advanced NLP Spring 2025 (11): Reinforcement Learning
2.4 N
Sean Welleck
23:16
DeepSeek's GRPO (Group Relative Policy Optimization) | Reinforcement Learning for LLMs
11.1 N
Julia Turc
24:19
Context Is The Next Frontier by Jacob Buckman, CEO of Manifest AI
5.8 N
Democratize Intelligence
35:35
Q-Learning: Model Free Reinforcement Learning and Temporal Difference Learning
128.8 N
Steve Brunton
1:16:16
CMU Advanced NLP Fall 2024 (7): Prompting and Complex Reasoning
4 N
Graham Neubig
1:15:17
CMU Advanced NLP Spring 2025 (9): Fine-tuning
1.3 N
Sean Welleck
1:04:31
CMU Advanced NLP Spring 2025 (19): Efficient Inference
809
Sean Welleck
33:01
The Man Who Almost Broke Math (And Himself...)
9.4 Tr
Veritasium
1:16:51
CMU Advanced NLP Spring 2025 (17): Long-Context Models
487
Sean Welleck
18:40
But what is a neural network? | Deep learning chapter 1
19.4 Tr
3Blue1Brown
11:09
MAZUREK sprawdził, że nie wiedzą nic o polityce przed wyborami!
211.4 N
MaturaToBzdura
24:22
Group Relative Policy Optimization (GRPO) - Formula and Code
15.6 N
Deep Learning with Yacine
28:49
MACIEJ MACIAK: CAŁA PRAWDA
587 N
Kanał Zero
18:17
Reinforcement Learning for Agents - Will Brown, ML Researcher at Morgan Stanley
53.1 N
AI Engineer
27:14
Transformers (how LLMs work) explained visually | DL5
6.1 Tr
3Blue1Brown
1:12:13
CMU Advanced NLP Spring 2025 (5): Attention and Transformers
1.5 N
Sean Welleck
1:15:41
CMU Advanced NLP Spring 2025 (18): Advanced Inference Strategies
867
Sean Welleck
19:39
RLHF & DPO Explained (In Simple Terms!)
9.3 N
Entry Point AI
Nhạc Theo Chủ Đề
Nhạc Không Lời
Nhạc Vàng HOT
Nhạc Liên Khúc
Nhạc DJ HOT
Nhạc Hà Nam
Nhạc Vĩnh Yên
Nhạc Hưng Yên
Nhạc Hải Dương
Nhạc Hà Tây
Nhạc Sông Đáy
LK Nhạc Vàng
LK Nhạc Trẻ
Liên kết website