CMU Advanced NLP Spring 2025 (20): Advanced Post-Training

1.044 Lượt nghe

00:00

Update Required To play the media you will need to either update your browser to a recent version or update your Flash plugin.

Tải MP3

MÔ TẢ MP3TIẾP THEO

CMU Advanced NLP Spring 2025 (20): Advanced Post-Training

This lecture (by Sean Welleck) for CMU CS 11-711, Advanced NLP covers:
- Supervised Fine-tuning
- Reward Modeling
- Reinforcement Learning
- Direct Preference Optimization					

CMU Advanced NLP Spring 2025 (20): Advanced Post-Training

Nhạc Theo Chủ Đề

Liên kết website