Toggle navigation
Video
♫ Thôn Quê
♫ Sông Đáy
♫ Liên Khúc
♫ Nhạc Đám Cưới
♫ Nonstop Việt
♫ Không Lời
♫ Nhạc Vàng Trữ Tình
♫ Nhạc Trẻ
CMU Advanced NLP Fall 2024 (8): Reinforcement Learning and Human Feedback
Graham Neubig
1.597 Lượt nghe
Prev
play
stop
Next
mute
max volume
00:00
00:00
repeat
Update Required
To play the media you will need to either update your browser to a recent version or update your
Flash plugin
.
Tải MP3
MÔ TẢ MP3
TIẾP THEO
CMU Advanced NLP Fall 2024 (8): Reinforcement Learning and Human Feedback
This lecture (by Graham Neubig) for CMU CS 11-711, Advanced NLP (Fall 2024) covers: * Methods to Gather Feedback * Error and Risk * Reinforcement Learning * Stabilizing Reinforcement Learning Class Site: https://phontron.com/class/anlp-fall2024/
Những bài liên quan
1:16:16
CMU Advanced NLP Fall 2024 (7): Prompting and Complex Reasoning
4 N
Graham Neubig
1:04:21
CMU Advanced NLP Fall 2024 (11): Distillation, Quantization, and Pruning
1.4 N
Graham Neubig
1:16:31
CMU Advanced NLP Spring 2025 (11): Reinforcement Learning
2.4 N
Sean Welleck
1:16:48
Randy Pausch's Last Lecture - Remastered
35.6 N
Carnegie Mellon University
1:18:40
S2024 #06 - Vectorized Query Execution Using SIMD (CMU Advanced Database Systems)
5 N
CMU Database Group
1:17:54
CMU Advanced NLP Fall 2024 (10): Retrieval and RAG
1.9 N
Graham Neubig
1:17:22
CMU Advanced NLP Fall 2024 (9): Experimental Design and Data Annotation
978
Graham Neubig
1:22:32
Optimal Control (CMU 16-745) 2024 Lecture 1: Intro and Dynamics Review
13.4 N
CMU Robotic Exploration Lab
1:15:17
CMU Advanced NLP Spring 2025 (9): Fine-tuning
1.2 N
Sean Welleck
1:09:15
Multimodal AI Agents with Ruslan Salakhutdinov
1.5 N
Kempner Institute at Harvard University
1:16:45
CMU Advanced NLP Fall 2024 (5): Pre-training and Pre-trained Models
3.2 N
Graham Neubig
1:09:10
CMU Advanced NLP Spring 2025 (16): Parallelism and Scaling
1.3 N
Sean Welleck
1:06:13
CMU Advanced NLP Fall 2024 (6): Instruction Tuning
2.1 N
Graham Neubig
1:12:13
CMU Advanced NLP Spring 2025 (5): Attention and Transformers
1.5 N
Sean Welleck
1:14:44
CMU Advanced NLP Fall 2024 (22): From Decoding to Meta Generation Inference Time Algorithms for LMs
1.1 N
Graham Neubig
1:11:05
AI has rewired my brain
21.5 N
Theo - t3․gg
1:09:45
CMU Advanced NLP Fall 2024 (15): Tool Use and LLM Agent Basics
2.9 N
Graham Neubig
1:07:33
CMU Advanced NLP Fall 2024 (12): Domain Specific Modeling: Code and Math
947
Graham Neubig
1:11:49
3. Induction and Recursion | CMU Principles of Functional Programming M23
2.4 N
Brandon Wu
Nhạc Theo Chủ Đề
Nhạc Không Lời
Nhạc Vàng HOT
Nhạc Liên Khúc
Nhạc DJ HOT
Nhạc Hà Nam
Nhạc Vĩnh Yên
Nhạc Hưng Yên
Nhạc Hải Dương
Nhạc Hà Tây
Nhạc Sông Đáy
LK Nhạc Vàng
LK Nhạc Trẻ
Liên kết website