Reinforcement Learning from Human Feedback: From Zero to chatGPT
Published 2022-12-13Download video
Recommendations
-
1:16:15 Stanford CS224N | 2023 | Lecture 10 - Prompting, Reinforcement Learning from Human Feedback
-
59:48 [1hr Talk] Intro to Large Language Models
-
30:10 The Inside Story of ChatGPT’s Astonishing Potential | Greg Brockman | TED
-
3:13:14 Prompt Engineering for Web Devs - ChatGPT and Bard Tutorial
-
1:03:32 John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges
-
16:51 A.I. Learns to Drive From Scratch in Trackmania
-
3:53:53 Machine Learning for Everybody – Full Course
-
42:40 State of GPT | BRK216HFS
-
38:24 Proximal Policy Optimization (PPO) - How to train Large Language Models
-
3:15:38 What is ChatGPT doing...and why does it work?
-
3:50:57 How Deep Neural Networks Work - Full Course for Beginners
-
2:31:27 WSU: Space, Time, and Einstein with Brian Greene
-
1:19:27 Stanford CS25: V3 I Retrieval Augmented Language Models
-
3:36:55 Kubernetes Tutorial for Beginners [FULL COURSE in 4 Hours]
-
17:07 LoRA explained (and a bit about precision and quantization)
-
3:01:58 Reinforcement Learning in 3 Hours | Full Course using Python
-
15:04 How I'd Learn AI (If I Had to Start Over)
-
16:01 Reinforcement Learning with sparse rewards
Similar videos
-
1:00:38 Reinforcement Learning from Human Feedback From Zero to ChatGPT [Record of the live]
-
10:48 RLHF+CHATGPT: What you must know
-
13:43 How ChatGPT is Trained
-
06:31 Reinforcement Learning: ChatGPT and RLHF
-
12:38 Reinforcement Learning from Human Feedback (RLHF)
-
15:53 ChatGPT and Reinforcement Learning
-
03:34 What is Reinforcement Learning with Human Feedback (RLHF) ?
-
02:50 Learn about Reinforcement Learning from Human Feedback - ChatGPT / RLHF HuggingFace Course
-
1:11:49 RLHF - Reinforcement Learning with Human Feedback
-
1:01:01 Mastering RLHF with AWS: A Hands-on Workshop on Reinforcement Learning from Human Feedback
-
59:17 RLHF: How to Learn from Human Feedback with Reinforcement Learning
-
2:14:29 How ChatGPT works - From Transformers to Reinforcement Learning with Human Feedback (RLHF)
-
26:28 10 minutes paper (episode 20); InstructGPT
-
47:16 Nathan Lambert - Reinforcement Learning from Human Feedback @ UCL DARK
-
11:59 How ChatGPT is Trained - model and training explained
-
1:00:02 What is RLHF?