Reinforcement Learning from Human Feedback: From Zero to chatGPT

Published 2022-12-13

Download video MP4 360p

Recommendations

1:16:15

Stanford CS224N | 2023 | Lecture 10 - Prompting, Reinforcement Learning from Human Feedback
59:48

[1hr Talk] Intro to Large Language Models
30:10

The Inside Story of ChatGPT’s Astonishing Potential | Greg Brockman | TED
3:13:14

Prompt Engineering for Web Devs - ChatGPT and Bard Tutorial
1:03:32

John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges
16:51

A.I. Learns to Drive From Scratch in Trackmania
3:53:53

Machine Learning for Everybody – Full Course
42:40

State of GPT | BRK216HFS
38:24

Proximal Policy Optimization (PPO) - How to train Large Language Models
3:15:38

What is ChatGPT doing...and why does it work?
3:50:57

How Deep Neural Networks Work - Full Course for Beginners
2:31:27

WSU: Space, Time, and Einstein with Brian Greene
1:19:27

Stanford CS25: V3 I Retrieval Augmented Language Models
3:36:55

Kubernetes Tutorial for Beginners [FULL COURSE in 4 Hours]
17:07

LoRA explained (and a bit about precision and quantization)
3:01:58

Reinforcement Learning in 3 Hours | Full Course using Python
15:04

How I'd Learn AI (If I Had to Start Over)
16:01

Reinforcement Learning with sparse rewards

Similar videos

1:00:38

Reinforcement Learning from Human Feedback From Zero to ChatGPT [Record of the live]
10:48

RLHF+CHATGPT: What you must know
13:43

How ChatGPT is Trained
06:31

Reinforcement Learning: ChatGPT and RLHF
12:38

Reinforcement Learning from Human Feedback (RLHF)
15:53

ChatGPT and Reinforcement Learning
03:34

What is Reinforcement Learning with Human Feedback (RLHF) ?
02:50

Learn about Reinforcement Learning from Human Feedback - ChatGPT / RLHF HuggingFace Course
1:11:49

RLHF - Reinforcement Learning with Human Feedback
1:01:01

Mastering RLHF with AWS: A Hands-on Workshop on Reinforcement Learning from Human Feedback
59:17

RLHF: How to Learn from Human Feedback with Reinforcement Learning
2:14:29

How ChatGPT works - From Transformers to Reinforcement Learning with Human Feedback (RLHF)
26:28

10 minutes paper (episode 20); InstructGPT
47:16

Nathan Lambert - Reinforcement Learning from Human Feedback @ UCL DARK
11:59

How ChatGPT is Trained - model and training explained
1:00:02

What is RLHF?
More results