George Hotz | Programming | RL is dumb and doesn't work | Reinforcement Learning LunarLander Part 2

Published 2024-01-10
