George Hotz | Programming | RL is dumb and doesn't work | Reinforcement Learning LunarLander Part 2 Published 2024-01-10 Download video MP4 360p Download video MP4 720p Recommendations 1:59:37 George Hotz: Comma.ai, OpenPilot, and Autonomous Vehicles | Lex Fridman Podcast #31 08:40 AI Learns to Walk (deep reinforcement learning) 19:52 USENIX Enigma 2016 - Timeless Debugging 22:08 Biomedical Scientist Answers Pseudoscience Questions From Twitter | Tech Support | WIRED 4:17:10 George Hotz | Programming | what is the Q* algorithm? OpenAI Q Star Algorithm | Mistral 7B | PRM800K 21:14 The first evidence for PLUNGING REGIONS around black holes (Einstein was right!) ft. Dr Andy Mummery 2:37:52 George Hotz | Programming | Mistral mixtral on a tinybox | AMD P2P multi-GPU mixtral-8x7b-32kseqlen 32:37 What Jumping Spiders Teach Us About Color 1:52:52 George Hotz | Just Chatting | effective accelerationism | e/acc | Techno-pessimism | Deceleration 55:59 Jailbreaking the Simulation with George Hotz | SXSW 2019 06:52 Meet the 26-Year-Old Hacker Who Built a Self-Driving Car... in His Garage 3:23:26 George Hotz | Researching | same thing we do every weekend, documenting the AMD 7900XTX | Part 1 22:24 2 Years Of Learning C | Prime Reacts 17:33 Programming Meme Review with George Hotz 38:24 Proximal Policy Optimization (PPO) - How to train Large Language Models 3:33:20 George Hotz | Researching | Chatting | rewriting the linearizer (tinygrad) | TVM | People | Part 1 5:16:43 George Hotz | Programming | OpenCL on a Galaxy Z Fold 5 | Android | IOCTL | GPU | arm64 | part 1 1:28:13 RL Course by David Silver - Lecture 1: Introduction to Reinforcement Learning 18:19 Reinforcement Learning, by the Book Similar videos 7:57:31 George Hotz | Programming | Decision Transformer Reinforcement Learning (RL) | LunarLander | Part 1 6:47:22 George Hotz | Programming | Can MuZero play Tic Tac Toe? | Part1 | DeepMind AI 19:50:06 George Hotz | Programming | Fun with MuZero and MCTS on a lovely Sunday | CartPole | DeepMind AI 12:01 Truth? Eh. Keep it Simple Stupid: MuZero More results