Long Review: Apple's MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Published 2024-03-31