Loading video player...
In this video, I test Inception's new Mercury 2, a diffusion-based large language model that introduces reasoning capabilities and generates text at 1,000 tokens per second. I demonstrate its speed and instruction-following through coding tests, and then evaluate its practical utility by integrating it into a real-time voice assistant and my open-source RAG agent. LINKS: https://www.inceptionlabs.ai/ My Dictation App: www.whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/rag Signup for Newsletter, localgpt: https://tally.so/r/3y9bb0 Let's Connect: 🦾 Discord: https://discord.com/invite/t4eYQRUcXB ☕ Buy me a Coffee: https://ko-fi.com/promptengineering |🔴 Patreon: https://www.patreon.com/PromptEngineering 💼Consulting: https://calendly.com/engineerprompt/consulting-call 📧 Business Contact: engineerprompt@gmail.com Become Member: http://tinyurl.com/y5h28s6h 💻 Pre-configured localGPT VM: https://bit.ly/localGPT (use Code: PromptEngineering for 50% off). Signup for Newsletter, localgpt: https://tally.so/r/3y9bb0 #Gemini3.1 #GoogleAI #Antigravity #AIStudio #VibeCoding #LLMBenchmarks #TechNews #SoftwareDevelopment #artificialintelligence