Frozen AI News archive

Nvidia buys (most of) Groq for $20B cash; largest execuhire ever

**Groq**'s leadership team is joining **Nvidia** under a "non-exclusive licensing agreement" in a deal valued at **$20 billion cash**, a major acquisition in the AI chip space even though Nvidia states it is not acquiring Groq as a company. Jensen Huang plans to integrate Groq's low-latency processors into the NVIDIA AI factory architecture to serve a broader range of AI inference and real-time workloads. Twitter highlights include **Gemini** used as a consumer utility for calorie tracking, OpenAI discussing the "deployment gap" in model usage across healthcare and business, and Tesla's FSD v14 described as a "Physical Turing Test" for consumer AI. **Epoch AI** notes benchmarking challenges, emphasizing provider variance and integration issues that affect model quality measurement. Discussions of coding agents and developer experience convergence continue in the AI community.


Execuhires are back!

AI News for 12/24/2025-12/25/2025. We checked 12 subreddits, 544 Twitters and 24 Discords (208 channels, and 5086 messages) for you. Estimated reading time saved (at 200wpm): 346 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Execuhires first appeared in Aug 2024 and returned in Jun 2025, but it seems Christmas Eve 2025 isn't too late for a hat-trick. In a 5-sentence post, Groq confirmed its "non-exclusive licensing agreement" with Nvidia, under which most of Groq's leadership team will join Nvidia for a reported total consideration of $20 billion cash. GroqCloud stays behind, and the current CFO will become the CEO of the old Groq.

It's an acquisition in everything but name, made interesting by a few other facts: Groq was last valued at $6.9B in Sept and says Nvidia came inbound to Groq. Nvidia's previous largest acquisition was the 2019 purchase of Mellanox for $7B, yet this deal is only 1/3 of Nvidia's cash war chest.

Jensen's quote is the most concrete detail we have on future plans:

“We plan to integrate Groq’s low-latency processors into the NVIDIA AI factory architecture, extending the platform to serve an even broader range of AI inference and real-time workloads,” Huang wrote.

Huang added that, “While we are adding talented employees to our ranks and licensing Groq’s IP, we are not acquiring Groq as a company.”

That's all we know, but in the semis world this is earth-shaking, not least for hopeful Nvidia competitors.


AI Twitter Recap

Top tweets (by engagement)


Benchmarking and Evaluation: Provider Variance, Harness Bugs, and “What Even Is a Score?”


Coding Agents, Agent Packaging, and Developer Experience (DX) Convergence


Open Models and the “Inference Distribution Layer”: MiniMax M2.1, GLM-4.7, Qwen Image Edit


Training & Research Notes: RL for Agents, Pretraining Tricks, and Representation/Attention Fixes


Robotics, Autonomy, and “Physical Turing Test” Framing


Macro Themes: Talent, Product Cycles, and the “Deployment Gap”


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

nothing met our bar

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. AI-Generated Art and Animation Experiments

2. AI Character and Meme Creations

3. AI-Driven Music and Video Creations


AI Discord Recap

A summary of Summaries of Summaries by gpt-5.1

1. Wave 13 Coding Agents & AI IDE Tooling

2. Video, Audio & Multimodal Model Tooling

3. Architecture Tricks, Precision Wars & Interpretability

4. GPU Hardware, Kernels & Quantization Engineering

5. Benchmarks, Evaluation Drift, RAG & Code Understanding


Discord: High level Discord summaries

Perplexity AI Discord


BASI Jailbreaking Discord


Unsloth AI (Daniel Han) Discord


OpenRouter Discord


LMArena Discord


OpenAI Discord


Nous Research AI Discord


HuggingFace Discord


LM Studio Discord


Latent Space Discord


Eleuther Discord


Moonshot AI (Kimi K-2) Discord


GPU MODE Discord


tinygrad (George Hotz) Discord


Manus.im Discord


DSPy Discord


Modular (Mojo 🔥) Discord


Yannic Kilcher Discord


Windsurf Discord


The aider (Paul Gauthier) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MCP Contributors (Official) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.




Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1501 messages🔥🔥🔥):

Perplexity Pro promo code, Coding with Perplexity, Christmas Baubles, Gemini Model vs sonnet vs opus for coding


Perplexity AI ▷ #sharing (1 message):

nike0656: https://www.perplexity.ai/search/836b97e3-6d72-4c3d-bc1d-7b568e96fcf1


BASI Jailbreaking ▷ #announcements (1 message):

Christmas, Holidays, BASI, Celebrations


BASI Jailbreaking ▷ #general (653 messages🔥🔥🔥):

GPT 5.2 Jailbreak, Gemini 3 Pro, Discord as Free Google Drive, Gemini's Persistent Memory, DDoS vs SynFlood


BASI Jailbreaking ▷ #jailbreaking (108 messages🔥🔥):

Jailbreaking for NSFW on GPT, Gemini Jailbreak Prompts, Grok NSFW, Bypassing Credit Systems, AI Simulation Layers


BASI Jailbreaking ▷ #redteaming (26 messages🔥):

Google triage issues, Advanced Roleplay Prompts, Gray Swan leaderboard fast track, Red team extraction methodologies, Malware link


Unsloth AI (Daniel Han) ▷ #general (82 messages🔥🔥):

System Configuration for Voice Data Inference, RTX Pro 6000 vs RTX 5090, CPU selection for parallel GPU inference, Fine-tuning for poketwo discord bot, psutil not defined error on Colab


Unsloth AI (Daniel Han) ▷ #introduce-yourself (2 messages):

AI Engineer Introduction, ML & DL, Fine-Tuning, Computer Vision


Unsloth AI (Daniel Han) ▷ #off-topic (280 messages🔥🔥):

Cuneiform OCR, Pluribus Finale Spoilers, 4k 240hz OLED gaming monitor, AI model optimization techniques for faster inference, AI Bio Slimes


Unsloth AI (Daniel Han) ▷ #help (21 messages🔥):

Unsloth QAT and TorchAO impact on llama.cpp, Finetuning Ministral-3B and GGUF conversion issues, Loading a finished QAT model from Unsloth with less RAM


Unsloth AI (Daniel Han) ▷ #showcase (1 message):

2kian: Life-Timeline Forecaster


OpenRouter ▷ #app-showcase (3 messages):

Open-WebUI Integration, llumen Demo Bugs, Chat Pipeline Update


OpenRouter ▷ #general (257 messages🔥🔥):

File size limit for parsing PDFs using OpenRouter, VPN for AI access, Caching, OpenRouter support, Groq acquisition


OpenRouter ▷ #discussion (2 messages):

Regulatory Scrutiny


LMArena ▷ #general (247 messages🔥🔥):

Grok 4.20 release, Lua Scripting, LM Arena Captcha Issues, GLM-4.7 Ranking, AI Video Generation


OpenAI ▷ #ai-discussions (109 messages🔥🔥):

ElevenLabs video generation, Sora 2 Watermark, ElevenLabs vs. Higgsfield pricing, Nano Banana Pro IP filtering


OpenAI ▷ #gpt-4-discussions (7 messages):

Vanished chat history, GPT-5.2 holiday advice, GPT can't control OS, GPT Pro trial


OpenAI ▷ #prompt-engineering (39 messages🔥):

Emergent Behaviors, Suppression of Emergent Behaviors, Hallucination Mitigation, Prompt Engineering Theorycraft, Meta-cognition


OpenAI ▷ #api-discussions (39 messages🔥):

Emergent Behaviors Suppression, ToS Splitting, Meta-cognition for Hallucination Guard, Prompt Engineering claim, Truth-Tracking


Nous Research AI ▷ #general (120 messages🔥🔥):

AES PGP encryption, Sharing repo links for Agentic workflows, Discord channel splitting, Discord threads vs forums, Discord Alternatives: Matrix vs Rocket.Chat


Nous Research AI ▷ #research-papers (21 messages🔥):

Local LLM Inference, GTX 970 for Local AI, GPU Performance Impact


Nous Research AI ▷ #interesting-links (1 message):

promptsiren: https://blog.character.ai/squinch/


Nous Research AI ▷ #research-papers (21 messages🔥):

Local LLM inference cost, GPU performance with mixed cards, Low VRAM usage


HuggingFace ▷ #general (135 messages🔥🔥):

Gradio update fixes, Float16 vs BFloat16 issues, Qwen 2.5VL-3B Image size on P100, Microsoft Trellis 2-4B, Livebook all the things


HuggingFace ▷ #i-made-this (7 messages):

hf-grass, GitHub contribution heatmap, VQ-VAE model training, Google Mobile Actions Model fine-tuning


HuggingFace ▷ #agents-course (6 messages):

Llama-4-Scout-17B-16E-Instruct model issues, Model Access Rejection, Suggested Model Swapping


LM Studio ▷ #general (49 messages🔥):

LM Studio Hugging Face Proxy, Speculative Decoding, NPU Support, Gemini 3 Pro


LM Studio ▷ #hardware-discussion (74 messages🔥🔥):

Dual Channel RAM, Cost Effective LLM Workstation, 4000 Blackwell GPU, Tempered Glass Cases


Latent Space ▷ #ai-general-chat (48 messages🔥):

X-Ware for Inference Benchmarking, Character.ai's Squinch Optimization, DeepWiki Utility in OSS, Amazon's Rufus Chatbot, Nvidia's Groq Acquisition


Latent Space ▷ #genmedia-creative-ai (4 messages):

FlashSR, Audio Enhancement Model, MiraTTS, Hugging Face, GitHub


Eleuther ▷ #research (36 messages🔥):

Partial RoPE Ablations, Long Context Scaling with Qwen3-Next, Attention Normalization, RoPE for Interp, RMSNorm after Attention


Eleuther ▷ #interpretability-general (1 message):

SAE, open-source repositories, fine-tuning the trained SAE


Moonshot AI (Kimi K-2) ▷ #general-chat (30 messages🔥):

Gemini Hallucinations, Benchmark Accuracy, M2.1 vs GLM-4.7, Qwen and RAG, Dynamic Semantic Search


GPU MODE ▷ #general (6 messages):

Quantization, Lecture 7 slides


GPU MODE ▷ #cuda (1 message):

TMA Transpose, Swizzle Optimization


GPU MODE ▷ #youtube-recordings (2 messages):

Lecture Slides, Lecture 7, Quantization


GPU MODE ▷ #rocm (2 messages):

AMDGPU custom builtins, LLVM dev resources


GPU MODE ▷ #self-promotion (1 message):

2kian: Life-Timeline Forecaster


GPU MODE ▷ #submissions (8 messages🔥):

nvfp4_dual_gemm NVIDIA leaderboard submissions, NVIDIA performance improvements


GPU MODE ▷ #teenygrad (2 messages):

Tinygrad Eager Mode, Handwritten CUDA Kernels, TF1 vs TF2, IR.pyodide, Rust mdbook


GPU MODE ▷ #career-advice (2 messages):

cute swizzling, tcgen PTX, open source for skilling up


tinygrad (George Hotz) ▷ #learn-tinygrad (6 messages):

autogen.py, contiguous error, symbolic tensor


Manus.im Discord ▷ #general (6 messages):

Manus Pro credits, Mobile app preview issue, New channel for collaboration/services


DSPy ▷ #general (5 messages):

Agentic context engineering, LLM autodiff/textgrad, Prompt optimization, New member introduction


Modular (Mojo 🔥) ▷ #general (2 messages):

Discord Channel Suggestions, Kapa AI User Experience


Modular (Mojo 🔥) ▷ #mojo (2 messages):

database-PL optimizations, LingoDB, query optimizer


Yannic Kilcher ▷ #general (3 messages):

Christmas Greetings


Windsurf ▷ #announcements (1 message):

Windsurf Wave 13, SWE-1.5 Free, Git Worktree Support, Multi-Cascade Panes & Tabs, Dedicated Terminal