Frozen AI News archive

Mistral 3: Mistral Large 3 + Ministral 3B/8B/14B open weights models

**Mistral** has launched the **Mistral 3 family** including **Ministral 3** models (3B/8B/14B) and **Mistral Large 3**, a sparse MoE model with **675B total parameters** and **256k context window**, all under an Apache 2.0 open license. Early benchmarks rank Mistral Large 3 at **#6 among open models** with strong coding performance. The launch includes broad ecosystem support such as vLLM, llama.cpp, Ollama, and LM Studio integrations. Meanwhile, **Anthropic** acquired the open-source **Bun** runtime to accelerate **Claude Code**, which reportedly reached a **$1B run-rate in ~6 months**. Anthropic also announced discounted **Claude** plans for nonprofits and shared insights on AI's impact on work internally.

Canonical issue URL

Mistral is back!

AI News for 12/1/2025-12/2/2025. We checked 12 subreddits, 544 Twitters and 24 Discords (205 channels, and 9665 messages) for you. Estimated reading time saved (at 200wpm): 697 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

We last saw Mistral Small 3 in Jan, and 3.1 in March, then the mainline models took a detour with Mistral Code and Magistral and Voxtral. Well, after raising 1.7B at a 11.7B valuation, Mistral Large 3 is here together with 3 sizes of Ministral (blogpost), all open weights Apache 2.0.

Mistral Large 3 performance comparison chart showing benchmark results across multiple AI models and evaluation metrics.

It's unfortunate timing coming right after Deepseek V3.2 (#6 on Open Models and #28 on Text), but still a notable achievement for European AI. as Anj points out, this is on Mistral's old cluster - with the new funding, a 6x larger compute cluster will come online in 2026.


AI Twitter Recap

Mistral 3 family: open, multimodal, and everywhere

Anthropic: Bun acquisition, nonprofit program, and how AI is changing work

Frontier benchmarks, leaks, and competitive positioning

Amazon Nova 2.0 (reasoning, agentic, multimodal) and Nova Sonic 2.0 (speech‑to‑speech)

Agents, toolchains, and safety

Research highlights

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Mistral 3 Model Family Release

2. GPU Rental Market in Mongolia

3. Hugging Face Top Contributors

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. OpenAI 'Code Red' and New Model Announcements

2. AI Model and Benchmark Releases

3. AI and Internet Challenges


AI Discord Recap

A summary of Summaries of Summaries by Gemini 3.0 Pro Preview Nov-18

Theme 1. Model Releases: Mistral’s MoE Behemoth, Arcee’s Trinity, and Flux Rankings

Theme 2. Kernel Optimization & Hardware: PyTorch Bugs, Race Conditions, and Leaderboards

Theme 3. Developer Tooling: Unstable IDEs, API Errors, and Sub-Agent Dreams

Theme 4. Security & Jailbreaking: Stealth Modes, Soul Documents, and 29KB Seeds

Theme 5. Industry Shifts: "Alert Level Red," 400GB VRAM Rigs, and Funding Wins


Discord: High level Discord summaries

LMArena Discord


BASI Jailbreaking Discord


Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


LM Studio Discord


OpenAI Discord


Cursor Community Discord


OpenRouter Discord


GPU MODE Discord


Latent Space Discord


Nous Research AI Discord


Moonshot AI (Kimi K-2) Discord


HuggingFace Discord


Modular (Mojo 🔥) Discord


Eleuther Discord


Manus.im Discord Discord


Yannick Kilcher Discord


DSPy Discord


tinygrad (George Hotz) Discord


aider (Paul Gauthier) Discord


MCP Contributors (Official) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1315 messages🔥🔥🔥):

Coreweave and NVIDIA stock, Chinese AI models, Kling vs Runway, DeepSeek Speciale, Sora release


LMArena ▷ #announcements (2 messages):

Flux-2-pro, Flux-2-flex, KAT-coder-pro-v1, Mistral-Large-3


BASI Jailbreaking ▷ #general (1146 messages🔥🔥🔥):

Christianity contradictions and logic, Ethics vs religion, LLMs and jailbreaks, Gemini 3 Pro prompts


BASI Jailbreaking ▷ #jailbreaking (503 messages🔥🔥🔥):

ASCII art jailbreak prompts, Pliny jailbreak, Gemini jailbreak for scraping Reddit, Claude system prompt, GPT-5.1 jailbreak


BASI Jailbreaking ▷ #redteaming (9 messages🔥):

Ethical Jailbreaking, AI Discoveries by Accident, LLM System of Systems


Unsloth AI (Daniel Han) ▷ #general (353 messages🔥🔥):

Spam bots, Arcee AI Trinity Mini model, 500k context release, ShareGPT format, Deepseek 3.2 models


Unsloth AI (Daniel Han) ▷ #introduce-yourself (2 messages):

Introductions, Channel Guidelines


Unsloth AI (Daniel Han) ▷ #off-topic (533 messages🔥🔥🔥):

Gemini 3 Pro Song Detection, Kagi Search Engine, Transformers dependency, LFM-2 VL Model, Attention Heads Collapsing


Unsloth AI (Daniel Han) ▷ #help (35 messages🔥):

Parquet vs CSV Datasets, ShareGPT System Prompt Location, Tuned Model Support Tools in Ollama, ChatML Format Conversion, GPT-OSS-20B Model Loading


Perplexity AI ▷ #general (870 messages🔥🔥🔥):

Image Generation Limits, Grok 4 vs Gemini 3 for Math, Comet Browser Feedback, Perplexity 'Wrapped' Feature Request, Grok roleplay


Perplexity AI ▷ #pplx-api (1 messages):

mares1317: open sauce 👨‍🍳


LM Studio ▷ #general (485 messages🔥🔥🔥):

Risers and Splitters for GPUs, Qwen on Limited Memory, Linux Transition with AI Assistance, LLM-Managed VENVs, Mistral 3 performance


LM Studio ▷ #hardware-discussion (51 messages🔥):

Powering multiple GPUs, Qwen3-Next-80B-A3B on Mac M4, Dual 3080s vs newer cards, CPU upgrade impact on LLM performance, M4 Macbook Pro for inference


OpenAI ▷ #ai-discussions (391 messages🔥🔥):

Grok for animating photos, ChatGPT iOS shopping research, Physical Limits of Robots, GPT-4o/5.1 Bedside Manners, Hallucination by Design


OpenAI ▷ #prompt-engineering (4 messages):

Anime Opening Generation, Custom Bot Creation, Antigravity AI IDE, GPT-OSS 120B


OpenAI ▷ #api-discussions (4 messages):

AI Anime Opening Template, Custom Bot Tutorial, Antigravity by Google, GPT-OSS 120B Model


Cursor Community ▷ #general (393 messages🔥🔥):

Cursor Pro+ Worth, Model Validation, Cursor Sub Agents Orchestration, Cursor on Auto Mode unlimited, Platform sidebars changed


OpenRouter ▷ #announcements (5 messages):

Arcee Trinity Mini, Deepseek V3.2, Distillable Models, Activity Exports, API Keys with Expiration


OpenRouter ▷ #general (362 messages🔥🔥):

DeepSeek Rate Limiting, Internal Server Errors, Gemini 3 Pro Issues, OpenRouter GDPR compliance, Nano Banana Pro issues


OpenRouter ▷ #new-models (5 messages):

``


OpenRouter ▷ #discussion (8 messages🔥):

Microwave Model, Chatty Frustrations, Model Competition


GPU MODE ▷ #general (2 messages):

Inference Providers Profitability


GPU MODE ▷ #triton-gluon (8 messages🔥):

Triton Profiling, Data Parameter Issue, Version Compatibility


GPU MODE ▷ #cuda (5 messages):

Sequential Consistency, __syncwarp(), Race Conditions, syncthreads vs syncwarp, Memory Model


GPU MODE ▷ #torch (4 messages):

PyTorch 2.9.1, cu128, conv3D, cudnn, PyTorch issue #166643


GPU MODE ▷ #off-topic (2 messages):

Eleuther AI Publishing, MLSys career mentorship programs, ML4Health career mentorship program


GPU MODE ▷ #irl-meetup (2 messages):

Quartet, Arxiv Papers, Meetup Attendees


GPU MODE ▷ #rocm (2 messages):

AMD Max Pro 395, enterprise/ai dc grade GPUs, GPU discounts, ROCm support, AI performance


GPU MODE ▷ #self-promotion (4 messages):

Profiling Pytorch Kernels, nCompass Extension, Warpgbm and PackBoost, Qwen3-Omni-30B-A3B-Instruct


GPU MODE ▷ #reasoning-gym (1 messages):

Reasoning-gym generators, Generative MMLU


GPU MODE ▷ #submissions (67 messages🔥🔥):

NVIDIA leaderboard submissions, nvfp4_gemm leaderboard


GPU MODE ▷ #factorio-learning-env (1 messages):

Speaker Identification, Thumbnail Generation


GPU MODE ▷ #cutlass (5 messages):

GEMM in CUDA, Shared memory access patterns, MMA Layouts


GPU MODE ▷ #teenygrad (3 messages):

GitHub repo teenygrad, organization of teenygrad


GPU MODE ▷ #general (2 messages):

Nvidia Competition, Submission Clarification


GPU MODE ▷ #multi-gpu (1 messages):

pynvshmem, nvshmem4py, typo in documentation


GPU MODE ▷ #low-bit-training (2 messages):

Arxiv Paper, Talk Invitation


GPU MODE ▷ #llmq (1 messages):

Activation Offloading, fp8 Adam, Loss Masking, Pyllmq on PyPi


GPU MODE ▷ #helion (1 messages):

Helion Parallel Reduction, Weight Gradients Computation, HL.reduce Usage


GPU MODE ▷ #nvidia-competition (111 messages🔥🔥):

Nvidia Competition T&C, eval_better_bench.py Overhead, Python loop queuing kernel calls, Inconsistency with Runners, GPU mode Terminal User Interface


GPU MODE ▷ #robotics-vla (6 messages):

RL with Parkinson Symptoms, BEAST Tokenizer, stack_blocks success


Latent Space ▷ #ai-general-chat (203 messages🔥🔥):

Edwin Arbus joins Cursor, Arcee AI Debuts Trinity, Apple AI Power Shift, OpenAI Launches Alignment Research Blog, Jeanne DeWitt Grosser’s 10 AI-GTM lessons


Latent Space ▷ #genmedia-creative-ai (10 messages🔥):

Apple videogen paper, AI-generated Zootopia-style game footage, Gradium $70M Seed


Nous Research AI ▷ #general (92 messages🔥🔥):

Mistral Large 3 Size and Architecture, Mistral Medium Specs and Leaks, Arcee Trinity Models, Claude's Soul Document, DeepSeek V3.2 Performance


Nous Research AI ▷ #ask-about-llms (13 messages🔥):

Image/Video LLMs, GPT-OSS, Hermes Finetune, MLX-LM, Gherkin Scenarios


Moonshot AI (Kimi K-2) ▷ #general-chat (51 messages🔥):

Kimi Black Friday personality, Deepseek V3.2 problems, Kimi Coding API issues, Roo Code issues, Kimi K2 Thinking in app


HuggingFace ▷ #general (21 messages🔥):

Hugging Face Pro payment issues, PPOTrainer with accelerate and bf16 errors, Tokenizer type is bool after model name change, DPO as RL technique, ACE framework for agents learning from mistakes


HuggingFace ▷ #i-made-this (2 messages):

FFMPEG radio station, Open source AI music models


HuggingFace ▷ #computer-vision (1 messages):

Computer Vision API library, Robotics and automation models, Developer-facing API feedback


HuggingFace ▷ #smol-course (15 messages🔥):

Course Unit Updates, Model Evaluation, Unit Certifications, Final Project Clarification


HuggingFace ▷ #agents-course (3 messages):

AI Agents Course, Synthetic Data Unit, Order Following in AI Systems


Modular (Mojo 🔥) ▷ #mojo (35 messages🔥):

def keyword status, var keyword status, parallelize function safety, MutOrigin.external vs MutAnyOrigin for ffi


Eleuther ▷ #general (10 messages🔥):

NUS PhD intro, AI + Web3 developer introduction, Getting Help Reading Research Papers, fast.ai as a beginner course


Eleuther ▷ #research (3 messages):

Perplexity measurement, MMLU benchmark, topic datasets


Eleuther ▷ #scaling-laws (10 messages🔥):

Scaling Laws, Pretraining Dynamics, Decorrelated Performance, Nonlinear Metrics


Manus.im Discord ▷ #general (16 messages🔥):

Manus Auth issues, Manus instability, Chat Mode adjustment, Gemini 3 Pro, AI-powered automation


Yannick Kilcher ▷ #general (8 messages🔥):

Intern Recommendations, Learning Algorithms, Synthetic Data, Pug Resource, Docker and Kubernetes basics


Yannick Kilcher ▷ #paper-discussion (3 messages):

Kattention Module, TopKHot Autograd Function, HardTopKHotBCE Autograd Function


Yannick Kilcher ▷ #ml-news (3 messages):

Mistral 3, Llama finetunes, wavefunction


DSPy ▷ #show-and-tell (1 messages):

justanotheratom: https://www.elicited.blog/posts/managing-tools-in-dspy


DSPy ▷ #general (4 messages):

Prompt Injection Defenses in DSPy, Security Measures, Training Dataset for Attack Mitigation, Partnership Proposal


tinygrad (George Hotz) ▷ #general (2 messages):

Kernel Development Tools, Regression Test for Beam