Frozen AI News archive

MCP -> Agentic AI Foundation, Mistral Devstral 2

**OpenAI Engineering** sees a significant collaborative milestone with the launch of the **Agentic AI Foundation** under the Linux Foundation, uniting projects from **Anthropic**, **OpenAI**, and **Block**. **Mistral** released **Devstral 2**, a coding model with **123B parameters** and open weights, offering a cost-effective alternative to **Sonnet 4.3** and competitive performance against **DeepSeek v3.2**. The new **Mistral Vibe CLI** supports agentic coding workflows with rapid ecosystem integration. **Alibaba** introduced **Soft Adaptive Policy Optimization (SAPO)** for reinforcement learning tuning, improving stability and performance in **Qwen3-VL** across multiple tasks. Research highlights include the importance of data decontamination in RL and ongoing discussions on MoE RL stability and reward hacking mitigation.

Canonical issue URL

A good day for open AI Engineering.

AI News for 12/8/2025-12/9/2025. We checked 12 subreddits, 544 Twitters and 24 Discords (205 channels, and 7780 messages) for you. Estimated reading time saved (at 200wpm): 644 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

A rare cross company move with the establishment of the Agentic AI Foundation under the Linux Foundation, with Anthropic's MCP joining OpenAI's Agents.md and Block's Goose as founding projects.

Agentic AI Foundation website with circular images representing different technological and professional scenarios, showcasing the foundation's mission of advancing AI collaboration.

Mistral continues to make minor waves with Devstral 2 it's new coding model that is "Sonnet 4.3 level" but 10x cheaper by API and open weights, winning or tying with DeepSeek v3.2 71% of the time in third party human evals.

A graph comparing AI model performance and size, highlighting Devstral 2 as a top performer in the SWE-bench Verifie

The new Mistral Vibe CLI is a pleasure to try out, even if not SOTA.


AI Twitter Recap

Mistral’s Devstral 2 release and the “agentic coding” toolchain

RL for LLMs: stability, decontamination, and process rewards

Agent protocols and frameworks: MCP to Linux Foundation; AWS Strands; LangChain

Benchmarks and evaluation hygiene

Notable model releases (vision, TTS, reasoning)

Infra and performance: training/serving improvements

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Mistral AI Tools Announcement

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Anthropic's Model Context Protocol Donation

2. AI Upscaling and Image Processing

3. AI Perception and Public Awareness


AI Discord Recap

A summary of Summaries of Summaries by gpt-5.1

1. New High-Performance & Specialist Models

2. Agentic Ecosystem & MCP / IDE Tooling

3. Quantum, Neuromorphic & Energy-Constrained Directions

4. Infra, GPUs, and Torch-Level Performance Hacks

5. Evaluation, Prompt/Context Engineering, and Search Tooling


Discord: High level Discord summaries

LMArena Discord


Cursor Community Discord


BASI Jailbreaking Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


LM Studio Discord


OpenRouter Discord


Eleuther Discord


Latent Space Discord


Nous Research AI Discord


GPU MODE Discord


HuggingFace Discord


Yannick Kilcher Discord


Modular (Mojo 🔥) Discord


Moonshot AI (Kimi K-2) Discord


Manus.im Discord Discord


MCP Contributors (Official) Discord


aider (Paul Gauthier) Discord


MLOps @Chipro Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1200 messages🔥🔥🔥):

GPT-5.2 release, Nano Banana Pro vs Gemini 3 Pro, Hazel Image Models, Grok models, AI video scams


LMArena ▷ #announcements (1 messages):

Text Arena Leaderboard, ERNIE-5.0-Preview-1103


Cursor Community ▷ #general (795 messages🔥🔥🔥):

Cursor bugs, Agent terminal output bug, Custom modes, AI and UI design


Cursor Community ▷ #announcements (1 messages):

language-specific threads, sticky message in announcements channel, level roles


BASI Jailbreaking ▷ #general (759 messages🔥🔥🔥):

The Bible as Linguistic and Legal Source, Evolving Religious Systems, Project Looking Glass, AI and Image Generation, jailbreaking Models


BASI Jailbreaking ▷ #jailbreaking (98 messages🔥🔥):

DAN Prompt, Jailbreaking NBP for Face Swapping, Gemini 3 Pro Jailbreak, Image Censorship Bypass in GPT, Pre-Jailbroken Models


BASI Jailbreaking ▷ #redteaming (18 messages🔥):

Gandalf 7, Gandalf 8, Arbitrary Code Execution, Hacking with AI


OpenAI ▷ #ai-discussions (712 messages🔥🔥🔥):

Technical Interview AI, GPT-5.2 vs Gemini 3 Pro, Chinese AI Training, Google AI Studio vs Gemini


OpenAI ▷ #gpt-4-discussions (4 messages):

GPT Model Discussions, OpenAI Discord Channels, Workflow Analysis on GPT 5.2


OpenAI ▷ #prompt-engineering (21 messages🔥):

Prompt Engineering for code analysis and improvement, Stability Index Scoring Rubric, Framework Fidelity and Token Limitations, Sharing Large Frameworks on Platforms, Understanding LLM Behavior and Limitations


OpenAI ▷ #api-discussions (21 messages🔥):

Prompt Engineering Learning, LLM-based Code Analysis, Sharing chats and privacy, Rubric for stability scores, Prompting for code analysis


Unsloth AI (Daniel Han) ▷ #general (488 messages🔥🔥🔥):

Friendship AI, Nvidia chips for China, Github Copilot Agents, 3D Model Dataset, Unsloth Datasets


Unsloth AI (Daniel Han) ▷ #introduce-yourself (1 messages):

Custom Chatbots, Automation Agents, RAG Search Tools, Speech-to-Text Pipelines, Content Automation Tools


Unsloth AI (Daniel Han) ▷ #off-topic (178 messages🔥🔥):

ArtCNN for upscaling, RIFE SOTA status, Llama2.c in Yankovic, Linux Foundation Agentic AI Foundation, Dataset Hell


Unsloth AI (Daniel Han) ▷ #help (10 messages🔥):

Unsloth Usage, Qwen3-VL-30B-A3B-Instruct UD Q5 XL with llama.cpp issues, Qwen3VL Encoding Problems


Unsloth AI (Daniel Han) ▷ #research (5 messages):

Vision Transformers, Deepthink comparison


Perplexity AI ▷ #general (651 messages🔥🔥🔥):

Prompt generators, GPT 5 vs Gemini, Perplexity AI Pro Limits, Comet browser education subscription


Perplexity AI ▷ #pplx-api (3 messages):

Jio Gemini API, Sonar API


LM Studio ▷ #general (194 messages🔥🔥):

Desktop Commander security risks, GLM 4.6v Flash model, AMD GPU for image generation, Training LLMs, Cybersecurity Sidekick


LM Studio ▷ #hardware-discussion (395 messages🔥🔥):

RTX 3060 SLI with 3080, GDDR6 VRAM Temperature, Multi-GPU setup for LLM, Qwen3 30B, ai memory project: mcp-ai-memory


OpenRouter ▷ #general (264 messages🔥🔥):

Exa Search Quality, Parallel.ai Recommendation, Deepseek v3 Formatting Issues, OpenRouter Chatroom Refresh, Mistral's Comeback


OpenRouter ▷ #new-models (4 messages):

``


OpenRouter ▷ #discussion (12 messages🔥):

AI Studio, Olive Oil Cake, NousResearch CF Patch


Eleuther ▷ #general (122 messages🔥🔥):

Quantum Hardware for Language, H100 Speedrun, RWKV 8 Architecture, Neuromodulatory Control Networks (NCN), AI Slop Definition


Eleuther ▷ #research (45 messages🔥):

Quantum Machine Learning Simulators, Real Quantum Hardware Training for Language, RNN + Transformer Hash-Hop, 4096 Token Context Window Training, Selective Gradient Masking


Eleuther ▷ #interpretability-general (2 messages):

Task Optimized KV Caches, Task Optimized LoRAs, Mechanistic Interpretability of Diffusion Models


Eleuther ▷ #lm-thunderdome (3 messages):

Anthropic fix, lm-evaluation-harness


Latent Space ▷ #ai-general-chat (77 messages🔥🔥):

Unconventional AI, ManusAI Context-Engineering, Howie Xu AI talent, BERTopic & Hugging Face Nightly Metadata Tensor Leak, OpenAI Television Ads


Latent Space ▷ #genmedia-creative-ai (48 messages🔥):

Contact-Sheet Prompting, ModelScope Bias, RoR vs Node.js, Fake screenshot warning


Nous Research AI ▷ #announcements (1 messages):

Nomos 1, Open Source, AI Mathematician


Nous Research AI ▷ #general (111 messages🔥🔥):

Terminals SDK & Gemini integration, SMC steering for model degradation, Video game 3D map generation with AI, Multithreaded Sam3 performance, AI-generated website analysis reports


Nous Research AI ▷ #ask-about-llms (10 messages🔥):

Web Chat Tools, Hermes 4.3 Model, SillyTavern, KoboldCPP UI, Token Generation Speed


GPU MODE ▷ #general (4 messages):

ML Infra, ML Systems, Career Advice


GPU MODE ▷ #cuda (7 messages):

Beginner docs, Livestream coding, PTX doc feedback, Kernel challenge timing


GPU MODE ▷ #torch (2 messages):

torch.compile with slicing ops, static KV cache pre-allocation, KV cache updates


GPU MODE ▷ #cool-links (1 messages):

mobicham: https://www.radixark.ai from SGLang folks 👀


GPU MODE ▷ #jobs (1 messages):

crankshot1698: Shot you a dm!


GPU MODE ▷ #beginner (1 messages):

serverlessLLM, blitzscale, inference serving


GPU MODE ▷ #pmpp-book (4 messages):

A100 FLOPS calculation, TF32 MMA, FP16 MMA, Elementwise ops


GPU MODE ▷ #off-topic (1 messages):

jaefosho: Very Eastern European


GPU MODE ▷ #irl-meetup (1 messages):

szymonoz: I'll be in SF this week, anyone from the Bay up for a meetup?


GPU MODE ▷ #liger-kernel (3 messages):

ML Infra, Marksaroufim, How does one break into ML infra space?


GPU MODE ▷ #self-promotion (6 messages):

Registers Best Practices, Mojo, Apple Silicon, Metal IR


GPU MODE ▷ #submissions (21 messages🔥):

nvfp4_gemm benchmark, vectorsum_v2 benchmark, NVIDIA performance, A100, B200, H100, L4 performance


GPU MODE ▷ #factorio-learning-env (2 messages):

Factorio video, AI dev vs AI agent


GPU MODE ▷ #cutlass (2 messages):

Cutlass GEMM Tutorial, Tensor Layout


GPU MODE ▷ #helion (1 messages):

Helion webinar, PTC launch, Helion kernels


GPU MODE ▷ #nvidia-competition (16 messages🔥):

GEMM Competition, 50 Series NVFP4 Support, B200 GPU Performance, Mojo Availability


GPU MODE ▷ #robotics-vla (7 messages):

Variable Textures, Inference Speedups, Hanging Mug Successes, Interesting Choices


HuggingFace ▷ #general (30 messages🔥):

SmolVLA paper SO100 benchmark, HF Billing issues, Image/Video Generation models, Apple Clara 7B Instruct, AI FYP Guidance


HuggingFace ▷ #i-made-this (8 messages🔥):

Optuna HPO skill, Chronos-1.5B, Quantum-Classical Hybrid Language Model, Quantum circuit parameters trained on IBM quantum hardware


HuggingFace ▷ #agents-course (10 messages🔥):

AI Agent Workshop, Hugging Face Providers Outdated, LlamaIndex Issues, Free LLM Alternatives


Yannick Kilcher ▷ #general (11 messages🔥):

Whisper streaming, MultiTalker Parakeet streaming, AI Engineer seeking collaboration


Yannick Kilcher ▷ #paper-discussion (1 messages):

burnytech: Damn, likely colliding with something else I have


Yannick Kilcher ▷ #ml-news (15 messages🔥):

Claude coding secureness, Endomorphosis links, China chip manufacturing, Arcprize future, H200 on eBay


Modular (Mojo 🔥) ▷ #general (1 messages):

jokellum: <@&1116225504563970138>


Modular (Mojo 🔥) ▷ #announcements (2 messages):

Community Meeting, MMMAudio in Mojo, Shimmer OpenGL experiment, Mojo 1.0 Roadmap, Modular Team Updates


Modular (Mojo 🔥) ▷ #mojo (22 messages🔥):

Mojo memory management, MMMAudio presentation with Faust, ImplicitlyCopyable List, String CoW, Embedded AI development with Mojo


Moonshot AI (Kimi K-2) ▷ #general-chat (19 messages🔥):

Kimi.com website issues, Kimi's search tool, Kimi citation issues


Manus.im Discord ▷ #general (8 messages🔥):

Checkpoint restoration issues, Credits issue, Manus 1.5 critical incidents


MCP Contributors (Official) ▷ #general (4 messages):

Anthropic's donation, Model Context Protocol, Agentic AI Foundation, LF standards


MCP Contributors (Official) ▷ #general-wg (3 messages):

MCP Usage, Private vs Public Ecosystems, Client-ID Metadata Documents (CIDM), MCP Servers, Developer Tools


aider (Paul Gauthier) ▷ #general (2 messages):

Opus with Amazon bedrock and aider


aider (Paul Gauthier) ▷ #questions-and-tips (1 messages):

aider features, aider image support, aider workflow


MLOps @Chipro ▷ #events (2 messages):

AI Agent 0-1 Workshop, AI Engineering Bootcamp, GitHub Social Club NYC