Frozen AI News archive

not much happened today

**NousResearch's Nomos 1** is a 30B open math model achieving a top Putnam score with only ~3B active parameters, enabling consumer Mac inference. **AxiomProver** also posts top Putnam results using ThinkyMachines' RL stack. **Mistral's Devstral 2 Small** outperforms DeepSeek v3.2 in 71% of preferences with better speed and cost. **Anthropic's Claude Code** introduces asynchronous agent execution. **Cursor 2.2** adds deep agent primitives like Debug and Plan Modes. **VS Code** launches unified agent chat sessions improving multi-agent workflows. **LangChain** releases "Polly" for agent observability. The **Stirrup** harness leads OpenAI GDPval benchmarks with Claude Opus 4.5, GPT-5, and Gemini 3 Pro following. Advances in quantization include **vLLM** integrating Intel's AutoRound PTQ for efficient serving. **Unsloth** achieves up to 3× training speedups with new kernels across Llama, Qwen, Mistral, and Gemma models. *"Compositional reasoning + specialized post-training under constrained active params can rival frontier closed models on formal math."*

Canonical issue URL

a calm before the last batch of releases.

AI News for 12/9/2025-12/10/2025. We checked 12 subreddits, 544 Twitters and 24 Discords (205 channels, and 6101 messages) for you. Estimated reading time saved (at 200wpm): 529 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Check out the RL talks from AIE Code.


AI Twitter Recap

Open math and reasoning: small active params + agents hit top-tier performance

Agentic coding systems, orchestration, and evals

Systems, performance, and compute trends

Multimodal, vision/video, and factuality

Autonomy, proactive agents, and AI-native product loops

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Unsloth AI Training Optimization

2. Mistral AI Model Releases

3. Hardware and CLI Innovations

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. OpenAI Strategic Shift and AGI Pause

2. Claude Modular Rules Update

3. Futuristic Technology and AI Innovations


AI Discord Recap

A summary of Summaries of Summaries by gpt-5.1

1. High-Performance Training, Kernels, and GPU Wizardry

2. New Models, Context Monsters, and Coding Specialists

3. Agentic Ecosystem, MCP, and AI Tooling Stack

4. Security, Evaluation Methodologies, and Interpretability

5. Education, Study Groups, and Long‑Horizon AI Skill‑Building


Discord: High level Discord summaries

Unsloth AI (Daniel Han) Discord


LMArena Discord


Cursor Community Discord


LM Studio Discord


BASI Jailbreaking Discord


OpenAI Discord


Perplexity AI Discord


OpenRouter Discord


Nous Research AI Discord


Latent Space Discord


Eleuther Discord


Moonshot AI (Kimi K-2) Discord


GPU MODE Discord


HuggingFace Discord


Yannick Kilcher Discord


Modular (Mojo 🔥) Discord


Manus.im Discord Discord


MCP Contributors (Official) Discord


Windsurf Discord


DSPy Discord


tinygrad (George Hotz) Discord


aider (Paul Gauthier) Discord


MLOps @Chipro Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Unsloth AI (Daniel Han) ▷ #general (422 messages🔥🔥🔥):

Microwave Model, Dataset Guide Improvements, Deepseek Quant Request, GLM-4.6V-Flash, Qwen3-Next Looping Issues


Unsloth AI (Daniel Han) ▷ #introduce-yourself (1 messages):

AI Engineer, intelligent voice agents, chatbots, GPT-powered assistants, Pipecat


Unsloth AI (Daniel Han) ▷ #off-topic (644 messages🔥🔥🔥):

Agentic AI Foundation, Dataset reordering, HF CEO, Fine-tuning dLLMs, Lyrics in prompt


Unsloth AI (Daniel Han) ▷ #help (10 messages🔥):

Qwen3-VL-30B tool calling issue, Qwen3VL encoding image slice failure, Gemma-3-270m notebook ValueError, LoRA rank effect on final LLM


Unsloth AI (Daniel Han) ▷ #showcase (4 messages):

Unsloth finetuning embeddings, Embedding Model finetuning with Unsloth


Unsloth AI (Daniel Han) ▷ #research (2 messages):

Research Channel, Arxiv


LMArena ▷ #general (854 messages🔥🔥🔥):

Grok 4.2 release, LMArena's Rate Limits, Gemini 3 Flash release, AI Video generation, Huggingface Spaces Hosting


LMArena ▷ #announcements (1 messages):

November Contest, Code Arena Contest, Voting for Contest Winners


Cursor Community ▷ #general (834 messages🔥🔥🔥):

Rules vs Commands differences, Nvidia Open Source Model, Cursor on Linux, Levels in Cursor, Agent Terminal 0 Output Bug


LM Studio ▷ #general (340 messages🔥🔥):

LM Studio 0.3.34 Release, Agentic LLMs and GPU Offload, Cursor IDE limitations with local models, Model orchestration, OpenAI vs local LLMs


LM Studio ▷ #hardware-discussion (407 messages🔥🔥🔥):

PC Building in Smokers Room, PCIe Port Design Flaw, be quiet PSU Coil Whine, MI50 GPU on Windows, GPU on Bottom Slot


BASI Jailbreaking ▷ #general (452 messages🔥🔥🔥):

AI Symbiosis, Grok OSINT Recon, Open Source Multi-Agent Discord Bot, Jailbreak index, Local NSFW Models


BASI Jailbreaking ▷ #jailbreaking (134 messages🔥🔥):

Gemini 3 Pro Jailbreak, Azure OpenAI GPT-4o Jailbreaking, ko2bot.com pre-jailbroken models, UltraBr3aks jailbreak, Arabic Language Models


BASI Jailbreaking ▷ #redteaming (6 messages):

VAPT, Android Application


OpenAI ▷ #annnouncements (1 messages):

Cybersecurity Models, Preparedness Framework, Cyber Resilience


OpenAI ▷ #ai-discussions (512 messages🔥🔥🔥):

Gemini 3 Pro vs ChatGPT, Devstral Model, OpenAI's slow support, 40% Keyboards, Native Apps


OpenAI ▷ #gpt-4-discussions (4 messages):

Sora 2, Pro Plan, Video Generation Limits


OpenAI ▷ #prompt-engineering (15 messages🔥):

ChatGPT vs Gemini file handling, LLM Stability Scores, Reproducible Stability Protocol


OpenAI ▷ #api-discussions (15 messages🔥):

ChatGPT vs Gemini, stability scores, prompt engineering


Perplexity AI ▷ #general (538 messages🔥🔥🔥):

ChatGPT 5.2, Gemini 3 Pro, Perplexity AI R1, OpenAI's Style of Writing, AGI


Perplexity AI ▷ #sharing (1 messages):

Cursor Editor, Competitor endorsement


Perplexity AI ▷ #pplx-api (4 messages):

Perplexity API, Finance features, Financial Modeling Prep (FMP), FMP MCP server


OpenRouter ▷ #general (301 messages🔥🔥):

Deepseek v3.2 rate limited messages, Brother color laser printer, compact home laser printers, Printing Waifus, Miku hologram box


OpenRouter ▷ #discussion (6 messages):

Olive Oil Cake, CF Patch, Anthropic Safety Filtering


Nous Research AI ▷ #announcements (1 messages):

Nomos 1, Open Source Model, AI Mathematician


Nous Research AI ▷ #general (117 messages🔥🔥):

Lexical Wave Function Collapse, Agentic Benchmarks, Putnam AI Performance, Transformer Architecture Limitations, Combining Vision Adapters


Nous Research AI ▷ #ask-about-llms (20 messages🔥):

Hermes 4.3, KoboldCPP, SillyTavern, Nomos Tool Use, Model Performance


Latent Space ▷ #ai-general-chat (56 messages🔥🔥):

Eleven Labs Reader, Linux Foundation size, AI Agent Evaluations, Puppeteer vs Cypress, Latent Space Resources


Latent Space ▷ #genmedia-creative-ai (41 messages🔥):

ModelScope Bias, RoR vs Node.js, Fake Nitter Screenshots


Eleuther ▷ #general (50 messages🔥):

AI Slop, Brandolini's Law, OLMo-1 runs, Pythia eval dataset


Eleuther ▷ #research (42 messages🔥):

Deepseek v3.2, ARC-AGI, Adaptive AI, Thinking Machines Tinker Product


Eleuther ▷ #interpretability-general (1 messages):

Diffusion Models, Synthetic vs Naturalistic Data


Moonshot AI (Kimi K-2) ▷ #general-chat (70 messages🔥🔥):

Mistral Vibe, Devstral model, GLM 4.6, iFlow, Qwen-Code


GPU MODE ▷ #general (8 messages🔥):

Free CUDA sites, Parallel GPU sort disagreements


GPU MODE ▷ #triton-gluon (5 messages):

PTXAS error with sm_103, Triton PTX codegen error, CUDA toolkit 12.9, Triton community meetup


GPU MODE ▷ #torch (1 messages):

LLM Sparsification, Transformers Library, GPU Code Inspection


GPU MODE ▷ #jobs (3 messages):

Performance Engineers Hiring, High Compensation Packages, Silicon Valley Job Market


GPU MODE ▷ #beginner (3 messages):

Inference Serving, NPU Compiler Learning


GPU MODE ▷ #torchao (1 messages):

walrus_23: Made a little documentation update PR: https://github.com/pytorch/ao/pull/3480


GPU MODE ▷ #off-topic (1 messages):

ChatGPT memory, Blog post on ChatGPT's memory system


GPU MODE ▷ #self-promotion (5 messages):

Register Best Practices, Mojo on Apple Silicon, AMD vs. Nvidia, PTX Registers


GPU MODE ▷ #submissions (5 messages):

NVIDIA performance, nvfp4_gemm leaderboard updates, Submission results


GPU MODE ▷ #multi-gpu (2 messages):

NCCL ranks falling out of sync, Troubleshooting NCCL ranks, Collective launch skew analyzer


GPU MODE ▷ #helion (1 messages):

Helion webinar, PTC launch, Helion kernels


GPU MODE ▷ #nvidia-competition (22 messages🔥):

Benchmark performance swings, Mojo kernel from Python submission, Benchmarking cuBLAS on GEMM, torch._scaled_mm and cuBLAS, Discord bot error


GPU MODE ▷ #robotics-vla (3 messages):

llia larchenko X post


HuggingFace ▷ #general (34 messages🔥):

Anthropic donating to Linux Foundation, Arrow vs Parquet file formats, Tool Calling with Open Source LLMs, Unsloth for faster training, Lightweight Vision Transformer models


HuggingFace ▷ #today-im-learning (1 messages):

Token Throughput, Qwen3 Model


HuggingFace ▷ #i-made-this (4 messages):

retrain-pipelines, GOSIM Foundation, AI voice chat, WebGPU, GLM ASR model


HuggingFace ▷ #reading-group (3 messages):

Diffusion Models Study Group, Transformer Architecture Workshop, Diffusion Transformers Workshop


HuggingFace ▷ #agents-course (1 messages):

erdong_43406: Hello everyone.


Yannick Kilcher ▷ #general (17 messages🔥):

SI law, Superintelligence, AI Scams, AI HR, Generative Rehearsal Technique


Yannick Kilcher ▷ #paper-discussion (1 messages):

burnytech: Damn, likely colliding with something else I have


Yannick Kilcher ▷ #ml-news (24 messages🔥):

China rare-earth mineral control, Mistral dense vs MoE models, Mixtral Initialization, Mistral Devstral-2 Vibe CLI, Mistral EU Oligopoly


Modular (Mojo 🔥) ▷ #general (1 messages):

jokellum: <@&1116225504563970138>


Modular (Mojo 🔥) ▷ #mojo (34 messages🔥):

Embedded AI development boards for Mojo, Removing system-installed Mojo with Pixi, Qwen3 model availability, Roadmap for function inspection and manipulation in Mojo metaprogramming, Memory allocation control in Mojo


Manus.im Discord ▷ #general (5 messages):

New app launch, Project Crash, Website creation deal


MCP Contributors (Official) ▷ #general (4 messages):

LF Migration, Governance


Windsurf ▷ #announcements (2 messages):

Windsurf 1.12.41 Release, Windsurf Next Features, Windsurf Login Restored


DSPy ▷ #general (1 messages):

DSPy, OpenAI, GPTs, Adapter


tinygrad (George Hotz) ▷ #general (1 messages):

tinygrad PR 13553, GPU acceleration


aider (Paul Gauthier) ▷ #general (1 messages):

pierrunoyt: hi


MLOps @Chipro ▷ #events (1 messages):

Diffusion Models Study Group, Transformer Architecture Workshop, Diffusion Transformers Workshop