Frozen AI News archive

not much happened today

**GPT-5.2** shows mixed performance in public evaluations, excelling in agentic tasks but at a significantly higher cost (~**$620/run**) compared to **Opus 4.5** and **GPT-5.1**. It performs variably on reasoning and coding benchmarks, with some improvements on long-context tasks. Extended "reasoning effort" settings notably impact results. Aggregators rank **Gemini 3 Pro** above GPT-5.2 in task persistence. **OpenAI** released sparse activation models sparking debate on sparsity vs MoE architectures. **Allen AI**'s **Olmo 3.1 (32B)** advances open reinforcement learning scale with substantial compute investment (~**125k H100 hours**). **Mistral**'s Devstral-2 and **llama.cpp** improve local inference infrastructure with new features like GGUF support and distributed speedups. **Tinker** platform goes GA with vision input and finetuning support for **Qwen3-VL-235B**.

Canonical issue URL

a quiet friday.

AI News for 12/11/2025-12/12/2025. We checked 12 subreddits, 544 Twitters and 24 Discords (205 channels, and 8597 messages) for you. Estimated reading time saved (at 200wpm): 621 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

More AIE Talks rolling out all weekend.


AI Twitter Recap

Frontier model evals: GPT‑5.2 vs Opus 4.5 and Gemini 3, costs, and context settings

Open models, RL scaling, and sparsity

Agent platforms and tooling

New techniques and papers

Product and leaderboard updates

Benchmarks: expectations vs reality

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. NVIDIA Nemotron Model Leak

2. TimeCapsuleLLM Project Update

3. High-Performance Server Builds for LLMs

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. AI Model Benchmarks and Comparisons

2. Z-Image Model Updates and Releases

3. Humanoid Robots and AI in Healthcare


AI Discord Recap

A summary of Summaries of Summaries by gpt-5.1

1. Frontier Model Wars: GPT‑5.2 Versus Opus, Gemini, Kimi & DeepSeek

2. Jailbreaking, Safety Evasion & Red‑Teaming Techniques

3. Local / Open‑Source Model Engineering, Hardware & Performance

4. Infrastructure, Protocols and Observability for LLM Systems

5. Agentic Coding Tools, IDEs and Workflows


Discord: High level Discord summaries

LMArena Discord


BASI Jailbreaking Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


OpenAI Discord


OpenRouter Discord


LM Studio Discord


Nous Research AI Discord


HuggingFace Discord


Yannick Kilcher Discord


Eleuther Discord


GPU MODE Discord


MCP Contributors (Official) Discord


Modular (Mojo 🔥) Discord


Moonshot AI (Kimi K-2) Discord


aider (Paul Gauthier) Discord


DSPy Discord


Manus.im Discord Discord


Windsurf Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1207 messages🔥🔥🔥):

Peppino in AI, Opus vs GPT for Coding, GPT 5.2 Benchmarking Troubles, Gemini 3 vs GPT-5, LMArena Error Reports


LMArena ▷ #announcements (1 messages):

New GLM Models, Text Arena, Vision Arena


BASI Jailbreaking ▷ #general (836 messages🔥🔥🔥):

Hallucinating LSD recipes, OpenAI moderation failing, SUPERAntiSpyware, Financial Physics, jailbreak grok on app


BASI Jailbreaking ▷ #jailbreaking (166 messages🔥🔥):

Gemini 3 Pro Jailbreak, Deepseek Jailbreak, Claude Opus 4.5 Jailbreak, LLM Jailbreak Techniques, Banana Jailbreak


BASI Jailbreaking ▷ #redteaming (9 messages🔥):

``


Cursor Community ▷ #general (1073 messages🔥🔥🔥):

Model Selection for Refactoring, TTS Announcer with Neural Voice, Linking Accounts with Cursor, Context Window Limits with LLMs, Cursor Quota Usage


Cursor Community ▷ #announcements (1 messages):

Debug Mode, Browser Layout, Style Editor, Plan Mode, Multi-agent judging


Unsloth AI (Daniel Han) ▷ #general (332 messages🔥🔥):

Daniel Han Appreciation, Selling 'Dans', Devstral Fixes, RL Model Selection, Unsloth UI Interest


Unsloth AI (Daniel Han) ▷ #introduce-yourself (3 messages):

Newcomers ask Getting Started advice, Newcomer's Project Ideas


Unsloth AI (Daniel Han) ▷ #off-topic (600 messages🔥🔥🔥):

Humanoid Robots and Architecture, Tiiny AI Homelab and PowerInfer, Mouse Recommendations, Keyboard Preferences, Data Validation Agent


Unsloth AI (Daniel Han) ▷ #help (30 messages🔥):

LoRA & GRPO issues, Fine-tuning/RL, Unsloth GRPO patch, Llama-3.1-8B LoRA to GGUF, Fine-tuning advice


Unsloth AI (Daniel Han) ▷ #showcase (27 messages🔥):

Unsloth PR Controversy, XLMRobertaModel support, HF Code Upload


Perplexity AI ▷ #general (955 messages🔥🔥🔥):

GPT 5.2, Comet Browser, Model Performance, Perplexity Spaces, Limits on Pro Plans


Perplexity AI ▷ #sharing (3 messages):

Job opportunity at MorningAI, Shareable threads on Discord


Perplexity AI ▷ #pplx-api (1 messages):

Perplexity API for Finance, Perplexity sec endpoint, REST API finance endpoint


OpenAI ▷ #annnouncements (1 messages):

Video on Twitter, Placeholder Topic 2


OpenAI ▷ #ai-discussions (385 messages🔥🔥):

GPT 5.2, Gemini 3 Pro, Image Generation AI, AI for Coding, Alternate Universe Map Generation


OpenAI ▷ #gpt-4-discussions (30 messages🔥):

GPT 5.2 Rollout, GPT 5.2 Benchmarks, iOS Client Editing Issues, Project Memory


OpenAI ▷ #prompt-engineering (5 messages):

Custom GPT pushback, Safety Features, Prompt interpretation


OpenAI ▷ #api-discussions (5 messages):

Prompt pushback, Custom GPT safety features, Image generation inconsistencies


OpenRouter ▷ #announcements (1 messages):

Traces & Observability, OpenRouter Broadcast, Langfuse, LangSmith, Datadog


OpenRouter ▷ #general (219 messages🔥🔥):

GPT 5.2 performance, OpenRouter Free Credits, OpenRouter Latency, JSON schema adherence


OpenRouter ▷ #discussion (123 messages🔥🔥):

Discord bot thread creation, AI improvement through past issues, RAG implementation, Listing custom models on OR, Zoom's new LLM


LM Studio ▷ #general (189 messages🔥🔥):

GPT-OSS-20B, IA local engines download path, Devstral small 2, Mistral 3 lineup, Qwen 80


LM Studio ▷ #hardware-discussion (47 messages🔥):

GPU for 30GB Model, 7900 XTX Price vs Performance, CUDA Advantages, Server GPU Power Solutions, Model Size for C++ Coding


Nous Research AI ▷ #general (82 messages🔥🔥):

GPT-5.2 Release, Gemini 3.0 and Claude Opus, Oracle's AI play, Nous Research pivoting to RL, Reverse Charge VAT


Nous Research AI ▷ #ask-about-llms (6 messages):

Censorship, Thinking Loop


Nous Research AI ▷ #research-papers (20 messages🔥):

AI Survey, Impressive AI Model, Engineering Blueprints


Nous Research AI ▷ #research-papers (20 messages🔥):

AI Survey, impressive model, Engineering blueprints


HuggingFace ▷ #general (98 messages🔥🔥):

RTX 3090, DGX Spark, Ollama, load_dataset getting stuck, HF Pro Storage


HuggingFace ▷ #i-made-this (3 messages):

Superintelligence relational cognition, tokenflood v.0.6.0, ReasoningLayer AI


HuggingFace ▷ #smol-course (6 messages):

AI and Blockchain Engineer Introduction, Large Scale Training on LAION-5B, AI Engineer Collaboration


HuggingFace ▷ #agents-course (1 messages):

RAG Setup, Context Retrieval, Hallucination Issues in RAG


Yannick Kilcher ▷ #general (19 messages🔥):

CV Spammers, Reinforcement Learning unpopularity


Yannick Kilcher ▷ #paper-discussion (1 messages):

erkinalp: <#1448887055936655441> (co-authored by GPT-5, as mentioned in the abstract) ?


Yannick Kilcher ▷ #ml-news (28 messages🔥):

Deepseek vs OpenAI, GPT Sparsity, Samsung Shifts Focus


Eleuther ▷ #general (1 messages):

Dynamic Concepts via Symbolic Layer, Local LLM setup with Ollama and vLLM


Eleuther ▷ #research (32 messages🔥):

Llama3 architecture, High curvature regions, Error propagation, Diffusion transformer, Classifier-free


Eleuther ▷ #interpretability-general (11 messages🔥):

Interpretability Framework Licensing, Apple's Superweight paper, OLMo-1B Model, Orthogonal Repair, Hydra Effect


Eleuther ▷ #lm-thunderdome (1 messages):

Hugging Face Processor, gemma3-12b


GPU MODE ▷ #triton-gluon (1 messages):

FP8 Speedups, RTX 30-series/Ampere, Feather Library, Triton Kernels, GEMMs Scaling


GPU MODE ▷ #cuda (11 messages🔥):

CUDA Programming Guide, Register Usage, Local Memory Spilling


GPU MODE ▷ #cool-links (1 messages):

NVIDIA Hopper Architecture


GPU MODE ▷ #jobs (1 messages):

Red Hat AI, Software Engineers, Hiring in 2026, Golang, Rust


GPU MODE ▷ #beginner (2 messages):

cuDF and cuML Optimization, GPU Job Market in East Africa, Remote Work Location Restrictions


GPU MODE ▷ #submissions (9 messages🔥):

NVIDIA personal best, NVIDIA successful


GPU MODE ▷ #helion (4 messages):

Random Number Generation, Helion Issues, PR Fix


GPU MODE ▷ #nvidia-competition (8 messages🔥):

Extension Build Errors, Cutlass Path, Submission Timeout


GPU MODE ▷ #robotics-vla (6 messages):

URDF for TRLC-DK1 arm, Bimanual Robot


MCP Contributors (Official) ▷ #mcp-dev-summit (1 messages):

MCP Dev Summit NA 2026, CFP Opening, Talk Submissions


MCP Contributors (Official) ▷ #general (25 messages🔥):

Prompt Data Types, GetPromptResult, MCP Server, Marking Tools as Dangerous


Modular (Mojo 🔥) ▷ #general (18 messages🔥):

Modular Meetup, MAX and Mojo 1.0, Windows support, Community-driven projects, libmojo


Modular (Mojo 🔥) ▷ #announcements (1 messages):

Modular Meetup, MAX framework


Moonshot AI (Kimi K-2) ▷ #general-chat (14 messages🔥):

Claude 4.5 Thinking in Chinese, Kimi AI vs Mistral, Zoom is Frontier Lab Now?, Kimi NB Pro Slides Limited Use, Multiple Accounts Violation


aider (Paul Gauthier) ▷ #general (7 messages):

Aider Prompt Caching, DeepSeek Prompt Caching, Aider Server optimization


DSPy ▷ #show-and-tell (1 messages):

ReasoningLayer AI, Neurosymbolic AI, DSPY GEPA, Ontology Ingestion


DSPy ▷ #general (4 messages):

BAML, DSPy, BAMAdapter


Manus.im Discord ▷ #general (5 messages):

Manus outage, Manus new features since Feb 2023


Windsurf ▷ #announcements (1 messages):

GPT-5.2, Windsurf, Agentic Coding, SOTA coding model