Frozen AI News archive

Anthropic Claude Sonnet 4.5, Claude Code 2.0, new VS Code Extensions

**Anthropic** launched a major update with **Claude Sonnet 4.5**, achieving **77.2% SWE-Bench** verified performance and improvements in finance, law, and STEM. They also released **Claude Code v2** featuring checkpoints, a refreshed terminal, and a native VS Code extension, plus a new mascot **Clawd**. The **Claude API** gained context editing and memory tools, and the **Claude Agent SDK** was introduced. The **Claude.ai** apps now support code execution and file creation, with a **Chrome extension** available for Max users. Additionally, **Imagine with Claude** offers a generative UI research preview. Reception has been positive from developers and third-party evaluators. Meanwhile, **DeepSeek** released **V3.2-Exp** with a new **Sparse Attention** algorithm, significantly reducing long-context costs and cutting API prices by over 50%, while maintaining quality.

Canonical issue URL

Claude is all you need.

AI News for 9/26/2025-9/29/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (196 channels, and 15992 messages) for you. Estimated reading time saved (at 200wpm): 1286 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Special mentions go out to John Schulman's Thinking Machines blogpost on LoRA and OpenAI launching Instant Checkout in ChatGPT and Agentic Commerce Protocol with Stripe and DeepSeek announcing big price cuts for V3.2 with a new Sparse Attention algorithm who will be overlooked because...

Anthropic chose today to drop an entire week's worth of launches on one single day:

Reception has been roundly positive, with folks like Cognition Devin and Sourcegraph Amp adopting as default model and third party evals like Box and SWE-Agent approving.

You can now also check out Mike Krieger's chat on Latent Space about all the big day:


AI Twitter Recap

DeepSeek V3.2-Exp: Sparse Attention, price cuts, and open kernels

Anthropic’s Claude Sonnet 4.5: coding/agent leap and first interpretability audit in a system card

RL for LLMs: GRPO vs PPO vs REINFORCE, and LoRA matches full FT in many settings

Agentic commerce and platform updates

Infra, kernels, and other releases

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. China AI Model Launches: Alibaba Qwen Scaling Roadmap and Tencent Hunyuan Image 3.0

2. Fenghua No.3 GPU API Support and Post-abliteration Uncensored LLM Finetuning

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Anthropic Claude Sonnet 4.5 Launch, Features, and Benchmarks

2. OpenAI/ChatGPT Ads, Forced Model Changes, and Community Backlash

3. Prompt Engineering Frameworks and AI Computer-Use Safety


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. DeepSeek V3.2-Exp: Sparse Attention & Reasoning Controls

2. Claude Sonnet 4.5: Long-Horizon Coding & App Integrations

3. Web‑Enabled Agents & Agentic Commerce

4. GPU Kernels, ROCm, and FP8 Training

5. RL Stability, Monitor‑RAG, and Mechanistic Steering


Discord: High level Discord summaries

LMArena Discord


LM Studio Discord


Unsloth AI (Daniel Han) Discord


OpenAI Discord


OpenRouter Discord


HuggingFace Discord


Cursor Community Discord


Moonshot AI (Kimi K-2) Discord


Yannick Kilcher Discord


Eleuther Discord


GPU MODE Discord


Nous Research AI Discord


Latent Space Discord


Modular (Mojo 🔥) Discord


MCP Contributors (Official) Discord


Manus.im Discord Discord


DSPy Discord


aider (Paul Gauthier) Discord


tinygrad (George Hotz) Discord


Windsurf Discord


MLOps @Chipro Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (988 messages🔥🔥🔥):

Integral Calculation, Video Arena Evaluation, OpenAI Platform Changes, Model Merging, LMArena Popularity


LMArena ▷ #announcements (3 messages):

claude-sonnet-4-5, deepseek-v3.2-exp


LM Studio ▷ #general (401 messages🔥🔥):

DDR5 RAM Speed Impact, GPT-oss 120b, Model Preferences and Benchmarks, LM Studio and Offline Use, Character Emulation


LM Studio ▷ #hardware-discussion (730 messages🔥🔥🔥):

Blackwell, 4090 pricing, RAM amount, A3B architectures, LLM's limit


Unsloth AI (Daniel Han) ▷ #general (538 messages🔥🔥🔥):

IBM Granite 4, NVIDIA synthetic datasets, Qwen3 Next, OSS 20B fine-tuning on 5090, DeepSeek-V3.2


Unsloth AI (Daniel Han) ▷ #introduce-yourself (9 messages🔥):

New member introductions, AI project development, Finance automation


Unsloth AI (Daniel Han) ▷ #off-topic (101 messages🔥🔥):

Test Loss Spikes, Thinking Model for Coding Questions, Venv Alternatives, GPT-5 Release, GPU Recommendations


Unsloth AI (Daniel Han) ▷ #help (196 messages🔥🔥):

mmproj file for GGUFs, GRPO notebooks reflections, gpt-oss-20b memory issues, torch grouped gemm availability, Fine-tuning dataset format for Q&A


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

AWS Quant Process


Unsloth AI (Daniel Han) ▷ #research (21 messages🔥):

LLM-RL collapse, Tversky Layer, GSPO, data efficiency


OpenAI ▷ #annnouncements (2 messages):

ChatGPT parental controls, Instant Checkout in ChatGPT, Agentic Commerce Protocol, Etsy, Shopify


OpenAI ▷ #ai-discussions (637 messages🔥🔥🔥):

Comet Browser, Seedream Image Models, GPT-5 Coding Prowess, AI Emotional Bonding, 4o Personality Nerf


OpenAI ▷ #gpt-4-discussions (25 messages🔥):

Rerouting issues with OpenAI app, DALL-E brand, AI giving wrong answers, GPT knowing location, Web search tool for images


OpenAI ▷ #prompt-engineering (11 messages🔥):

Translator prompt code block effect, Prompts for AI failure, Model obedience, Automated scientific writing


OpenAI ▷ #api-discussions (11 messages🔥):

AI translation quality, Prompts for incorrect AI answers, Scientific writing automation, Fine tuning settings


OpenRouter ▷ #announcements (4 messages):

DeepSeek V3.2 Exp, DeepSeek Sparse Attention (DSA), Auto Router, Claude Sonnet 4.5, Google AI APIs


OpenRouter ▷ #app-showcase (4 messages):

AI Model Release Tracker, Browser Compatibility Issues


OpenRouter ▷ #general (810 messages🔥🔥🔥):

Grok-4-fast API issues, Rate limit issues, Data retention policies, Gemini models for translation, Model naming conventions


OpenRouter ▷ #new-models (3 messages):

``


OpenRouter ▷ #discussion (31 messages🔥):

Grok-4-Fast Rate Limits, OpenRouter API keys security, XAI Native Web Search Tool, Gemini glitches, Google new logo


HuggingFace ▷ #general (710 messages🔥🔥🔥):

Intel GPU, Qwen models, Fake USDT scams, HuggingFace pro billing issues, LLMs for video games


HuggingFace ▷ #today-im-learning (3 messages):

Linux apps installation, Gaming on Linux, Windows user switches to Linux


HuggingFace ▷ #cool-finds (8 messages🔥):

Liquid AI Collection, SLMs for Robots, Open Source GPT-5, Vintage iPod Classic, Conversational Transformer in Video Game


HuggingFace ▷ #i-made-this (24 messages🔥):

HuggingFace dataset downloads, AI Agents with Metacognition, Crusty PC image generation, mytqdm online progress tracker, Paracord crossbody bag


HuggingFace ▷ #reading-group (2 messages):

Efficient Training Techniques, Challenges in Long Context Training


HuggingFace ▷ #computer-vision (1 messages):

SLAM, monocular camera, Python


HuggingFace ▷ #smol-course (56 messages🔥🔥):

SmolLM3-3B chat template bug, Tool calling with SmolLM3-3B, Role conversion in chat template, Understanding evals in the course, Eval job timeout in section 2


HuggingFace ▷ #agents-course (8 messages🔥):

HF Agents Course, Introductions


Cursor Community ▷ #general (607 messages🔥🔥🔥):

Terminal Commands Hanging, GPTs Agents Training, New Models Discussion, Cursor performance issues


Cursor Community ▷ #background-agents (2 messages):

DevContainers configurations, Background agents and images


Moonshot AI (Kimi K-2) ▷ #general-chat (515 messages🔥🔥🔥):

Kimi K2 Performance, Chinese LLM Frontier, Model Preferences, DeepSeek for Coding, Kimi Base Model


Yannick Kilcher ▷ #general (354 messages🔥🔥):

Transformer Models and Continued Learning, AI Reproducibility and Verifiability, LLMs Training with RL, Human vs. Machine Inductive Bias, Evolutionary Methods for AGI


Yannick Kilcher ▷ #paper-discussion (28 messages🔥):

Sycophancy with AI, LessWrong Post, DeepSeek V3.2, LatentCoT-Horizon GitHub Repo


Yannick Kilcher ▷ #ml-news (4 messages):

Uber App Interception, DeepSeek AI, Anthropic Claude Sonnet 4.5


Eleuther ▷ #general (77 messages🔥🔥):

Bayesian Optimization for Learning Rates, Layer-wise Weight Decay, Yarn paper authorship, Vision Language Action Models (VLAs), Adversarial examples


Eleuther ▷ #research (282 messages🔥🔥):

Information Geometry and DNNs, Quantization, Expert Routing, Lie Groups and Homogeneous Spaces, Mode Connectivity


Eleuther ▷ #scaling-laws (3 messages):

Asymptotic Performance Research, Optimal Granularity Research, Static Router Choice, Grouped Topk, PEER


Eleuther ▷ #interpretability-general (1 messages):

SAEs, steering, dynamic low rank updates, preference optimization, RLHF


Eleuther ▷ #lm-thunderdome (4 messages):

lm-harness, GitHub PR


Eleuther ▷ #gpt-neox-dev (3 messages):

Rotary Percentage Impact, RoPE Speed, VRAM Savings with rotary_pct


GPU MODE ▷ #general (12 messages🔥):

Semi sync training delayed, Code rewrite makes problem tractable, FlashAttention 4


GPU MODE ▷ #triton (4 messages):

High order derivatives in PyTorch, Energy based transformer, Flash attention limitations, jvp_flash_attention, Block based Quant/Dequant Triton implementation


GPU MODE ▷ #cuda (20 messages🔥):

sm_120, tcgen05, Jetson T5000, cudaMallocManaged Overhead, Chips and Cheese


GPU MODE ▷ #torch (1 messages):

Saving weight-tied models, Safetensors, Torch compiled models


GPU MODE ▷ #cool-links (5 messages):

DeepSeek-V3.2-Exp, NVIDIA GPUs, matmul kernels, warp-tiling


GPU MODE ▷ #beginner (8 messages🔥):

CS336 Language Modeling, GPU Optimization Techniques, Practical GPU Programming Resources, CUDA Handbook vs PTX ISA


GPU MODE ▷ #torchao (1 messages):

int4 matmul, tensor cores


GPU MODE ▷ #off-topic (2 messages):

FA4, Clean-room implementation


GPU MODE ▷ #rocm (67 messages🔥🔥):

TheRock Nightlies for ROCm, Framework Desktop for PyTorch Dev, FP8 Conversion in ROCm, HIP Cache Modifiers, fp16 & float conversions in ROCm


GPU MODE ▷ #self-promotion (8 messages🔥):

TPU Top-K speed, CuTe Layouts Categorical Foundations, Make Diffusion Great Again (MDGA), DLM Scaling


GPU MODE ▷ #🍿 (1 messages):

Formal Grammars, Model Capabilities


GPU MODE ▷ #gpu模式 (1 messages):

ML Prerequisites, CUDA basics


GPU MODE ▷ #submissions (37 messages🔥):

MI300x8, A100, amd-all2all, amd-gemm-rs, amd-ag-gemm


GPU MODE ▷ #status (4 messages):

Timeouts on H100, Timeouts on AMD GPUs, All-gather+gemm Problem, rocshmem PR Merged


GPU MODE ▷ #tpu (1 messages):

TPU, Pallas, Hardware Aware Kernel Design, Top-K Sampling


GPU MODE ▷ #factorio-learning-env (13 messages🔥):

Claude plays Factorio, PR #339 Ready, Sonnet 4.5 Released, MCP Server Verification


GPU MODE ▷ #amd-competition (6 messages):

rocshmem, devcloud, mi300x, AMD MORI, all2all HIP design


GPU MODE ▷ #cutlass (16 messages🔥):

TmemAllocator location, CuTe DSL cooperative copy, UMMA meaning, make_layout_tv complex layouts, int4 matmul tensor cores


GPU MODE ▷ #general (2 messages):

Mojo support on Python leaderboards, Mojo interop with Python


GPU MODE ▷ #multi-gpu (1 messages):

NCCL examples released


GPU MODE ▷ #low-bit-training (6 messages):

Quantizing Transformers, Phonetic Binary System, 8-bit LLM code


GPU MODE ▷ #irl-accel-hackathon (1 messages):

Hackathon application status


GPU MODE ▷ #cluster-management (3 messages):

Apptainer, ROCm, Nix


GPU MODE ▷ #llmq (30 messages🔥):

Fully-Sharded FP8 Training, CUDA Optimization, Memory Management


Nous Research AI ▷ #announcements (1 messages):

Psyche Model Training, Internet Bandwidth Training, Trainer Abstraction, HuggingFace, TorchTitan


Nous Research AI ▷ #general (217 messages🔥🔥):

RWKV Benchmarking, Latent Zoning Networks, DeepSeek Sparse Attention, Sonnet 4.5, RL Train and Distill RL Expert Train sets


Nous Research AI ▷ #research-papers (3 messages):

RL Collapse, Training Inference Mismatch, Speed Kills Stability, Azure Real


Nous Research AI ▷ #interesting-links (6 messages):

Vision Models as 'Thinkers', Manifold Muon Optimizer, AGI Discourse, LoRA Deep Dive


Nous Research AI ▷ #research-papers (3 messages):

RL Collapse, Training-Inference Mismatch


Latent Space ▷ #ai-general-chat (192 messages🔥🔥):

Anthropic Code Design, Fake ARR, OpenAI compute scale, Avi's AI-Friend App, AntLingAGI Ring-linear-2.0 LLMs


Latent Space ▷ #ai-announcements (4 messages):

Latent Space Podcast, Amp Code, Sourcegraph, AI Coding Agent


Latent Space ▷ #genmedia-creative-ai (25 messages🔥):

AI "Mind-Drugs", Veed Studio Fabric 1.0 API, Suno DAW, AI Actress Tilly Norward, AI Headshot Prompt


Modular (Mojo 🔥) ▷ #general (14 messages🔥):

GPU Puzzles on MacOS, Metal Toolchain, AMD dev cloud, TensorWave MI355X


Modular (Mojo 🔥) ▷ #mojo (179 messages🔥🔥):

C interop challenges, Mojo's approach to C interop, Variable destruction in Mojo, Lexical scoping in Mojo, Mojo readiness for data science


Modular (Mojo 🔥) ▷ #max (1 messages):

clattner: This is really amazing Gabriel!


MCP Contributors (Official) ▷ #mcp-dev-summit (6 messages):

Agnost AI, MCP Dev Summit, London Meetup, YouTube Live Stream


MCP Contributors (Official) ▷ #general (15 messages🔥):

Anthropic Trademark, ModelContextProtocol licensing, Independent org for MCP


MCP Contributors (Official) ▷ #general-wg (58 messages🔥🔥):

JFrog's TULIP protocol for tool verification, Security implications of MCP servers, Annotations vs verification, ResourceTemplates missing Icons metadata


Manus.im Discord ▷ #general (62 messages🔥🔥):

Unity game, Manus trial, Local project, GitHub integration, Claude Code vs. Manus design


DSPy ▷ #papers (7 messages):

Monitor-based RAG, Eigen-1, Zero-entropy


DSPy ▷ #general (46 messages🔥):

ProgramOfThought vs AlgorithmOfThought, DSPy + Langgraph Integration, Prompt Compiler for MD Files, Caching Aware DSPy Adapter


aider (Paul Gauthier) ▷ #general (13 messages🔥):

GPT-5 vs GPT-4.1, Aider-CE navigator mode, aiderx model, DeepSeek v3.1


aider (Paul Gauthier) ▷ #questions-and-tips (9 messages🔥):

Aider task/todo management, Commit only staged files


tinygrad (George Hotz) ▷ #general (14 messages🔥):

ROCM vs NVIDIA, hashcat performance, tinybox performance, Genoa CPU for hashing, tinygrad meeting 90


Windsurf ▷ #announcements (2 messages):

code-supernova-1-million, Claude Sonnet 4.5, Windsurf credits