Frozen AI News archive

not much happened today

**Cognition** raised **$400M** at a **$10.2B** valuation to advance AI coding agents, with **swyx** joining to support the "Decade of Agents" thesis. **Vercel** launched an OSS "vibe coding platform" using a tuned **GPT-5** agent loop. **Claude Code** emphasizes minimalism in agent loops for reliability. **Kimi K2-0905** achieved 94% on coding evals and improved agentic capabilities with doubled context length. **Alibaba** released **Qwen3-ASR**, a multilingual transcription model with <8% WER. **Meta** introduced Set Block Decoding for 3-5× faster decoding without architectural changes. Innovations in KV cache compression and quantization include **AutoRound**, **QuTLASS v0.1.0**, and **AlgoPerf v0.6**. **Google's Veo 3** video generation API went GA with significant price cuts and vertical video support.

Canonical issue URL

a quiet day

AI News for 9/8/2025-9/9/2025. We checked 12 subreddits, 544 Twitters and 22 Discords (187 channels, and 4104 messages) for you. Estimated reading time saved (at 200wpm): 337 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Apple iPhone event offered some small updates.


AI Twitter Recap

Coding Agents and Tooling Momentum

Model and Inference Advances

Multimodal Generation, Video, and “Vibe Coding”

Agents, Post-Training RL, and Evaluation Practice

Robotics and Embodied AI

Benchmarks, Leaderboards, and Enterprise

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. A3B HF Releases: Qwen3-Next-80B-Instruct & ERNIE-4.5-21B-Thinking

2. Open-Source SOTA Challengers (PyDevMini-1, ROMA Seal-0/FRAMES, Apertus)

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Anthropic Claude Degradation Incident and Churn Discussions

2. Recent Model and Feature Releases (Seedream 4, HunyuanImage-2.1, Claude File Creation, ChatGPT Voice Mode)

3. OpenAI GPT-5 vs 4o Conversation Quality and Community Backlash


AI Discord Recap

A summary of Summaries of Summaries by X.ai Grok-4

Theme 1. Model Mayhem: Speed, Smarts, and Slip-Ups

Theme 2. Hardware Hustle: GPUs, Offloads, and Homebrew Hacks

Theme 3. Tooling Turmoil: Bugs, Fixes, and Feature Fiascos

Theme 4. Education Explosion: Courses, Newsletters, and Agent Adventures

Theme 5. Business Buzz: Deals, Launches, and Funding Frenzy


Discord: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


LM Studio Discord


Cursor Community Discord


OpenRouter Discord


GPU MODE Discord


OpenAI Discord


DSPy Discord


Nous Research AI Discord


HuggingFace Discord


Latent Space Discord


Moonshot AI (Kimi K-2) Discord


Yannick Kilcher Discord


aider (Paul Gauthier) Discord


Manus.im Discord Discord


Eleuther Discord


Modular (Mojo 🔥) Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1197 messages🔥🔥🔥):

Comet Browser, Gemini 2.5 Heavy, Apple launch, Kimi Model, AI Video Generation limits


Perplexity AI ▷ #sharing (2 messages):

Shareable threads, Apple event summary


Perplexity AI ▷ #pplx-api (1 messages):

lordof_the_flies: <@1357424961249349632>


Unsloth AI (Daniel Han) ▷ #general (484 messages🔥🔥🔥):

RP for LLMs, R-4B Model Evaluation, Hermes Model Series, GPT-4.5 Analysis, Quantization Tradeoffs


Unsloth AI (Daniel Han) ▷ #introduce-yourself (2 messages):

Introduce Yourself Discussions, Discord Channel Greetings


Unsloth AI (Daniel Han) ▷ #off-topic (209 messages🔥🔥):

2.5 Pro vs 2.5 Flash, GPT-5 frankenmerge, Runpod downtime, Whisper Transcription, Digital Nomad Life


Unsloth AI (Daniel Han) ▷ #help (92 messages🔥🔥):

HF Model Upload Issues, Vision Models Supported by Unsloth, Flash Attention Errors, GGUF Conversion


Unsloth AI (Daniel Han) ▷ #showcase (8 messages🔥):

Multilingual Dataset Builder, GPT-5 Performance, OpenAI Overreactions


Unsloth AI (Daniel Han) ▷ #research (16 messages🔥):

RSLoRA vs OLoRA or ABBA, Audio research on vocal clarity, Frequency analysis of voice, OpenMule Marketplace


LMArena ▷ #general (698 messages🔥🔥🔥):

Reasoning content from models, Picture generation overlaps, GPT5-high Recognition, LM Arena subscription and limits, Gemini models for manipulation


LMArena ▷ #announcements (2 messages):

Multi-Turn Image Editing, Video Arena Rate Limit


LM Studio ▷ #general (72 messages🔥🔥):

GPU vanishing issue, LM Studio conversation save location, Discord server outages, Gemma vision support, LM Studio outbound traffic concerns


LM Studio ▷ #hardware-discussion (158 messages🔥🔥):

LM Studio install location, AI Workstation Build, Multi-socket performance, GPU offloading, AMD MI50 setup


Cursor Community ▷ #general (200 messages🔥🔥):

Remote SSH extension broken, Student discount issues, Cursor plan change and refund, Terminal hanging issues, Student status verification


OpenRouter ▷ #app-showcase (3 messages):

Interfaze LLM, Design Arena


OpenRouter ▷ #general (152 messages🔥🔥):

Model hosting on OpenRouter, Gemini 1.5 Flash Access, OpenAI's Response API support, Untraceable usage, Token Drop Issue with Deepseek V3


OpenRouter ▷ #new-models (1 messages):

Readybot.io: OpenRouter - New Models


OpenRouter ▷ #discussion (25 messages🔥):

Qwen ASR Model Integration, TTS and STT Unification, Gemini's Thought Signatures, Nvidia Nemotron Nano 9B V2 Pricing, Agentic Tool Calling Models


GPU MODE ▷ #general (11 messages🔥):

Triton vs New DSLs, Jane Street Hackathon Overhears, Interesting Projects


GPU MODE ▷ #cuda (3 messages):

L1 Cache Loading, Memory Bank Conflicts, Constant Cache vs L1/L2 Cache


GPU MODE ▷ #torch (10 messages🔥):

PyTorch Blas documentation, Dynamic Shape Compilation in PyTorch, PyTorch Conference Discount


GPU MODE ▷ #pmpp-book (2 messages):

ScienceDirect Preface


GPU MODE ▷ #off-topic (2 messages):

Homebrew GPUs, Jeri Ellsworth, Sam Zeloof, Home Microchip Manufacturing


GPU MODE ▷ #irl-meetup (4 messages):

Registration approved emails, Registration awaiting approval


GPU MODE ▷ #rocm (1 messages):

mpi4py Removal, ROCm Setup Feedback


GPU MODE ▷ #self-promotion (2 messages):

CuTeDSL Tensors, Tensor Slicing, r/LocalLlama AMA


GPU MODE ▷ #submissions (31 messages🔥):

MI300x8 submissions, amd-all2all leaderboard, leaderboard submit command, Cluster-Bot help command


GPU MODE ▷ #ppc (1 messages):

verspasian: <#1198358627594023014>


GPU MODE ▷ #factorio-learning-env (59 messages🔥🔥):

Factorio fle evalerrors,open_world scenario compatibility, Docker container command failures, Headless server errors, Desync issues


GPU MODE ▷ #amd-competition (20 messages🔥):

Team Registrations, Leaderboard Time Values, RT11's Performance Edge, MoE Latency, HIPRTC Support in PyTorch


GPU MODE ▷ #singularity-systems (7 messages):

MLSys Education, Karpathy's Zero to Hero, Percy Liang's Language Modeling, Autograd Leaderboard, MiniPT2, MiniCUDA, MiniTriton


GPU MODE ▷ #general (8 messages🔥):

PMPP Benchmarking, GPU Streams, GPU Events, Reference Kernels


GPU MODE ▷ #multi-gpu (6 messages):

FP4 in NCCL, Distributed compute with FP4, Hardware native FP4 vs Software abstraction MXFP4, NCCL FP4 support in 2.28


GPU MODE ▷ #low-bit-training (2 messages):

``


GPU MODE ▷ #jane-street-hackathon (2 messages):

Hackathon Submission, kyolebu


OpenAI ▷ #annnouncements (1 messages):

Advanced Voice Mode, Standard Voice Mode


OpenAI ▷ #ai-discussions (104 messages🔥🔥):

Extracting data from Excel to JSON, OpenAI Job Platform beta group, MCP (Model Context Protocol) in LM Studio, MCP for Enterprise, Google Gemini's deep research and AI existential crisis


OpenAI ▷ #gpt-4-discussions (9 messages🔥):

GPT Freezing, GPT-4.1 Hallucinations, GPT Signing


OpenAI ▷ #prompt-engineering (4 messages):

Role-Based Chatbot System, Response Mode Control, System Prompt Engineering


OpenAI ▷ #api-discussions (4 messages):

Chatbot Response Modes, LLM Summarization, Flask + Supabase Chatbot


DSPy ▷ #show-and-tell (3 messages):

DSPy Weekly Newsletter, AI Agents Play Taboo, LangGraph & DSPy Course


DSPy ▷ #general (82 messages🔥🔥):

Open Source Forum vs Discord, DSPy Usage Tracking, Databricks Fine-Tuning, DSPy Documentation Contributions, Streaming usecase for DSPy with arrays of complex objects


Nous Research AI ▷ #general (84 messages🔥🔥):

Hermes Speed, Discord Outage, Alterego device, Grok model uncensored, llama.cpp Kernels


HuggingFace ▷ #general (46 messages🔥):

Multi-agent systems, Model Learning automation, Moderation using vector DB, Telegram chat analysis, AI image generation workflow


HuggingFace ▷ #i-made-this (4 messages):

Loggenix-MoE-0.3B, SRE/DevOps tasks, Model training costs, NextJS


HuggingFace ▷ #smol-course (13 messages🔥):

Smol Course Registration, Smol Course Updates, Smol Course Duration, Smol Course Content, Smol Course Certificate


HuggingFace ▷ #agents-course (4 messages):

Agents course, Coding exercises, Space template


Latent Space ▷ #ai-general-chat (62 messages🔥🔥):

Anthropic Endorsing SB-53, Claude's Performance, Jake Paul Investing in AI, Mistral Funding, Qwen3-Next


Moonshot AI (Kimi K-2) ▷ #general-chat (60 messages🔥🔥):

EQ Bench accuracy, Kimi's deep reasoning, Model coding tradeoffs, Claude Code & Zai costs, LMArena voting bias


Yannick Kilcher ▷ #general (18 messages🔥):

Adapter Training, Local LLM UIs, DiT Efficiency


Yannick Kilcher ▷ #paper-discussion (1 messages):

``


Yannick Kilcher ▷ #agents (8 messages🔥):

Agent Setups, Pydantic AI


Yannick Kilcher ▷ #ml-news (5 messages):

Private LLMs, ASML Custom Model, Mistral Valuation, X Algorithm


aider (Paul Gauthier) ▷ #general (22 messages🔥):

Aider vs Codex Context Management, LLM prompting Length, AI coding Speed, SWE Bench, Roo/Cline vs Aider


aider (Paul Gauthier) ▷ #questions-and-tips (3 messages):

Gemini Errors, Changing Model API URL


Manus.im Discord ▷ #general (20 messages🔥):

Manus Spam, Manus website errors, Manus Free Credits, Manus Referral Credits


Eleuther ▷ #general (9 messages🔥):

Neel Interview, AI/ML Enthusiasts Introductions


Eleuther ▷ #research (4 messages):

6m Model, arxiv link


Eleuther ▷ #lm-thunderdome (1 messages):

LM Eval Harness Calibration Scores, RL for Calibration, LM Eval Harness PR, Critical Take on Calibration Scores


Modular (Mojo 🔥) ▷ #mojo (3 messages):

explicitcopies, moves, c binder, EmberJson


Modular (Mojo 🔥) ▷ #max (4 messages):

Mojo test suite duration, Custom ops compilation issues