
not much happened today

**Alibaba** unveiled the **Qwen3** model family including **Qwen3-Max** and **Qwen3-VL** with a native 256K context window expandable to 1M, strong OCR in 32 languages, and rapid release velocity (~3.5 releases/month) backed by a $52B infrastructure roadmap. **OpenAI** launched **GPT-5 Codex**, an agent-optimized coding model with up to **400K context** and adaptive reasoning priced at $1.25/$10 per million tokens, integrated into Cline and benchmarked in WebDev arenas. **Meta AI FAIR** released the open-weight **Code World Model (CWM) 32B**, a dense code generation model with strong benchmark scores (e.g., 65.8% SWE-bench Verified, 96.6% Math-500) and public safety reports. Ecosystem updates include GitHub Copilot's new embedding model for faster code search and Anthropic's Claude Sonnet 4 and Opus 4.1 integration into Microsoft 365 Copilot. The vLLM 0.10.2 update introduces Decode Context Parallel (DCP) for improved system performance.
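As a rough illustration of what the quoted GPT-5 Codex pricing means per request, here is a minimal sketch. It assumes the "$1.25/$10 per million tokens" figure follows the usual input/output convention (the summary does not say so explicitly); the function name and the example token counts are hypothetical.

```python
# Assumed rates, read from the summary above as input/output USD per one million tokens.
INPUT_USD_PER_MILLION = 1.25
OUTPUT_USD_PER_MILLION = 10.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request under the assumed rates."""
    total = input_tokens * INPUT_USD_PER_MILLION + output_tokens * OUTPUT_USD_PER_MILLION
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: a 300K-token prompt and a 20K-token response.
    cost = estimate_cost_usd(300_000, 20_000)
    print(f"${cost:.3f}")
```

Under these assumptions a long-context request is dominated by input tokens only when the prompt is more than eight times the length of the response, since the assumed output rate is eight times the input rate.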


a quiet day

AI News for 9/24/2025-9/25/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (194 channels and 2885 messages) for you. Estimated reading time saved (at 200wpm): 230 minutes. Our new website is now up, with full metadata search and a beautiful vibe-coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

You can catch Day 2 of AIE Paris here, where tickets for AIE Europe 2026 were announced. You should also apply for Wave 2 of AIE CODE in NYC in November — it'll be a big one.


AI Twitter Recap

Alibaba’s Qwen3 push: Max, VL, Coder and a $52B roadmap

Coding models and agents: GPT-5 Codex lands; Meta’s 32B CWM

Systems and infra: vLLM DCP, multimodal data plumbing, and platform moves

Video and multimodal generation: Alibaba Wan2.5, Runway A2D, NVIDIA Lyra, Kling 2.5

Reasoning, RL, and evaluation science

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. MiniModel-200M and DeepSeek-V3.1-Terminus Local Release Benchmarks

2. DIY Local AI Hardware: RTX 3080 20GB Mods and Ryzen AI MAX+ 395

3. LLM Performance Growth Claims and Hype Reactions

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Qwen Image Edit 2509 Release Benchmarks and Workflows

2. AI in Games: Among Us Deception Benchmark and Veo-3 Game Video

3. ChatGPT Photo Editing and AI Cultural Satire Projects


AI Discord Recap

A summary of summaries of summaries, by gpt-5

1. MCP Tooling for Agentic Browsers and IDEs

2. Gemini Live and the Model Bake‑Offs

3. GPU Kernels and Consistency: Hopper TMA to PTX Proofs

4. Modular’s Mega Round and Mojo’s Metal Move

5. Prompting, Evaluation, and VLM Studies


Discord: High level Discord summaries

OpenRouter Discord


Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


Cursor Community Discord


OpenAI Discord


HuggingFace Discord


GPU MODE Discord


LM Studio Discord


Eleuther Discord


Moonshot AI (Kimi K-2) Discord


Latent Space Discord


MCP Contributors (Official) Discord


Nous Research AI Discord


aider (Paul Gauthier) Discord


Yannic Kilcher Discord


Manus.im Discord


Modular (Mojo 🔥) Discord


tinygrad (George Hotz) Discord


DSPy Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.




Discord: Detailed by-Channel summaries and links

OpenRouter ▷ #announcements (1 message):

Qwen Pricing Incident, Automatic Refunds, Validation Checks


OpenRouter ▷ #general (709 messages🔥🔥🔥):

Qwen3 VL ratelimits, Deepseek alternatives, Janitor AI vs SillyTavern, OpenRouter API key as proxy, GPT-5 features


OpenRouter ▷ #discussion (78 messages🔥🔥):

Encoder LLMs, Token embeddings, MLP blocks, Residual stream, Attention mechanism


Unsloth AI (Daniel Han) ▷ #general (89 messages🔥🔥):

Off Policy GRPO, Qwen3-VL-235B-A22B-Thinking GGUF, Unsloth'd models and AI safety, P100 for training


Unsloth AI (Daniel Han) ▷ #off-topic (293 messages🔥🔥):

Strix Halo, Evaluation set, Training loss, 5090 GPU, Gemini 2.5 Pro


Unsloth AI (Daniel Han) ▷ #help (39 messages🔥):

Company hardware access for vision project, Fine-tuning model recommendations, Qwen2.5-VL fine-tuning for domain-specific knowledge, Gemma 3N notebook error, Distillation usage


Unsloth AI (Daniel Han) ▷ #showcase (1 message):

ChatGPT Instagram analysis, Competitor Comparison, Reel analysis


Perplexity AI ▷ #general (272 messages🔥🔥):

Comet Browser, GPT-5 testing, Novel Crafter, Perplexity Max, Qwen3 Max


Perplexity AI ▷ #sharing (6 messages):

Portkey AI, Apollo 16, Artemis 2, Carl Sagan, 3i/Atlas


Perplexity AI ▷ #pplx-api (3 messages):

Solution being found


Cursor Community ▷ #general (264 messages🔥🔥):

Gemini vs Sonnet, GPT-5-Codex Bugs, GLM 4.5 on Cursor, Windsurf free models, MCP (Model Context Protocol) Servers


OpenAI ▷ #ai-discussions (49 messages🔥):

GPT-5 Mini, Kimi AGI, 4o vs GPT-5, Markov Chain, GPT-OSS-20B


OpenAI ▷ #gpt-4-discussions (1 message):

ChatGPT Agent Mode, ChatGPT Companion Mode, Mode-Locking, Mode-Switching, Tracking KPIs


OpenAI ▷ #prompt-engineering (28 messages🔥):

Chain of Thought (CoT) Prompting, Deep Research for Prompting, Translation Prompting Strategies, Interactive Prompting Infographics


OpenAI ▷ #api-discussions (28 messages🔥):

Chain of Thought Prompting, Model Performance, Prompt Engineering, Interactive Infographic for CoT


HuggingFace ▷ #general (95 messages🔥🔥):

Huggingface cache deletion, MariaDB hackathon, Language learning apps, Qwen model reasoning, LinkedIn content


HuggingFace ▷ #today-im-learning (1 message):

GPU, monitor, drivers, Linux, Windows


HuggingFace ▷ #cool-finds (2 messages):

trade-bench.live, UIUC students finance work


HuggingFace ▷ #smol-course (6 messages):

OOM Error on 3090, PEFT runs successful locally, SFTTrainer writes fine tuned model


HuggingFace ▷ #agents-course (2 messages):

Global Greetings, Course Kickoff


GPU MODE ▷ #general (2 messages):

Hopper TMA, Minimal Matmul Kernel, CWM paper from FAIR


GPU MODE ▷ #cuda (11 messages🔥):

cuda headers, smem bank conflicts, cudaGraphicsGLRegisterImage and tex2d are undefined, TMA matmul kernel


GPU MODE ▷ #torch (3 messages):

torchrun API, torchrun --help


GPU MODE ▷ #cool-links (10 messages🔥):

CUDA and Triton, PTX Memory Consistency Model, Compound Memory Models, GPU Consistency Analysis, Dat3M Verification Tool


GPU MODE ▷ #beginner (2 messages):

Inter-warp operations, Intra-warp operations, Independent thread scheduling, NVIDIA GPUs, CTA clusters


GPU MODE ▷ #triton-puzzles (1 message):

Puzzle difficulty, Puzzle completion time


GPU MODE ▷ #self-promotion (1 message):

LLM serving, Embeddings Pricing, Kernel Profiling


GPU MODE ▷ #🍿 (5 messages):

Code Generation, Two-Stage Approach, Model Performance


GPU MODE ▷ #thunderkittens (5 messages):

H100 matmul kernel runtime error, nvshmem usage in paper 2


GPU MODE ▷ #submissions (17 messages🔥):

MI300x8, amd-all2all leaderboard, amd-gemm-rs leaderboard


GPU MODE ▷ #hardware (4 messages):

Voltage Park H100s Donation, Nebius Exclusive Sponsorship


GPU MODE ▷ #factorio-learning-env (2 messages):

FLE Eval System Prompt, Image Analysis PR


GPU MODE ▷ #amd-competition (4 messages):

GEMM-RS atomic writes optimization with Iris, Iris shared memory initialization, GEMM-RS bias handling


GPU MODE ▷ #cutlass (3 messages):

Refinement hierarchy, TmemAllocator vs cute.arch.alloc_tmem


GPU MODE ▷ #mojo (1 message):

Mojo Metal GPU target, Custom bitcode writer


GPU MODE ▷ #singularity-systems (6 messages):

Picograd's tensor and engine, Eager and lazy execution policies, Tinygrad's architecture, Graph compiler, Shipping incremental intermediaries


GPU MODE ▷ #cluster-management (2 messages):

Singularity, Apptainer, Sylabs, Linux Foundation


LM Studio ▷ #general (37 messages🔥):

Seed-OSS thinking budget, Conversation.json to markdown, LM Studio Plugins in Linux, Ollama Fine Tuning, LoRA injection into models


LM Studio ▷ #hardware-discussion (13 messages🔥):

Budget GPUs, Tesla K80s


Eleuther ▷ #general (3 messages):

Measuring AI Dialogue Coherence and Novelty, NYC Meetup in Central Park


Eleuther ▷ #research (12 messages🔥):

Deep Ignorance Generalization Difficulty, Mathematical Formalism for Knowledge Completion, CFG on Style Transfer, Data Centric Approaches to ML/AI


Eleuther ▷ #lm-thunderdome (3 messages):

GSM8k Benchmark, flexible-extract, strict-match


Eleuther ▷ #multimodal-general (1 message):

Benchmarking Prompting Methods in VLMs, Interpretability Studies on VLMs, Ineffectiveness of LLM Prompting Techniques for VLMs, Mech-Interpretability Probing Study for VLMs


Moonshot AI (Kimi K-2) ▷ #general-chat (15 messages🔥):

Kimi doesn't encourage delusions, Mini version of Kimi, Qwen model distilled on K2


Latent Space ▷ #ai-general-chat (11 messages🔥):

Gemini Live Model, Chrome DevTools MCP, AI Coding Agents


MCP Contributors (Official) ▷ #general (1 message):

glassbeadaleph: i think so, give me one second


MCP Contributors (Official) ▷ #general-wg (9 messages🔥):

Embedded Resources title vs name, Claude Code, ReadResourceResult contents array


Nous Research AI ▷ #general (8 messages🔥):

Anthropic Misuse Report, Cybercrime and AI, AI-fabricated credentials


aider (Paul Gauthier) ▷ #questions-and-tips (6 messages):

Aider's /clear command, Aider access to Internet search


Yannic Kilcher ▷ #general (3 messages):

Saturday evening talks, Reading papers before talks


Yannic Kilcher ▷ #paper-discussion (2 messages):

Hyperparameters for Diffusion Models, ODE Solvers vs DPM++2m, Applications of Fast Inference, Diffusion Efficiency Research


Yannic Kilcher ▷ #ml-news (1 message):

.neoneye: https://x.com/Alibaba_Qwen/status/1970599323013652705


Manus.im Discord ▷ #general (5 messages):

Manus PDF download issues, Beta Pro Access


Modular (Mojo 🔥) ▷ #general (3 messages):

Modular contributor, Contributing to Mojo


Modular (Mojo 🔥) ▷ #announcements (1 message):

New Fundraising, Unified compute layer


tinygrad (George Hotz) ▷ #general (3 messages):

clspv build errors, Python bindings for clspv


DSPy ▷ #general (1 message):

DSPy attachments, UV Tooling