Frozen AI News archive

not much happened today

**Alibaba** announced the release of **Qwen3-Coder-480B-A35B-Instruct**, an open agentic code model with **480B** parameters and **256K** context length, praised for rapid development and strong coding performance. Benchmark claims of **41.8% on ARC-AGI-1** faced skepticism from **Fran\0ois Chollet** and others due to reproducibility issues. The model quickly integrated into ecosystems like **vLLM**, **Dynamic GGUFs**, and **OpenRouterAI**. The **White House** unveiled a new **AI Action Plan** emphasizing **Innovation**, **Infrastructure**, and **International Diplomacy**, linking AI leadership to national security and prioritizing compute access for the **Department of Defense**. The plan sparked debate on open vs. closed-source AI, with calls from **Clement Delangue** to embrace open science to maintain US AI competitiveness.

Canonical issue URL

a quiet day

AI News for 7/22/2025-7/23/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (227 channels, and 9736 messages) for you. Estimated reading time saved (at 200wpm): 748 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

The White House announced their AI Action Plan, but we'll keep this newsletter technical. As commented yesterday, QwenCoder has had a largely positive reception but not hugely so that we'd make it a title story.


AI Twitter Recap

New Model Release: Qwen3-Coder

US AI Policy and Geopolitics

Model Updates, Research, and Techniques

AI Tooling, Frameworks, and Infrastructure

Companies, Ecosystem, and Broader Implications

Humor/Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Qwen3 and Qwen3-Coder Release Performance, Benchmarks, and User Experiences

2. Agentic Coding Model Face-offs: Kimi K2 vs Claude Sonnet 4

3. Governmental and Industry Initiatives for Open-Source AI and LLM Architectures

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Notable New Model, Agent, and Benchmark Launches (July 2025)

2. Anthropic's Discovery of Trait Transmission and Hidden Signals in Language Models

3. Impact of AI on Employment, Global Policy, and Societal Change


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. Cutting-Edge Models Push Coding Boundaries

Theme 2. AI Agents: From Promises to Production Pains

Theme 3. LLM Practicality and User Experience Woes

Theme 4. Infrastructure & Optimization for AI Performance

Theme 5. Advancing AI Through Data & Interpretability


Discord: High level Discord summaries

Perplexity AI Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


OpenRouter (Alex Atallah) Discord


Cursor Community Discord


Latent Space Discord


Eleuther Discord


Nous Research AI Discord


HuggingFace Discord


GPU MODE Discord


aider (Paul Gauthier) Discord


Notebook LM Discord


Modular (Mojo 🔥) Discord


MCP (Glama) Discord


LlamaIndex Discord


Manus.im Discord Discord


DSPy Discord


MLOps @Chipro Discord


Cohere Discord


Torchtune Discord


LLM Agents (Berkeley MOOC) Discord


tinygrad (George Hotz) Discord


Nomic.ai (GPT4All) Discord


Codeium (Windsurf) Discord


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1216 messages🔥🔥🔥):

Perplexity Pro cost and value, Grok model identification, Using code editors with perplexity, Linus Tech Tips, LLMs and overthinking


Perplexity AI ▷ #sharing (4 messages):

Shareable Threads, Replit news, Cast Studies


OpenAI ▷ #annnouncements (1 messages):

ChatGPT agent rollout, EEA and Switzerland


OpenAI ▷ #ai-discussions (942 messages🔥🔥🔥):

ChatGPT vs other models speed, Models and creative writing, O3 Ultra, Arc AGI, RL irl


OpenAI ▷ #gpt-4-discussions (18 messages🔥):

GPT-4o Delay Issues, Most Popular MCPs, Personal Website Creation, ChatGPT model for reminders


OpenAI ▷ #prompt-engineering (1 messages):

Custom Instruct Modifications, Dialog Continuation Strategies


OpenAI ▷ #api-discussions (1 messages):

Custom Instruct Modification, Controlled English 2.0, Dialog Continuation


Unsloth AI (Daniel Han) ▷ #general (1053 messages🔥🔥🔥):

Qwen3-Coder-480B Model, Hyperbolic Hosting, DGX Station vs. RTX 6000, GANs for Text Augmentation, Unsloth Workshops


Unsloth AI (Daniel Han) ▷ #introduce-yourself (6 messages):

Minecraft AI Model, Open Source Morocco


Unsloth AI (Daniel Han) ▷ #off-topic (98 messages🔥🔥):

Music Haptics hijacking, iOS Apple Music, Vibration recording, Song-humming dataset, Apple Sandbox limitations


Unsloth AI (Daniel Han) ▷ #help (37 messages🔥):

NVMe performance issues with Unsloth, FastAPI deployment best practices, vLLM and SGLang for production inference, Merging LoRA weights back into the base model, Dynamic quantization vs. ik quants


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

RL Workshop


Unsloth AI (Daniel Han) ▷ #research (5 messages):

Fine-tuning Datasets with AI Agents, RULER code and LLM-lite, Thought Anchors for LLM Reasoning Analysis, Qwen3 vs DeepSeek-R1 Cognitive Styles, PTS library for reasoning patterns


Unsloth AI (Daniel Han) ▷ #unsloth-bot (159 messages🔥🔥):

Custom Loss Functions in GRPO Trainer, Unsloth Dynamic Quant 2.0, GRPO Training for Vision Models, SFTTrainer length truncation, Ollama Modelfile Configuration


LMArena ▷ #general (585 messages🔥🔥🔥):

Qwen3-coder's Verilog skills, Qwen3 vs other models, Grok 4 coder, Model Merging, Open Empathic


LMArena ▷ #announcements (1 messages):

Search Arena, Grok 4, Claude Opus 4, Sonar Pro High & Reasoning Pro High, o3


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

Qwen3-Coder, SWE-Bench Verified, 480B param Mixture-of-Experts


OpenRouter (Alex Atallah) ▷ #app-showcase (4 messages):

Openrouter, QwEn-3, automation deployment


OpenRouter (Alex Atallah) ▷ #general (534 messages🔥🔥🔥):

Qwen3 Coder, Kimi K2, Gemini Pro/Flash for Coding, Free vs. Paid LLMs, Claude's strange behavior


OpenRouter (Alex Atallah) ▷ #new-models (1 messages):

Readybot.io: OpenRouter - New Models


OpenRouter (Alex Atallah) ▷ #discussion (14 messages🔥):

Qwen Coder, Contextualized Evaluations, Chutes Models, Muting Thread Owners, xAI Colossus 2


Cursor Community ▷ #general (335 messages🔥🔥):

Qwen3-Coder integration, Cursor auto-commit issues, Cursor usage caps, Cursor terminal hanging issues, Gemini 2.5 Pro performance


Cursor Community ▷ #background-agents (4 messages):

Conversation length errors, Secrets debugging, Devcontainer configs for background agents, Background agent infinite loops


Latent Space ▷ #ai-general-chat (165 messages🔥🔥):

Agentic Benchmarks, Reka Funding, AI Action Plan, Claude Code as general agent, Qwen Benchmarks


Latent Space ▷ #ai-announcements (5 messages):

GEO / AI SEO podcast, nitter.net maintenance, AI Engineering podcast


Eleuther ▷ #general (13 messages🔥):

AlphaProof, International Math Olympiad, Creativity and Open Endedness, LLM behavior, emergent properties, and field-based interaction.


Eleuther ▷ #research (76 messages🔥🔥):

Kimi k2, AI peer pressure, single unit attribution to logits, clockwork RNNs, MoEs


Eleuther ▷ #scaling-laws (6 messages):

Spline Training, Diffusion Latency Reduction


Eleuther ▷ #interpretability-general (3 messages):

Sparse MoE, SAEs, FFN Layer, PEER


Eleuther ▷ #lm-thunderdome (10 messages🔥):

Global MMLU filters, Loglikelihood requests, Multiple Choice Problems


Eleuther ▷ #gpt-neox-dev (5 messages):

Amazon infra support, EFA, NCCL EFA plugin, SageMaker team


Nous Research AI ▷ #announcements (1 messages):

Psyche/DisTrO office hours


Nous Research AI ▷ #general (85 messages🔥🔥):

Open Source Agentic Platform: n8n, Deepseek API Issue, Kimi K2 vs DeepSeek R1, Nous Research Funding, Qwen Models


Nous Research AI ▷ #ask-about-llms (1 messages):

Hermes benchmarks, Text LLMs


Nous Research AI ▷ #interesting-links (1 messages):

terrachad_0x: https://x.com/ZeyuanAllenZhu/status/1918684257058197922?t=Z_vhpqsVx39pX4xkU07H2Q&s=19


HuggingFace ▷ #general (60 messages🔥🔥):

HF Spaces API issues, Account Lockouts on HF, Qwen Model Training Errors, Langchain with local LLMs, LLM Dataset Creation


HuggingFace ▷ #today-im-learning (1 messages):

Medical AI Imaging Future, Ethical use of AI in medicine


HuggingFace ▷ #cool-finds (1 messages):

Flux.1 Kontext Model, Watermark Removal


HuggingFace ▷ #i-made-this (5 messages):

LLM Reasoning Styles, Thought Anchors Technique, PTS Library, Image Models to Generate Text


HuggingFace ▷ #NLP (1 messages):

Local Vector DBs, ChromaDB


HuggingFace ▷ #agents-course (6 messages):

Gemini alternatives, Course start date, Skipping Agents Course sections


GPU MODE ▷ #general (2 messages):

Ginkgo SpMV kernel, Ginkgo framework


GPU MODE ▷ #triton (1 messages):

marksaroufim: https://github.com/compiler-explorer/compiler-explorer/pull/7919


GPU MODE ▷ #cuda (6 messages):

NCCL Performance at Scale, All-reduce Degradation, All-Gather Degradation, All-to-All Performance, Communication Imbalance


GPU MODE ▷ #torch (1 messages):

PyTorch 2.7, float8_e8m0fnu edge case, torch.compile, Custom Operators, Stride Matching


GPU MODE ▷ #jobs (1 messages):

AMD Hiring, GPU experience, Kernel development, Distributed inference, vLLM/Sglang


GPU MODE ▷ #beginner (11 messages🔥):

Saving and Loading Model Weights, Python Pickle Security Risks, GPU Cloud Storage Options, torch.save vs joblib.dump vs safetensors.save_file


GPU MODE ▷ #torchao (22 messages🔥):

FP8 Training in Axolotl, DDP Issues with torch.compile and FP8, FSDP2 Performance with Activation Checkpointing, Activation Checkpointing Optimization for Float8


GPU MODE ▷ #webgpu (1 messages):

AMD Developer Cloud, MCP Servers, Agentic RAG, Gemini CLI


GPU MODE ▷ #factorio-learning-env (14 messages🔥):

Belts show their content, Status overlays implemented, Factorio renderer performance, Agent Trajectory Length Clarification, Value Accrual Time


GPU MODE ▷ #cutlass (2 messages):

CUTLASS index mapping, tv_layout thread mapping, Hierarchical Layout Benefits


aider (Paul Gauthier) ▷ #general (33 messages🔥):

Ubuntu 20.04 deprecation, Open weights models lagging in Aider Polyglot, Qwen3 Coder, sglang setup, Claude Code (CC) usage


aider (Paul Gauthier) ▷ #questions-and-tips (15 messages🔥):

Aider file patching method, Gemini 2.5 Pro issues, Gemini Pro free tier, Aider system prompt


Notebook LM ▷ #use-cases (19 messages🔥):

Psychology differences between NotebookLM and other LLMs, NotebookLM PRO Settings, Deepseek API vs NotebookLM, Knowledge architecture using NotebookLM, Source ID


Notebook LM ▷ #general (24 messages🔥):

Podcast Length Issues, Chat History Saving Issues, Notebook Sharing Issues, Custom Audio Overview Issues, PDF Upload Issues


Modular (Mojo 🔥) ▷ #general (26 messages🔥):

Windows Support for Mojo, PowerPC resurrection, Mojo compiler status, GPU programming focus


Modular (Mojo 🔥) ▷ #max (14 messages🔥):

Max vs llama.cpp, vLLM vs Max Benchmarking, KV Cache Preemption, Device Memory Utilization, Prefix Cache


MCP (Glama) ▷ #general (24 messages🔥):

Agent tech stack, Session management, MCP security solutions, Immature SDKs, Claude desktop env vars


MCP (Glama) ▷ #showcase (3 messages):

Data and MLE infrastructure at startups, AI Agents with MCP, Scalekit.com, Secure MCP servers, OAuth 2.1 to an MCP server


LlamaIndex ▷ #blog (4 messages):

OCR Alternatives, Multimodal report generation with LlamaIndex, Notebook Llama Document Management, LlamaIndex Workflows State Management


LlamaIndex ▷ #general (6 messages):

Notmuch Integration, LlamaReport Alternatives


Manus.im Discord ▷ #general (10 messages🔥):

AI Foundation School App, Manus Computer Location, Startup App Development


DSPy ▷ #show-and-tell (4 messages):

DSPy presentation, DSPy modules


DSPy ▷ #general (2 messages):

dspy.Module subclass


DSPy ▷ #examples (2 messages):

DSPy Tutorial issues, Hugging Face dataset lib update, Dataset scripts issue


MLOps @Chipro ▷ #events (6 messages):

Data, MLE, and Startups Talk, AI Coding Tools Chat, MCP Builders Summit


MLOps @Chipro ▷ #general-ml (2 messages):

Research Faculty Recommendation System, Azure AI Search alternatives, Hybrid Search, Semantic Ranker Replacement, Explainability and Control in Ranking


Cohere ▷ #🧵-general-thread (2 messages):

Welcome to Cohere


Cohere ▷ #👋-introduce-yourself (2 messages):

AI product development, LLM products, AI Engineering, New technologies for business


Torchtune ▷ #dev (4 messages):

DCP Saving, FSDP+TP


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (3 messages):

Trailblazer Tier Certificate, Certificate Declaration Form


tinygrad (George Hotz) ▷ #general (2 messages):

Shipping containers for tinyboxes, Modular cooling benefits, tinycontainer


Nomic.ai (GPT4All) ▷ #general (2 messages):

New member Santhos, Ransomware Hacking on GPT4All


Codeium (Windsurf) ▷ #announcements (1 messages):

Kimi K2, Windsurf, New Model, Pricing