Frozen AI News archive

not much happened today

**Moonshot AI** released the **Kimi K2**, a 1-trillion parameter ultra-sparse Mixture-of-Experts (MoE) model with the **MuonClip** optimizer and a large-scale agentic data pipeline using over **20,000 tools**. Shortly after, **Alibaba** updated its **Qwen3** model with the **Qwen3-235B-A22B** variant, which outperforms Kimi K2 and other top models on benchmarks like **GPQA** and **AIME** despite being 4.25x smaller. Alibaba also released **Qwen3-Coder-480B-A35B**, a MoE model specialized for coding with a 1 million token context window. **Google DeepMind** launched **Gemini 2.5 Flash-Lite**, a faster and more cost-efficient model outperforming previous versions in coding, math, and multimodal tasks. The MoE architecture is becoming mainstream, with models like **Mistral**, **DeepSeek**, and **Kimi K2** leading the trend. In mathematics, an advanced **Gemini** model achieved a gold medal level score at the **International Mathematical Olympiad (IMO)**, marking a first for AI. An **OpenAI** researcher noted their IMO model "knew" when it did not have a correct solution, highlighting advances in model reasoning and self-awareness.

Canonical issue URL

a quiet day

AI News for 7/21/2025-7/22/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (227 channels, and 6134 messages) for you. Estimated reading time saved (at 200wpm): 527 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

the release of Qwen 3 Coder (claiming Sonnet 4 level performance) and Qwen Code (a fork of Gemini Code) almost made title story, but we're going to wait out a little bit to see where the reviews come in.


AI Twitter Recap

Major Model Releases & Benchmarks: Qwen, Kimi, and Gemini

AI in Mathematics: The Race for IMO Gold

AI Infrastructure, Hardware & Efficiency

AI Tooling, Frameworks, and Applications

Research, Company News, and Broader Discourse

Humor/Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Qwen3 Coding Model Releases and Benchmarks

2. AI Hardware and Enthusiast Upgrades

3. MegaTTS 3 Voice Cloning and Open-Source AI Tools

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Claude Code User Experience & Optimization Discussion

2. AI Model Benchmarking at the International Mathematical Olympiad

3. Colossus Supercluster Expansion and xAI Training Infrastructure


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1. The Qwen3 Onslaught: A New Titan Enters the Arena

Theme 2. AI Faces Off in High-Stakes Competitions

Theme 3. Infrastructure Under Siege: Downtime, Rate Limits, and Training Woes

Theme 4. Open-Source Advances Beyond the Hype

Theme 5. The Hardware Frontier: Pushing Silicon to the Limit


Discord: High level Discord summaries

Perplexity AI Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


OpenRouter (Alex Atallah) Discord


LMArena Discord


Cursor Community Discord


LM Studio Discord


Latent Space Discord


GPU MODE Discord


HuggingFace Discord


Yannick Kilcher Discord


Eleuther Discord


LlamaIndex Discord


aider (Paul Gauthier) Discord


Notebook LM Discord


Nous Research AI Discord


Cohere Discord


Modular (Mojo 🔥) Discord


Torchtune Discord


MCP (Glama) Discord


DSPy Discord


tinygrad (George Hotz) Discord


LLM Agents (Berkeley MOOC) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Nomic.ai (GPT4All) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1266 messages🔥🔥🔥):

Comet Browser Invites, Perplexity Pro vs Max, Perplexity's Memory Feature, GPT 4.5 Performance, Perplexity as Default Assistant


Perplexity AI ▷ #sharing (4 messages):

Perplexity AI Apps, SETBA, Mahjong, Ozzy Osbourne


Perplexity AI ▷ #pplx-api (2 messages):

Comet Invite, Discord Channel Link


OpenAI ▷ #annnouncements (1 messages):

Stargate, Oracle, Abilene, TX


OpenAI ▷ #ai-discussions (829 messages🔥🔥🔥):

automatic writing, GPT-4 memory, Agent Mode release, DALL-E 3 art styles


OpenAI ▷ #gpt-4-discussions (4 messages):

Feature requests, iPhone app image upload issue, Cache storage


OpenAI ▷ #prompt-engineering (2 messages):

Discord Bot Creation, ChatGPT integration, Role-Playing Bots


OpenAI ▷ #api-discussions (2 messages):

Discord ChatGPT Bot Creation, Server Role-Playing Bots


Unsloth AI (Daniel Han) ▷ #general (742 messages🔥🔥🔥):

Qwen3 total size, resume training issues, Open Empathic, Qwen3 2507 reasoning ability, GPTs Agents


Unsloth AI (Daniel Han) ▷ #introduce-yourself (6 messages):

Minecraft AI Model, Unsloth usage


Unsloth AI (Daniel Han) ▷ #off-topic (19 messages🔥):

Open-weight SSM models, Falcon mamba, RULER by ART/OpenPipe, Emotions Aftermath, Neat life hacks


Unsloth AI (Daniel Han) ▷ #help (47 messages🔥):

Training Checkpoints, Multiple LoRA Adapters, Qwen3 Quants Issues, Falcon-7b Model, Qwen2.5-VL


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

Reinforcement Learning Workshop


Unsloth AI (Daniel Han) ▷ #research (4 messages):

Custom Model Outputs, RULER by ART/OpenPipe, Fine-tuning a dataset to fine-tune a model


Unsloth AI (Daniel Han) ▷ #unsloth-bot (54 messages🔥):

Flash Attention with Qwen3, Finetuning Mistral with tool calls, RULER integration with Unsloth, Multimodal training error resolution, Audio fine-tuning error resolution


OpenRouter (Alex Atallah) ▷ #announcements (4 messages):

Intermittent 408 errors, DeepSeek v3 0324 Free Model, Chutes rate limits, Traffic Spikes, OpenRouter credits


OpenRouter (Alex Atallah) ▷ #app-showcase (1 messages):

YourChat.pro, T3.chat, ChatGPT


OpenRouter (Alex Atallah) ▷ #general (723 messages🔥🔥🔥):

OpenRouter Free Tier, DeepSeek v3 Issues, Qwen3 Model Discussions, Chutes Rate Limiting, Model Censorship


OpenRouter (Alex Atallah) ▷ #new-models (4 messages):

``


OpenRouter (Alex Atallah) ▷ #discussion (72 messages🔥🔥):

Window AI browser extension status, Native Search Functionality for Models, OpenRouter's Exa search Implementation, Modular Add-on System, Gemini 2.5 Flash Lite GA


LMArena ▷ #general (672 messages🔥🔥🔥):

IMO math competition, DeepThink release, Qwen3 model, Grok4 Coder


Cursor Community ▷ #general (323 messages🔥🔥):

Freezing Chat Terminal, Gemini Recharge, Kimi K2 Speed, ChatGPT stealing souls, Cursor Pro Billing


Cursor Community ▷ #background-agents (4 messages):

Background Agent Quality, Automating Linear Issues in Slack with Cursor, Conversation Length Error in Background Agent


LM Studio ▷ #general (158 messages🔥🔥):

YaRN with LM Studio + MLX, Deleting unused back-ends, Coding championship AI, Eye Microsurgery AI Application, Nvidia Orin and LM Studio Compatibility


LM Studio ▷ #hardware-discussion (67 messages🔥🔥):

3090 vs 4080, SnapDragon X1 Adreno GPU, 5090 for RP, DeepSeek R1 70B, Gemma for creative writing


Latent Space ▷ #ai-general-chat (106 messages🔥🔥):

Kimi K2 Report, Agent Failure Modes, Humanloop Shutting Down, Cognition Valuation, Turbopuffer Pod


GPU MODE ▷ #general (30 messages🔥):

C++ templated libraries, Kog's inference benchmark on AMD MI300X, AMD MI300X Inference, vLLM optimization on MI300X, Automated bans for image spam


GPU MODE ▷ #cuda (1 messages):

pekaro: lmao, who they are lying to, themselves? sucky move to publish something like this


GPU MODE ▷ #torch (1 messages):

Torch compilation, stride assertions


GPU MODE ▷ #jobs (2 messages):

Kog, AMD MI300X, Inference Speed, French startup


GPU MODE ▷ #beginner (2 messages):

GCP, Google Colab


GPU MODE ▷ #torchao (5 messages):

FP8 Training, DDP Training, FSDP2, Activation Checkpointing


GPU MODE ▷ #off-topic (2 messages):

Neuralink, Brain-computer interfaces


GPU MODE ▷ #rocm (1 messages):

Register Corruption, SGPR Allocation


GPU MODE ▷ #thunderkittens (1 messages):

ThunderKittens, Attention Backward, LCF support


GPU MODE ▷ #factorio-learning-env (30 messages🔥):

System Prompt Length, RCON Errors, Item Display on Belt, Rate Limiting


GPU MODE ▷ #cutlass (2 messages):

Hierarchical Layouts, Tensor Reshaping, MMA Atoms


HuggingFace ▷ #general (58 messages🔥🔥):

Hugging Face Hub Issues, Wandb Alternatives on Kubernetes, Dalle-mini Traffic Issues, Advice for Young ML Enthusiasts, Shell.ai Hackathon 2025 Teams


HuggingFace ▷ #today-im-learning (4 messages):

JAX ML Scaling Book, Medical AI Imaging Future


HuggingFace ▷ #cool-finds (1 messages):

tejasshinde400: https://github.com/jujumilk3/leaked-system-prompts


HuggingFace ▷ #i-made-this (4 messages):

PDF to Dataset Tool, FaceFlux Deepfake Tool


HuggingFace ▷ #NLP (3 messages):

SetFit OOM issues, Jina Embeddings v2 base, CosineSimilarityLoss, ContrastiveDataset, Triplet


Yannick Kilcher ▷ #general (52 messages🔥):

Arxiv endorsement, DeepMind vs OpenAI in AI math, Language for solving math problems, Cross entropy loss


Yannick Kilcher ▷ #paper-discussion (13 messages🔥):

Deepseek V3, MuonClip Algorithm, smolLM3 implementation details, Model Merging


Yannick Kilcher ▷ #ml-news (4 messages):

OpenAI UK deal, Ministry of Silly Walks, AI youtube video


Eleuther ▷ #general (11 messages🔥):

lm-eval-harness, AI Math Olympiad, NAIRR compute, AI safety


Eleuther ▷ #research (8 messages🔥):

Weight Decay in Embeddings, Norm Layers scaling, Kimi k2 Paper, Synchronous training


Eleuther ▷ #scaling-laws (13 messages🔥):

KAN Activation Functions, B-Spline Curves, Training Dynamics Optimization, Expressivity vs Stability, Cell 9 Spline Training


Eleuther ▷ #interpretability-general (3 messages):

Sparse MoE Models, SAEs, Interpretability, PEER follow up


Eleuther ▷ #lm-thunderdome (19 messages🔥):

lm-evaluation-harness, byte latent transformer, facebook/blt-entropy, facebook/blt-1b


Eleuther ▷ #gpt-neox-dev (5 messages):

Amazon infra, SageMaker support, EFA support


LlamaIndex ▷ #blog (4 messages):

A2A Agents Hackathon, LlamaCloud nodes for n8n.io, LlamaParse Feature, Automate PDF parsing


LlamaIndex ▷ #general (51 messages🔥):

Inference Nerf, LlamaParse Authentication, LlamaIndex AWS Bedrock Models, Error Handling in LlamaIndex TS


aider (Paul Gauthier) ▷ #general (17 messages🔥):

Aider as an agent vs tool, Copilot glitch, Aider ignore .gitignore, Qwen3 models vs Aider, Python versions for Aider


aider (Paul Gauthier) ▷ #questions-and-tips (26 messages🔥):

Aider Polyglot examples, Model Looping, Aider summarization calls, Correct Edit Format


Notebook LM ▷ #use-cases (7 messages):

NotebookLM use cases, Obsidian Plugin, Reading books, Chat history retrieval


Notebook LM ▷ #general (30 messages🔥):

Google Ultra AI Subscription Benefits, NotebookLM Service Unavailable Error, PDF Image Reading Capabilities, NotebookLM for American Yawp Notes, Gemini Pro Model Integration


Nous Research AI ▷ #general (28 messages🔥):

Qwen3-235B-A22B, Kimi-K2 Tech Report, RLHF Reward Model, Kimi K2 Style


Cohere ▷ #🧵-general-thread (5 messages):

Computer vision applications, Generative models, Flow matching, VLM fine-tuning, Success Principles


Cohere ▷ #🔌-api-discussions (8 messages🔥):

JSON Schema Regression, Embed v4 Rate Limit


Cohere ▷ #👋-introduce-yourself (6 messages):

VLMs, Generative models, 3D reconstruction, AI platforms


Modular (Mojo 🔥) ▷ #general (16 messages🔥):

Mojo vs C++ Vectorization, Modular Careers, Modular Open Source Contribution, Mojo GPU Programming, Mojo Package Manager


Modular (Mojo 🔥) ▷ #mojo (1 messages):

Mojo async design, async/await ecosystem split


Modular (Mojo 🔥) ▷ #max (1 messages):

Max vs llama.cpp, CPU serving performance


Torchtune ▷ #general (6 messages):

RL Torchtune Release, RL Libraries


Torchtune ▷ #dev (2 messages):

Fine-tuning 70B models, recipe_state.pt size, torch.save behavior, Distributed checkpointing


MCP (Glama) ▷ #general (6 messages):

MCP vs System Tool Call, MCP Tool Purpose, Image Context, New Member Introduction


MCP (Glama) ▷ #showcase (2 messages):

WordPress integration for Claude, MCP server for WordPress


DSPy ▷ #show-and-tell (1 messages):

DSPy, Python user group, Local Tech Meetups


DSPy ▷ #general (5 messages):

Teleprompters, DSPy Modules, Professional Services orgs, AWS Services


tinygrad (George Hotz) ▷ #general (3 messages):

Whisper PR speed, shipping containers for tinyboxes, Rust for Tinygrad


tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

CUDA on Windows, CPU backend setup, LLVM backend setup


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (3 messages):

Certificate Issues, Writing Submission Form Issue