Frozen AI News archive

The Quiet Rise of Claude Code vs Codex

**Claude Code** is gaining mass adoption, inspiring derivative projects like **OpenCode** and **ccusage**, with discussions ongoing in AI communities. **Mistral AI** released **Mistral Small 3.2**, a **24B** parameter model update improving instruction following and function calling, available on **Hugging Face** and supported by **vLLM**. Sebastian Raschka implemented **Qwen3 0.6B** from scratch, noting its deeper architecture and memory efficiency compared to **Llama 3 1B**. **Google DeepMind** showcased **Gemini 2.5 Flash-Lite**'s UI code generation from visual context and added video upload support in the **Gemini App**. **Apple**'s new **3B** parameter on-device foundation model was benchmarked, showing slower speed but efficient memory use via **2-bit quantization**, suitable for background tasks. **Google DeepMind** also released **Magenta Real-time**, an **800M** parameter music generation model licensed under **Apache 2.0**, marking Google's 1000th model on **Hugging Face**. **Kuaishou** launched **KLING 2.1**, a new video model accessible via API.

Claude Code is all you need?

AI News for 6/19/2025-6/20/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (220 channels, and 4421 messages) for you. Estimated reading time saved (at 200wpm): 440 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Since there is no single event to point to, we have no real mechanism for nominating "quietly rising" stories like the ongoing mass adoption of Claude Code, which has made derivative projects like OpenCode and ccusage popular as well. Still, it definitely feels like something special is happening here. You can tune in to the AIE or LS Claude Code discussions.

Anj from the newly rebranded (and cluelyed) a16z points out that there is a way to track background coding agent PRs in open source. It's not much of a surprise that OpenAI Codex has something like 91.9% market share, but these numbers don't capture Claude Code's contributions, and Cursor's Background Agents are still prelaunch.


AI Twitter Recap

Model Updates, Releases, and Performance

AI Agent Development & Tooling

Infrastructure, Efficiency, and Developer Tools

Research, Papers, and New Techniques

Industry Commentary & Broader Implications

Humor/Memes


AI Reddit Recap

/r/LocalLlama Recap

1. Mistral Small 3.2 Model Launch and Community Discussion

2. Repurposing Legacy GPUs for LLM Inference: RX 580 Cluster Project

3. Launch of Google MagentaRT: Real-Time Music Generation Model

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Apollo Research on Model-Aware AI Safety Testing

2. US Army Appointing Tech Executives as Lt. Colonels

3. AI Agent Event Planning — 4 Agents, 23 Humans


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: AI Model Mania: Performance Peaks and Pitfalls

Theme 2: Building the Future: Tools, Training, and GPU Tribulations

Theme 3: Beyond Bytes: Probing AI's Mind and Expanding Its Reach

Theme 4: Open Source Uprising: Community Forges Ahead with Tools and Talent

Theme 5: Access All Areas? Navigating Model Costs, Uptime, and Deprecations


Discord: High level Discord summaries

OpenAI Discord


Perplexity AI Discord


HuggingFace Discord


LMArena Discord


Unsloth AI (Daniel Han) Discord


OpenRouter (Alex Atallah) Discord


Modular (Mojo 🔥) Discord


Yannic Kilcher Discord


Nous Research AI Discord


LM Studio Discord


Latent Space Discord


Eleuther Discord


GPU MODE Discord


aider (Paul Gauthier) Discord


Manus.im Discord


MCP (Glama) Discord


LlamaIndex Discord


Notebook LM Discord


Torchtune Discord


Cohere Discord


DSPy Discord


tinygrad (George Hotz) Discord


Nomic.ai (GPT4All) Discord


Codeium (Windsurf) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.



Discord: Detailed by-Channel summaries and links

OpenAI ▷ #ai-discussions (859 messages🔥🔥🔥):

AI Soul, LLAMA Model Benchmarks, OpenAI Content Filters, GPT-5 Speculation, O3 Pro Performance


OpenAI ▷ #gpt-4-discussions (6 messages):

Phi-5, Banning words from vocabulary, GPT Customization Soft-Ban


OpenAI ▷ #prompt-engineering (1 message):

Conjecture Dialogue Engine, AI Systems for Opposing Viewpoints, Theoretical Extrapolation


OpenAI ▷ #api-discussions (1 message):

Conjecture Dialogue Engine, AI system utility, Theoretical extrapolation


Perplexity AI ▷ #general (458 messages🔥🔥🔥):

Rate Limiting on X, Sonnet Reasoning Issues, MIT Study on ChatGPT Use, Grok Nerfed?, Perplexity not responding


Perplexity AI ▷ #sharing (9 messages🔥):

Shareable Threads, MIT ChatGPT study, Belief & Identity threat, Oakley Meta Partnership, Earthquake


Perplexity AI ▷ #pplx-api (3 messages):

sonar-deep-research model, AI Browsing capabilities, search context size, real-time browsing, deep research


HuggingFace ▷ #general (338 messages🔥🔥):

LLM OS, Gemini Diffusion, hf email servers DDOS, SmolVLM on vllm


HuggingFace ▷ #today-im-learning (2 messages):

Qwen2.5-Coder Model, Langgraph Tool Calls, Open-Source Coding LLM, Megatron Parallelism


HuggingFace ▷ #i-made-this (33 messages🔥):

OS-Agent Update, Claude Opus 4 Emergence, VoiceHub TTS Library, Adaptive Classifier, Quantum effects of consciousness


HuggingFace ▷ #reading-group (2 messages):

Micro Batch Size, USPB space


HuggingFace ▷ #core-announcements (1 message):

disk offloading, low VRAM-RAM scenarios


HuggingFace ▷ #computer-vision (1 message):

master_andreas: Does Optimum.Intel support object detection tasks?


HuggingFace ▷ #agents-course (3 messages):

Google Colabs in course, Gemini 2.0 Flash, Langgraph START import error


LMArena ▷ #general (336 messages🔥🔥):

Google free storage "hack", GPT4o-mini usage, Minimax vs Veo 3, Gemini Token Usage, Flamesong Model


Unsloth AI (Daniel Han) ▷ #general (211 messages🔥🔥):

Gemma 3 12B distillation, Unsloth on B200, Training with Unsloth issues, Runpod and Unsloth, Accelerate and Unsloth


Unsloth AI (Daniel Han) ▷ #help (55 messages🔥🔥):

Career path into AI, Training QWEN 3, Unsloth Breaking Changes, Distributing Models on Multiple GPUs, LLM model running on Hardware


Unsloth AI (Daniel Han) ▷ #research (1 message):

codelion_: https://huggingface.co/blog/codelion/adaptive-classifier


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

Gemini 2.5 Pro Uptime Boost, Claude Sonnet 4 Uptime Boost, GPT-4.5 Deprecation


OpenRouter (Alex Atallah) ▷ #general (221 messages🔥🔥):

OpenRouter Pricing, Gemini vs GPT, Deepseek Models, Chrome Extensions, MiniMax


Modular (Mojo 🔥) ▷ #general (2 messages):

Mojo vs Python


Modular (Mojo 🔥) ▷ #mojo (188 messages🔥🔥):

helper script for mojo kernel development, dynamic linking issues in QEMU, Standard Library discussion, Mojo benchmark vs python


Yannic Kilcher ▷ #general (119 messages🔥🔥):

Bias in AI training data, Agent Architecture Coherency, Mamba vs RNN, AI NPCs in gaming


Yannic Kilcher ▷ #paper-discussion (17 messages🔥):

Energy Matching, Flow Matching, Energy-Based Models, nano-jepa, nano-gpt


Yannic Kilcher ▷ #ml-news (9 messages🔥):

Illusion of Thinking, Logic Analyzer, Credentials Exposed


Nous Research AI ▷ #general (98 messages🔥🔥):

AI short-circuiting reasoning, Hermes-4, LLaVa-CC3M-595k, Entropy in AI, Quantum Brains


Nous Research AI ▷ #ask-about-llms (7 messages):

Anthropic Models, Claude Code, Opus 4, Sonnet


Nous Research AI ▷ #research-papers (3 messages):

Illusion of Thinking, Fractals


Nous Research AI ▷ #interesting-links (6 messages):

Nous Inference, Models.dev, Vercel's AI SDK, Hermes API, Opencode


Nous Research AI ▷ #research-papers (3 messages):

Illusion of Thinking, Fractal Cosmos


LM Studio ▷ #general (43 messages🔥):

OpenCode setup with LM Studio, Displaying context usage in LM Studio, RyzenAI NPU support in LM Studio, Audio transcription with LM Studio, Faster Whisper


LM Studio ▷ #hardware-discussion (69 messages🔥🔥):

GMKtec EVO-X1 Speed, Q8 vs Q6_K Models, LLM Quantization Explanation, LLM performance measurement, New LLM Models


Latent Space ▷ #ai-general-chat (54 messages🔥):

Model Context Protocol (MCP), OpenAI Codex GitHub Activity, Tersa Open-Source AI Workflow, Mistral Small 3.2 Update, Claude Code Autonomous Improvement


Latent Space ▷ #ai-announcements (16 messages🔥):

Noam Brown Podcast, Windsurf AI, Test-Time Scaling Limitations, Multi-Agent Research, Ilya Sutskever's Views


Eleuther ▷ #general (27 messages🔥):

Contributing to EleutherAI, Interpretability Projects, Open World Labs (OWL), Public Problem List


Eleuther ▷ #research (38 messages🔥):

Illusion of Thinking, Ergonomics tips for LaTeX, AI Social Dynamics, Codebook Training for LLMs


GPU MODE ▷ #general (21 messages🔥):

Domain-Specific LLMs, Gemma 27B Capabilities, Fine-tuning vs. Training from Scratch, Parameter-Efficient Fine-Tuning (PEFT), Large Concept Model


GPU MODE ▷ #cuda (6 messages):

CUDA gdb, Nsight Integration


GPU MODE ▷ #torch (6 messages):

Torch Compiler Thread Safety, FX Tracing and Dynamo Optimization, Module#forward Compilation


GPU MODE ▷ #algorithms (1 message):

kszysiu2137: Bubble sort


GPU MODE ▷ #cool-links (1 message):

LLMs, AusysAI blog post


GPU MODE ▷ #jobs (1 message):

Security Hypervisor Platform Job, KVM/QEMU, Low-Level Systems Performance, Linux Kernel


GPU MODE ▷ #beginner (2 messages):

LLM research project, GPU reduction


GPU MODE ▷ #rocm (1 message):

ROCm code objects, RadeonGPUAnalyzer


GPU MODE ▷ #submissions (1 message):

MI300 Leaderboard, AMD MLA Decode Performance


GPU MODE ▷ #factorio-learning-env (15 messages🔥):

ImportError fix, AlphaStar project, Factorio source code access, on_player events in Factorio, Cool paper on Factorio


GPU MODE ▷ #cutlass (1 message):

edd0302: https://github.com/Dao-AILab/quack

Dao-AILab just released a repo with several examples


aider (Paul Gauthier) ▷ #general (39 messages🔥):

Deepseek Free and openrouter, Github Copilot pricing, Llama Models, O3 Pricing, C# Benchmarks


aider (Paul Gauthier) ▷ #questions-and-tips (10 messages🔥):

Aider's prompts, AI code additions, Gemini 2.5 timeout, No code platform ideas


aider (Paul Gauthier) ▷ #links (1 message):

Prompt Engineering, AI Agent workflow


Manus.im Discord ▷ #general (41 messages🔥):

Finalspark and Koniku biocomputers, Reporting bugs in Manus, GLaDOS dataset and sarcastic Manus, Free AI APIs with high rate limits, Using generated documents as source for new tasks


MCP (Glama) ▷ #general (28 messages🔥):

Endpoint Description Generation, Memvid MCP Server, Dynamic Client Registration, NPM Package MCP, Local MCP Servers


MCP (Glama) ▷ #showcase (6 messages):

ht-mcp open source, Agentic coding tools, MXCP: Build Secure, Fast, MCP Servers from SQL, Deno Template Repo


LlamaIndex ▷ #blog (2 messages):

LlamaIndex Memory Blocks, LlamaCloud MCP hackathon, LlamaExtract, Claude Desktop


LlamaIndex ▷ #general (28 messages🔥):

Gemini Token Counting, LlamaIndex Tokenizer, Multi-Agent Context Management, LLM Class Extensions


Notebook LM ▷ #use-cases (6 messages):

GestaltView Ecosystem, NotebookLM Partnership, Innovation Mental Health


Notebook LM ▷ #general (21 messages🔥):

Site Access Issues, NotebookLM Plans, Running Open Source Models, Removing Failed URLs, Tables for Comparison


Torchtune ▷ #dev (25 messages🔥):

Nvidia Megatron-LM vs NeMo, Manual testing of PRs for model definitions, Dataset packing OOM on 64 H100s, Pre-tokenized and packed datasets, on-the-fly packing RFC


Cohere ▷ #🧵-general-thread (7 messages):

Cohere Billing, Training and Serving Models


Cohere ▷ #🔌-api-discussions (4 messages):

Cohere Embed-4, Azure Integration, CohereClientV2 Support, PDF Embedding


Cohere ▷ #👋-introduce-yourself (6 messages):

Multimodal privacy, NLP in Singapore, ML and Cybersecurity, Model Compression


DSPy ▷ #general (6 messages):

Bedrock, Claude models, Nova models, Haiku 3, 4o-mini


tinygrad (George Hotz) ▷ #general (3 messages):

Contributing to tinygrad


Nomic.ai (GPT4All) ▷ #general (3 messages):

AI-powered voice assistant shell script, LLM as a server, Discord account hacked


Codeium (Windsurf) ▷ #announcements (1 message):

Windsurf Official Brand, New Logo and Wordmark, International Surf Day, Windsurf Community Event