Frozen AI News archive

Anthropic releases Claude 4 Sonnet and Opus: Memory, Agent Capabilities, Claude Code, Redteam Drama

**Anthropic** has officially released **Claude 4** with two variants: **Claude Opus 4**, a high-capability model for complex tasks priced at **$15/$75 per million tokens**, and **Claude Sonnet 4**, optimized for efficient everyday use. The release emphasizes **instruction following** and extended work sessions up to **7 hours**. Community discussions highlight concerns about **token pricing**, **token accounting transparency**, and calls for **open-sourcing Claude 3.5 Sonnet** weights to support local model development. The news also covers **Claude Code GA**, new **Agent Capabilities API**, and various livestreams and reports detailing these updates. There is notable debate around **sliding window attention** and advanced inference techniques for local deployment.

Canonical issue URL

Hybrid models are all you need.

AI News for 5/21/2025-5/22/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (215 channels, and 9192 messages) for you. Estimated reading time saved (at 200wpm): 747 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

There are going to be a LOT more places that cover Claude 4 better than us, so we'll just provide our picks of the best:

It's very early still but our bet is that Claude 4's emphasis on "hours" of work - up to 7 hours, per Rakuten, and >1hr in Claude Code per Cat Wu's demo in the keynote, is significantly underrated vs the METR trajectories which had us at 1 hour 3 months ago.


AI Twitter Recap

Anthropic Claude 4 Release and Capabilities

Google AI Announcements and Models

AI Model Evaluation, Benchmarks, and Research

AI Agents and Applications

Industry News, Events, and Opinions

Humor and Miscellaneous


AI Reddit Recap

/r/LocalLlama Recap

1. Claude 4 Release and Controversies

2. Multimodal and Diffusion Model Announcements

3. Licensing, Agent Models, AI Policy, and Hardware Developments

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Claude 4 Official Releases, Demos, and Benchmarks

2. Anthropic Claude Opus 4 AI Ethics, Safety, and Emergent Behaviors

3. Veo 3 Disrupting Video Creation and AI-Generated Media


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: New Model Mayhem - Claude 4 & Gemini 2.5 Lead the Charge, Performance Debated

Theme 2: Developer Toolkits Get Sharper: IDEs, Frameworks, and GPU Optimizations Evolve

Theme 3: AI's Wild West: Navigating Safety, Privacy, and Censorship Frontiers

Theme 4: Fueling the Fire: Hardware and Infrastructure Debates for AI Dominance

Theme 5: Collective Brainpower: Community, Collaboration, and Learning Propel AI Forward


Discord: High level Discord summaries

LMArena Discord


LM Studio Discord


Perplexity AI Discord


Cursor Community Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


OpenRouter (Alex Atallah) Discord


aider (Paul Gauthier) Discord


Nous Research AI Discord


GPU MODE Discord


HuggingFace Discord


Latent Space Discord


Notebook LM Discord


Modular (Mojo 🔥) Discord


Manus.im Discord Discord


Yannick Kilcher Discord


MCP (Glama) Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


Nomic.ai (GPT4All) Discord


tinygrad (George Hotz) Discord


Torchtune Discord


MLOps @Chipro Discord


Codeium (Windsurf) Discord


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1357 messages🔥🔥🔥):

Claude 4 Opus vs Codex, Gemini 2.5 Pro is still competitive, Sonnet 4 vs Sonnet 3.5, Is LM Arena Rigged?


LMArena ▷ #announcements (2 messages):

Claude Opus 4, Claude Sonnet 4, Staff AMA


LM Studio ▷ #general (277 messages🔥🔥):

Qwen3 models, Gemma3, LM Studio and iPad, Void AI code editor, LLMs as calculators


LM Studio ▷ #hardware-discussion (993 messages🔥🔥🔥):

MoE Split between VRAM and RAM, Giant M.2 Box for Bandwidth, Specialized Models vs. MoE, Matrix Multiply Using Light, Project Silica


Perplexity AI ▷ #announcements (1 messages):

Perplexity Developer Forum, Sonar, API


Perplexity AI ▷ #general (1071 messages🔥🔥🔥):

Perplexity AI new voice mode, Gradient Colors, Discord server boosts, Comet browser data collection, GPTs Agents


Perplexity AI ▷ #sharing (4 messages):

Chain of Draft, Ant movement, Anthropic, Buc-ee's


Perplexity AI ▷ #pplx-api (13 messages🔥):

Perplexity Hackathon rules, Office hours reminder, API Key problems, API Credits issues, Sonar Hackathon


Cursor Community ▷ #general (950 messages🔥🔥🔥):

Cursor blocking requests, Claude 4 model release, Gemini 2.5 Pro model issues, AI coding and job security, RAG pipelines setup


OpenAI ▷ #ai-discussions (663 messages🔥🔥🔥):

Claude 4 launch, Gemini 2.5 Pro, Veo 3 vs Sora, AI film, Liquid Neural Networks


OpenAI ▷ #gpt-4-discussions (7 messages):

ChatGPT 4 experiences, ChatGPT bugs, ChatGPT 4.1 vs 4.0, ChatGPT 4o performance


OpenAI ▷ #prompt-engineering (11 messages🔥):

Meta-Cognition Agent, PromptChainHub Spec, Wordle AI, GPT-4o Personalization, Magic New Chat Window


OpenAI ▷ #api-discussions (11 messages🔥):

Wordle Challenge, CustomGPT's struggles with Wordle, GPT4o's performance, Prompting and the 'magic new chat window', PID UI Mockup


Unsloth AI (Daniel Han) ▷ #general (316 messages🔥🔥):

Falkon fine-tuning, Llama 3.2 vision conversion to GGUF, SmolVLM fine-tuning on T4 Colab, Mistral fine-tuning service, Devstral GGUFs


Unsloth AI (Daniel Han) ▷ #off-topic (23 messages🔥):

Gemma BOS Token, Tokenizer Differences, Untrained Embeddings in Llama3 Instruct, Anthropic quota, SeedCoder


Unsloth AI (Daniel Han) ▷ #help (175 messages🔥🔥):

Donut model for specific tasks, Unsloth patching, Deepseek V3, Qwen3-235B-A22B, Llama 4 fine-tuning


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

Retrieval Augmented Finetuning, RAFT Article, Finetuning Cookbook


Unsloth AI (Daniel Han) ▷ #research (9 messages🔥):

StabGAN, MMaDA-8B, MoE parallelism


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

Claude Sonnet 4, Claude Opus 4, OpenRouter Caching, OpenRouter Reasoning Parameters


OpenRouter (Alex Atallah) ▷ #app-showcase (3 messages):

Loqus AI Launch, AI Model Subscription, Custom AI Agents


OpenRouter (Alex Atallah) ▷ #general (386 messages🔥🔥):

OpenAI reasoning summaries, DeepSeek V3 vs 2.5 Flash, Vercel AI Model, Claude 4 Pricing and Performance, OpenRouter support for OpenAI responses API


aider (Paul Gauthier) ▷ #general (314 messages🔥🔥):

Gemini 2.5 Flash, Claude 4 Pricing, Local vs Cloud Models, OpenRouter benchmarks, Deepseek R2 Release


aider (Paul Gauthier) ▷ #questions-and-tips (37 messages🔥):

Github Copilot auth token extraction, Aider writing git diffs to markdown, Aider linter for Golang, Aider auto-accept architect mode, Gemini 2.5 Flash Preview in Aider


Nous Research AI ▷ #announcements (1 messages):

Nous Research Twitter, Talk Release, Community Achievements


Nous Research AI ▷ #general (306 messages🔥🔥):

Diffusion models, HVM2 bend, Consumer owned hardware and edge computing, Sonnet 4 and Opus 4 evals, Windsurf acquisition


Nous Research AI ▷ #ask-about-llms (3 messages):

Hermes 4 Release ETA


GPU MODE ▷ #general (21 messages🔥):

Triton language adoption, eDSLs vs CUDA, torch.compile use cases, Liger kernel at LinkedIn, Tiramisu and Halide comparisons


GPU MODE ▷ #triton (6 messages):

Triton 3.3.1 release, 5090 support, Blackwell support, Triton backward kernels


GPU MODE ▷ #cuda (3 messages):

Threads not having their bit set, reduction among threads, wgmma mbarrier synchronization


GPU MODE ▷ #cool-links (1 messages):

MAX Graph Compilation


GPU MODE ▷ #beginner (1 messages):

Elementwise Kernel, Vectorized Loads/Stores, Float4 Operations


GPU MODE ▷ #self-promotion (1 messages):

RGFW, Single-header library, Cross-platform windowing


GPU MODE ▷ #🍿 (6 messages):

RL Baseline for kernel model, PyTorch backend as an eval suite, Kevin model, human-designed RL rewards


GPU MODE ▷ #submissions (49 messages🔥):

MI300, amd-mixture-of-experts, amd-mla-decode, amd-fp8-mm


GPU MODE ▷ #status (3 messages):

AMD MLA Decode, GPU Issue Fixed, Output Weights Normalized


GPU MODE ▷ #factorio-learning-env (11 messages🔥):

Factorio TAS Runs, MineLand Simulator, FLE Lab Scenario


GPU MODE ▷ #amd-competition (15 messages🔥):

Weight Adjustment Patch, Seq Length Concerns, Kernel Length Limit, Deadline Extended


GPU MODE ▷ #cutlass (16 messages🔥):

CuTe DSL, AOT Model, PTX Dumps, Inductor Backends, CUDA


GPU MODE ▷ #mojo (4 messages):

Mojo language introduction, Mojo open sourced GPU code, Appropriate channels for Mojo posts


GPU MODE ▷ #singularity-systems (1 messages):

Picograd Parallelization, Math and Models Appendix


HuggingFace ▷ #announcements (1 messages):

Transformers, HuggingFace Hub, Gradio 5.30, SAM-HQ, HF Datasets


HuggingFace ▷ #general (86 messages🔥🔥):

Inference Speed Benchmarks, Quantization, Qwen vs Gemma, SBERT Fine-tuning, Cloud GPU Platforms with Free Credits


HuggingFace ▷ #today-im-learning (6 messages):

synthetic data to model pipelines, pytorch profiler, wandb integration


HuggingFace ▷ #i-made-this (16 messages🔥):

Syntx MCP Hub, Lunaris Codex, Paper Agent, HF Transfer Jupyter Notebook, OpenAI Agents JS


HuggingFace ▷ #NLP (1 messages):

Multi-modal AI, Tech Support Agent


HuggingFace ▷ #agents-course (18 messages🔥):

Agent Certification Issues, Dummy Agents Library Notebook Errors, Inference Provider Credit Limits


Latent Space ▷ #ai-general-chat (125 messages🔥🔥):

Altman Ive partnership, Universal Geometry of Embeddings, Cursor.ai Updates, v0 AI Model Release, Linear Agents


Notebook LM ▷ #use-cases (15 messages🔥):

Download text with attached citations, Gemini 2.5 Pro update on NotebookLM Plus, Instacart's new policies for Shoppers, Audio Overview customization, LLM synthesis between two topics


Notebook LM ▷ #general (105 messages🔥🔥):

NLM iOS app improvements, Generated audio quality in Spanish, NotebookLM Pro plan benefits, Audio overview customization, Podcast length limitations


Modular (Mojo 🔥) ▷ #general (3 messages):

Claude Code, Cursor, Mojo code generation tools, claude-sonnet-3.7, Open Source repo


Modular (Mojo 🔥) ▷ #mojo (108 messages🔥🔥):

compile time JSON parsing, Mojo max access removal, Rust vs C++ HTTP, Mojo async support, lockless mpmc queues


Manus.im Discord ▷ #general (111 messages🔥🔥):

Manus credits refund, Manus Image generation quality, Manus Enterprise version, AI agent security vulnerability, AI learning tool


Yannick Kilcher ▷ #general (40 messages🔥):

Entangled Representations in LLMs, Claude 4 Leaks, Toolformer, Stochasticity and Simpler Representations, ML for Art


Yannick Kilcher ▷ #paper-discussion (6 messages):

Knowledge Manipulation in LMs, Knowledge Capacity Scaling Laws, GPT-2 with rotary embedding vs LLaMA/Mistral


Yannick Kilcher ▷ #ml-news (31 messages🔥):

OpenAI acquisition, OpenAlpha_Evolve, Claude 4 Release, Anthropic's Claude Opus 4


MCP (Glama) ▷ #general (36 messages🔥):

MCP Server Namespacing, FastMCP Healthcheck, MCP Session Authentication, Restrictive Tool Naming Rules, MCP Server Execution


MCP (Glama) ▷ #showcase (21 messages🔥):

MCP Agent Update, LLM-Oriented Accessibility, AutoRAG MCP server, MCP Course Topic Suggestions, VerbalCodeAI release


DSPy ▷ #show-and-tell (1 messages):

Minting, Whitelists


DSPy ▷ #general (13 messages🔥):

DSPy, Bias training, Minting, LiteLLM terminal spam


DSPy ▷ #examples (1 messages):

Minting Announcement, OpenSea


DSPy ▷ #colbert (3 messages):

pylate interaction models, colbert with modernbert, question answer pair dataset with vlm and dspy, hard negative examples


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (11 messages🔥):

Certificate Verification, Assignment Deadlines, Written Assignment Submission, Entrepreneurship Track Submission


Nomic.ai (GPT4All) ▷ #general (5 messages):

Interface Extension for Non-Text LLMs, AMD 395+ NPU, 256 GB RAM Motherboards, GPT4All Break, AI Software Engineer Services


tinygrad (George Hotz) ▷ #general (4 messages):

AI in PRs, Halide Optimization vs tinygrad, tinygrad backend comparison (llvm, PTX, CUDA, NV)


Torchtune ▷ #rl (4 messages):

Microsoft RL framework, Multi-node async GRPOs, VLLM instances


MLOps @Chipro ▷ #events (1 messages):

MCP Hackathon, Featureform, Cased, Ridge Ventures


MLOps @Chipro ▷ #general-ml (1 messages):

AI Learning Path, AI Courses, India Engineering Student


Codeium (Windsurf) ▷ #announcements (1 messages):

Anthropic API key, Claude 4 Models, Cascade, Windsurf, BYOK