Frozen AI News archive

not much happened today

**Anthropic's Claude 4 models (Opus 4, Sonnet 4)** demonstrate strong coding abilities, with Sonnet 4 achieving **72.7%** on SWE-bench and Opus 4 at **72.5%**. Claude Sonnet 4 excels in codebase understanding and is considered **SOTA on large codebases**. Criticism arose over Anthropic's handling of **ASL-3 security requirements**. Demand for Claude 4 is high, with integration into IDEs and support from Cherry Studio and FastHTML. **Google DeepMind** introduced **Gemini 2.5 Pro Deep Think** and **Gemma 3n**, a mobile multimodal model reducing RAM usage by nearly 3x. **Google's Imagen 4 Ultra** ranks third in the Artificial Analysis Image Arena, available on **Vertex AI Studio**. Google also promoted **Google Beam**, an AI video model for immersive 3D experiences, and new text-to-speech models with multi-speaker support. The **GAIA benchmark** shows Claude 4 Opus and Sonnet leading in agentic performance.

Canonical issue URL

a quiet day.

AI News for 5/22/2025-5/23/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (215 channels, and 8630 messages) for you. Estimated reading time saved (at 200wpm): 705 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

A quiet day before a long weekend. AIE schedules are mostly published, and there are 5 discounted Expo tickets left for AINews readers.


AI Twitter Recap

Anthropic Claude Models (Opus 4, Sonnet 4)

Google Models (Gemini, Imagen, Veo) and AI Studio

Open Source and Frameworks

AI Agents and Tooling

Industry Musings and Opinions

Humor/Memes


AI Reddit Recap

/r/LocalLlama Recap

1. User Hardware Setups for Large-Scale LLM Inference

2. Speech and Audio Interfacing with LLMs: Kyutai Unmute Demo

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Veo 3 and AI Text-to-Video Model Use Cases & Community Experiments

2. Isomorphic Labs & AlphaFold: Rapid AI-driven Drug Discovery Progress

3. Anthropic Claude Opus 4 Launch: User Impressions, Pricing, and Creative Impact


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: The Claude Conundrum: Capabilities, Costs, and Controversies

Theme 2: Google's Gemini Gambit: Strengths, Stumbles, and Strategic Moves

Theme 3: Agents Agitating for Action: MCP, Interoperability, and New Tools

Theme 4: Performance Pursuit: Fine-tuning, Hardware, and Optimization Frontiers

Theme 5: Ecosystem Expansion: New Models, Tools, and Community Happenings


Discord: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


LM Studio Discord


OpenAI Discord


Cursor Community Discord


Manus.im Discord Discord


Nous Research AI Discord


aider (Paul Gauthier) Discord


OpenRouter (Alex Atallah) Discord


HuggingFace Discord


Latent Space Discord


GPU MODE Discord


Notebook LM Discord


LlamaIndex Discord


Eleuther Discord


Modular (Mojo 🔥) Discord


MCP (Glama) Discord


Yannick Kilcher Discord


DSPy Discord


Cohere Discord


tinygrad (George Hotz) Discord


Torchtune Discord


LLM Agents (Berkeley MOOC) Discord


MLOps @Chipro Discord


Codeium (Windsurf) Discord


The Nomic.ai (GPT4All) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 messages):

Pro Perks, Academic Homepage, Revamped Finance Dashboard, Search Audio & Video Files, 35+ Spaces Templates


Perplexity AI ▷ #general (1248 messages🔥🔥🔥):

Claude Opus 4 Coding Prowess, Flowith Privacy Concerns, Grok 3 mini accuracy, Comet Browser Access, Overrated sushi


Perplexity AI ▷ #sharing (4 messages):

ants insane movement speeds, Anthropic news, Buc-ee's Oak Creek


Perplexity AI ▷ #pplx-api (6 messages):

Devpost Forms, Github API Issue, API Billing


Unsloth AI (Daniel Han) ▷ #general (764 messages🔥🔥🔥):

Claude 4 evaluation, Fine-tuning Llama 4 Scout with Unsloth, Unsloth at AMD AI Event, Career advice: AI engineer major, job market, Herman Miller chairs


Unsloth AI (Daniel Han) ▷ #off-topic (13 messages🔥):

MCP Tunnelling, DeepChat, Opus 4 Limit


Unsloth AI (Daniel Han) ▷ #help (325 messages🔥🔥):

Llama4-Scout fine-tuning, Unsloth on Mac M1, Fine-tuning LLMs for fictional characters, vLLM and Inference speed, Qwen2-VL with Unsloth and vLLM


Unsloth AI (Daniel Han) ▷ #showcase (3 messages):

Retrieval Augmented Finetuning (RAFT), Unsloth, Llama32 1bn


Unsloth AI (Daniel Han) ▷ #research (20 messages🔥):

Expert Parallelism, Multi-Agent Systems, Model Review Requests, Gemma vs Qwen


LM Studio ▷ #general (216 messages🔥🔥):

Open WebUI as an alternative to LM Studio, LM Studio CORS issues with browsers, LLMs as Calculators, Tool calling in LLMs, AMD ROCm support for LLM inference


LM Studio ▷ #hardware-discussion (523 messages🔥🔥🔥):

DEC PDP-10 byte sizes, x86 page table entries, RAM density doubling, USB naming scheme, Multi-GPU setups with CUDA


OpenAI ▷ #ai-discussions (642 messages🔥🔥🔥):

Google Veo 3, Gemini vs ChatGPT, Claude 4, AI Film Creation, Anthropic's Claude API


OpenAI ▷ #gpt-4-discussions (2 messages):

ChatGPT, GitHub, GPT


OpenAI ▷ #prompt-engineering (9 messages🔥):

Slate Guessing Game, Magic New Chat Window Phenomenon, Vision Comprehension Struggles, AI and Religion alternative, Markdown in Prompts


OpenAI ▷ #api-discussions (9 messages🔥):

Wordle Solver GPT Performance, Magic New Chat Window, AI, Religion, or Beans?, Markdown Formatting in Prompts, Prompt Engineering Corrections


Cursor Community ▷ #general (671 messages🔥🔥🔥):

Claude 4 availability issues, Gemini 2.5 Pro shortcomings, Cursor performance issues, Claude 4's potential 'snitching' behavior, Comparing Cursor and Windsurf


Manus.im Discord ▷ #general (399 messages🔥🔥):

Manus Credits, Spam Calls After Phone Number, Alibaba Qwen3, Manus Agentic Features, Emergent.sh Credit System


Nous Research AI ▷ #general (315 messages🔥🔥):

Claude 4 reporting users, Mistral's OCR model, Hermes capabilities


Nous Research AI ▷ #ask-about-llms (15 messages🔥):

Lightweight Embeddings Models, bge m3 Embeddings Model, Claude 4


Nous Research AI ▷ #interesting-links (1 messages):

Psyche, Decentralized AI, Psyche network


aider (Paul Gauthier) ▷ #general (265 messages🔥🔥):

Claude 4 vs Gemini, OpenRouter API Key, Aider Benchmark, Python 3.13 support, Repo map ignore


aider (Paul Gauthier) ▷ #questions-and-tips (27 messages🔥):

Code comments overuse, Eloquent Code, Claude Sonnet 4, HTML Refactoring, Aider Edit Formats


OpenRouter (Alex Atallah) ▷ #general (168 messages🔥🔥):

Claude 4 Pricing, Sonnet 4 Performance, VerbalCodeAI Tool, Gemini Voice Mode, DeepSeek v3 for Knowledge


HuggingFace ▷ #general (63 messages🔥🔥):

Smaller models for memory extraction, Cloud GPU platforms with free credits, 256 GB supporting motherboards, Automating air traffic control with agentic LLMs, Video generation AI trends


HuggingFace ▷ #today-im-learning (3 messages):

GPU Memory Optimization, Gradient and Optimizer State Management, CUDA Out of Memory Errors


HuggingFace ▷ #i-made-this (20 messages🔥):

openai-agents-js Release, Rare Numbers Mobile Game, Takara AI Game with Claude 4, Lazarus Instruct LLM


HuggingFace ▷ #NLP (8 messages🔥):

LLaDA support in Transformers, Chat model training dataset design, Local RAG chatbot LLM recommendations, Fine-tuning models with non-public architectures


HuggingFace ▷ #agents-course (26 messages🔥):

LinkedIn Credential for certificate, Final submission & Certificate requirement, Deep Learning and ML Question, Share agents


Latent Space ▷ #ai-general-chat (50 messages🔥):

Mistral Document AI, Nitter 500 Errors, Claude Code Equivalents, Textract Comparison, Screenless Audio Devices


Latent Space ▷ #ai-in-action-club (65 messages🔥🔥):

MCPI CLI Update, Auto-Accept Rate, Discord Audio Issues, Cursor Tools vs Resources


GPU MODE ▷ #general (4 messages):

Dark Souls, Expedition 33, New Doom


GPU MODE ▷ #triton (5 messages):

Triton Convolution Example, Triton Double Buffering Kernels, Triton Auto-tuning Triggers, Interleaving PIDs in Triton


GPU MODE ▷ #cuda (2 messages):

mma.sync performance, RTX 5090


GPU MODE ▷ #torch (3 messages):

torch.compile for/while loop, nvtx annotations with torch compiled regions, CUDA graphs


GPU MODE ▷ #cool-links (7 messages):

MAX Graph Compilation, Fireworks DeepSeek Speed, Blackwell deployment


GPU MODE ▷ #beginner (1 messages):

Shared Memory Swizzling


GPU MODE ▷ #off-topic (6 messages):

Mick Gordon, DOOM 2016, Soundtrack, Balance Patch, Nightmare Difficulty


GPU MODE ▷ #self-promotion (1 messages):

RGFW, STB-style Libraries, Cross-platform Development


GPU MODE ▷ #🍿 (2 messages):

RL Kernel Code FT, PyTorch Backend Optimization, Leaderboard Data Strategy


GPU MODE ▷ #submissions (42 messages🔥):

MI300, amd-mla-decode, amd-mixture-of-experts, amd-fp8-mm, T4 grayscale


GPU MODE ▷ #ppc (1 messages):

PPC Course, Aalto Scoreboard


GPU MODE ▷ #factorio-learning-env (2 messages):

Codebase Flow, Agent-Server interaction


GPU MODE ▷ #amd-competition (7 messages):

RoPE bug, wo_weight normalization


GPU MODE ▷ #cutlass (9 messages🔥):

CUTLASS vs Triton, MLA Kernel Performance, FlashInfer CUTLASS Blackwell MLA Support


GPU MODE ▷ #mojo (3 messages):

channel posting, apologies


Notebook LM ▷ #use-cases (26 messages🔥):

Audio Overview Length Customization, NotebookLM Podcast Sound Naturalness, Google Gemini and NBLM, Audio Overview Language Availability, Spreadsheet Conversion to NotebookLM


Notebook LM ▷ #general (50 messages🔥):

Audio Overviews control, PDF processing, podcast longer in german, AI Gemini with prompts, Podcast in Italian


LlamaIndex ▷ #blog (3 messages):

Claude 4 Sonnet, Opus, Databricks AI Summit, Image Generation Agent


LlamaIndex ▷ #general (48 messages🔥):

ContextChatEngine and local file downloads, llama cloud integration for google drive, Claude 4 function calling issue, Anthropic API thinking blocks, AgentWorkflow issues with Claude 4


LlamaIndex ▷ #ai-discussion (2 messages):

LLM Prompt Engineering, Word-Wrapping in LLMs, LLM Tokenization, LLM output formats


Eleuther ▷ #general (35 messages🔥):

Discord Introductions, Llama 3 for chatbots, Open-weight models, Matching to ongoing work, ChatGPT for paper discovery


Eleuther ▷ #research (8 messages🔥):

Interpretability post removal, ICML AI agent workshop, AI generated work, Novel research, Paper submission


Eleuther ▷ #interpretability-general (4 messages):

Circuits 2.0, nnsight vs tl, causal interventions


Eleuther ▷ #lm-thunderdome (4 messages):

SOTA models, deduplication tools, dolma dedup tooling


Modular (Mojo 🔥) ▷ #mojo (45 messages🔥):

ARC Sorcery, LayoutTensor Parameters, Atomic Types, External Calls to Libs, Compile Time Changes


Modular (Mojo 🔥) ▷ #max (5 messages):

Offline Inference, LLM API Changes, LlamaConfig TypeError


MCP (Glama) ▷ #general (21 messages🔥):

GitHub MCP Server Access from Container, Clay.earth MCP Testing, Streaming Tool Results with MCP, Securing MCP Sessions, Claude Desktop Tool Consent Withdrawal


MCP (Glama) ▷ #showcase (26 messages🔥):

MCP Server Security, Client-side vs Server-side Security, VerbalCodeAI Introduction, Aura A2A Agent for Aira Hub, UI in MCP Spec


Yannick Kilcher ▷ #general (29 messages🔥):

Beefed-up Jailbreak, Token Limit Woes, Wumpus World Adaptation, Oscar-c Architecture, Attention Span Decay


Yannick Kilcher ▷ #paper-discussion (3 messages):

Knowledge Capacity Scaling Laws, GPT-2 vs LLaMA/Mistral, Domain Names Increase Knowledge Capacity


Yannick Kilcher ▷ #ml-news (13 messages🔥):

Claude Opus 4, AI Whistleblowing, Locally Hosted Models, AI Reporting Illegal Activity


DSPy ▷ #general (33 messages🔥):

LiteLLM Terminal Spam, BAML integration with DSPy, DSPy Prompt Structure, vLLM Thread Count, DSPy Core Concepts


Cohere ▷ #💬-general (1 messages):

kuki9999: Hi


Cohere ▷ #🔌-api-discussions (10 messages🔥):

Cohere Rerank API, Command A Model, PHP API Usage


Cohere ▷ #🤝-introductions (3 messages):

Blockchain Product Management, Emerging Tech Exploration, AI Project Development, Automation Tasks


tinygrad (George Hotz) ▷ #general (9 messages🔥):

Halide optimization similarities to tinygrad, tinygrad vs llvm vs cuda vs NV, Qwen3 performance on tinygrad, tinygrad AMD issues, Federated training with tinygrad


Torchtune ▷ #general (1 messages):

Office Hours Announcement, Upcoming Focus Areas, New Feature Highlights, Hat Promises


Torchtune ▷ #rl (8 messages🔥):

GRPO Recipe Validation, Async RL and Federated Learning


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (6 messages):

Entrepreneurship Track, Live Product Link, Browser Extension, Manual Installation


MLOps @Chipro ▷ #events (1 messages):

MCP Hackathon, Featureform, Cased, Ridge Ventures


MLOps @Chipro ▷ #general-ml (1 messages):

ML Courses, LLM Agents


Codeium (Windsurf) ▷ #announcements (1 messages):

Bring Your Own Key, Anthropic API Key, Claude 4 Models