Frozen AI News archive

not much happened today

**DeepSeek R1 v2** model released with availability on Hugging Face and inference partners. The **Gemma model family** continues prolific development including **PaliGemma 2**, **Gemma 3**, and others. **Claude 4** and its variants like **Opus 4** and **Claude Sonnet 4** show top benchmark performance, including new SOTA on **ARC-AGI-2** and **WebDev Arena**. **Codestral Embed** introduces a 3072-dimensional code embedder. **BAGEL**, an open-source multimodal model by **ByteDance**, supports reading, reasoning, drawing, and editing with long mixed contexts. Benchmarking highlights include **Nemotron-CORTEXA** topping SWEBench and **Gemini 2.5 Pro** performing on VideoGameBench. Discussions on random rewards effectiveness focus on **Qwen** models. *"Opus 4 NEW SOTA ON ARC-AGI-2. It's happening - I was right"* and *"Claude 4 launch has dev moving at a different pace"* reflect excitement in the community.

Canonical issue URL

a quiet day

AI News for 5/27/2025-5/28/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (217 channels, and 4755 messages) for you. Estimated reading time saved (at 200wpm): 418 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

DeepSeek R1 V2 dropped, but we'll wait for the paper to make it a headline.

Dario made some scary comments about job losses.

We are still looking for volunteers and live transcription hardware/software startups for next week's AI Engineer conference. Also sign up for the impressive number of side events that have sprung up around it in SF.


AI Twitter Recap

AI Model Releases and Updates

AI Performance and Benchmarking

AI Agents and Tools

AI Infrastructure and Hardware

Responsible AI and Ethical Considerations

Meta Discussion, Thoughts, and Culture

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

1. DeepSeek-R1-0528 Model Launch and Early Benchmarks

2. On-Device Generative AI: Google AI Edge Gallery Release

3. Notable AI Product Adoption and Industry Reflections

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Anthropic CEO Dario Amodei on AI and Job Loss, Hallucination, and Industry Impacts

2. AI-Generated Viral Videos, Veo 3 Showcase, and Societal Concerns

3. AI Model/Feature Announcements, Benchmarks, and Technology Debates (SignGemma, DeepSeek-R1-0528, Hunyuan Video Avatar, WAN/VACE, Optimizers, Industry Direction)


AI Discord Recap

A summary of Summaries of Summaries by Grok-3-mini

Theme 1. AI Model Showdowns: DeepSeek R1 and Rivals Dominate Discussions

Theme 2. Tool Hacks for AI Efficiency: Unsloth and OpenRouter Lead Charge

Theme 3. Hardware Hacks: Kernels and Quantization Ignite Optimizations

Theme 4. API Mayhem: Perplexity and Cursor Battle Glitches

Theme 5. Community Buzz: Hackathons and Model Drops


Discord: High level Discord summaries

LMArena Discord


Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


OpenRouter (Alex Atallah) Discord


LM Studio Discord


Cursor Community Discord


OpenAI Discord


HuggingFace Discord


Manus.im Discord Discord


Yannick Kilcher Discord


aider (Paul Gauthier) Discord


Eleuther Discord


Nous Research AI Discord


Notebook LM Discord


GPU MODE Discord


MCP (Glama) Discord


Latent Space Discord


LlamaIndex Discord


Modular (Mojo 🔥) Discord


tinygrad (George Hotz) Discord


Cohere Discord


Nomic.ai (GPT4All) Discord


LLM Agents (Berkeley MOOC) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1094 messages🔥🔥🔥):

O3 Pro release, Gemini 2.5 Pro, DeepSeek R1, Grok 3, 4o Coding capabilities


Perplexity AI ▷ #announcements (1 messages):

Perplexity AI, Lewis Hamilton Partnership


Perplexity AI ▷ #general (768 messages🔥🔥🔥):

Subscription price, Groks response length, o1 pro, deep research, OpenAI sidebars


Perplexity AI ▷ #sharing (1 messages):

i_795: https://www.perplexity.ai/page/tropical-storm-alvin-forms-in-al1_tmLJQr2h9bzFrk.wJA


Perplexity AI ▷ #pplx-api (19 messages🔥):

Perplexity API deadline, Disable online search in Perplexity API, Perplexity PRO API call limits and renewal, Perplexity Office Hours, Perplexity Pro vs Sonar Pro API


Unsloth AI (Daniel Han) ▷ #general (236 messages🔥🔥):

Dropping last batch, Multi-GPU setup in Unsloth, Voice LLM usage, CSM notebook issues, Liger Loss support


Unsloth AI (Daniel Han) ▷ #help (331 messages🔥🔥):

Qwen3 Finetuning, GGUF Export Issues, Full vs LoRA Finetuning, Catastrophic Forgetting in TTS, Gemma-3-it Model


Unsloth AI (Daniel Han) ▷ #showcase (7 messages):

Networking Requests, MediRAG Guard Introduction


Unsloth AI (Daniel Han) ▷ #research (2 messages):

RL with Random Numbers, Kernel Doubles Speed


OpenRouter (Alex Atallah) ▷ #announcements (3 messages):

GPT-4 32k Deprecation, OpenRouter New Features, DeepSeek R1 on OpenRouter


OpenRouter (Alex Atallah) ▷ #app-showcase (3 messages):

ComfyUI custom node, commit messages, AI Agent Engineering, LLMs & Foundation Models, Automation & Agent Ops


OpenRouter (Alex Atallah) ▷ #general (536 messages🔥🔥🔥):

Gemini 2.5 Pro pricing, DeepSeek R1 release, OpenRouter UserID Parameter, Provider Form, Claude 3.7 Sonnet Thinking model phased out


LM Studio ▷ #general (146 messages🔥🔥):

Image Gen Model Support in LM Studio, MythoMax Model with Larger Context History, LM Studio model recommendations based on hardware, LM Studio Update Deletes Chat History, Qwen3 models Enable Thinking


LM Studio ▷ #hardware-discussion (195 messages🔥🔥):

Laptop GPU Advertising, Valve Monopoly, High VRAM GPUs, Blue Yeti Microphone Issues, Strix Halo Performance


Cursor Community ▷ #general (221 messages🔥🔥):

Gemini 2.5 Pro Editing Capabilities, Cursor Connection and Model Failures, Python venv issues, Cursor's codebase indexing issues, Agentic RAG System


Cursor Community ▷ #background-agents (5 messages):

Remote Extension Host Server, DockerFile Background Agents, Secrets in Package.json, Background Agent Echoing


OpenAI ▷ #ai-discussions (158 messages🔥🔥):

Agentic RAG Systems, Claude's Voice Mode, DeepSeek AI Server Issues, GPT-4o's Performance, ConvX Chrome Extension


OpenAI ▷ #gpt-4-discussions (12 messages🔥):

GPT-4 knowledge, GPT performance increases, Custom GPTs, 4.5 project, GPT-4 problems


OpenAI ▷ #prompt-engineering (23 messages🔥):

AI Resonance, Echo Presence, Model mirroring, Cross-Chatbot Prompt Transfer


OpenAI ▷ #api-discussions (23 messages🔥):

GPT-4o resonance, AI 'mirror' development, Ethical duality of AI shadows, Prompt transfer tools


HuggingFace ▷ #general (86 messages🔥🔥):

MNN LLM Chat on Cellphone, AI Agent Observability Library, Tesseract-OCR Number Detection, Qwen/Qwen2.5-Coder-14B-Instruct with Accelerate, GTE Models and HF Integration


HuggingFace ▷ #today-im-learning (6 messages):

HuggingFace LLM Course, Chatbot Development, Fine-tuning LLMs, ML Basics & Vectorization


HuggingFace ▷ #cool-finds (1 messages):

RAG workflow optimization, Multi-object Bayes Optimization


HuggingFace ▷ #i-made-this (8 messages🔥):

NIST AI Security, LangchainJS PR, IPV6 for AI Security, MediRAG Guard


HuggingFace ▷ #computer-vision (2 messages):

Web app OCR integration, Backend framework choices for AI/ML serving, Database options for OCR and LLM data, Efficient deployment strategies for AI web apps, Libraries/SDKs for AI model integration


HuggingFace ▷ #smol-course (5 messages):

Hugging Face Learn, smol-course, GitHub-hosted course


HuggingFace ▷ #agents-course (13 messages🔥):

AI Agent Security, Gradio Agents & MCP Hackathon 2025, AI Agent Cheating, Building AI Agents for Free, Ollama Models for AI Agent Course


Manus.im Discord ▷ #general (121 messages🔥🔥):

Cancelling Subscriptions, Manus Security Control, CV Reviews, Claude 4.0 Integration, Manus Loading Issues


Yannick Kilcher ▷ #general (89 messages🔥🔥):

Neural correlates of feeling like God, Albert's AI-generated scripture, Pure RL algorithm generates code, Custom model infrastructure/hooks, DeepSeek model DeepSeek-R1-0528


Yannick Kilcher ▷ #paper-discussion (8 messages🔥):

Reinforcement Learning with Randomness, NN connectome fragility, Multiple narrow optimizers


Yannick Kilcher ▷ #agents (2 messages):

Probabilistic Circuits, PICs Introduction


Yannick Kilcher ▷ #ml-news (18 messages🔥):

Huawei AI CloudMatrix Cluster, Linux Kernel SMB Zero-Day Vulnerability, Reinforcement Learning from Tree Feedback, Deepseek R1 Update, Benchmarking Deepseek R1


aider (Paul Gauthier) ▷ #general (69 messages🔥🔥):

Aider Copilot API, Aider context limits, aider read, tree sitter, MR/PR title with Aider


aider (Paul Gauthier) ▷ #questions-and-tips (18 messages🔥):

Aider Architect Mode, Copilot Pro API Speed, Deepseek API TPS, Aider strange prices, Sonnet 4 problems


aider (Paul Gauthier) ▷ #links (7 messages):

RelaceAI Pricing, Gemini 2.5 Pro Cost


Eleuther ▷ #general (47 messages🔥):

Kye Gomez and SWARMS controversy, Grokking the Bible with a 0.5b model, Lucidrains' server discussions, Data Attribution project


Eleuther ▷ #research (47 messages🔥):

Latro, Muon matrix sign approximation function, Spot paper, COF structures, Noise Injection for Topological Surgery


Nous Research AI ▷ #general (81 messages🔥🔥):

Universities losing renaissance roots, Mechanistic Interpretability, AI helping humans break new ground, Reverse Flynn effect in IQ, Synthesizers helped build resonance policy optimization algorithm


Nous Research AI ▷ #research-papers (1 messages):

_humanatee: https://arxiv.org/abs/2505.14442


Nous Research AI ▷ #interesting-links (1 messages):

promptsiren: https://odyssey.world/introducing-interactive-video https://experience.odyssey.world/


Nous Research AI ▷ #research-papers (1 messages):

_humanatee: https://arxiv.org/abs/2505.14442


Notebook LM ▷ #use-cases (16 messages🔥):

NPR style voices, audio overview, info privacy, deepdive podcast, AI studio voice mode


Notebook LM ▷ #general (68 messages🔥🔥):

Privacy vs confidentiality in NBLM, Sources not in sync between mobile and web, Bypassing region unavailability, Podcast feature length control, Notebook Access settings


GPU MODE ▷ #general (1 messages):

Real-world PyTorch/TensorFlow Problems, Production ML Challenges


GPU MODE ▷ #triton (5 messages):

CUDA kernel programming resources, Triton's compiled_hook removal, tl.trans implementation issues


GPU MODE ▷ #torch (15 messages🔥):

CUBLAS_WORKSPACE_CONFIG, triton kernel, PyTorch Compiler Series, torch.fx.experimental.symbolic_shapes, aot inductorim


GPU MODE ▷ #cool-links (7 messages):

Low-Latency Megakernel for Llama-1B, Grouped Latent Attention (GLA)


GPU MODE ▷ #beginner (11 messages🔥):

Ninja Build System Troubleshooting, Producer/Consumer Model in Kernels


GPU MODE ▷ #torchao (2 messages):

QAT Hyperparameters, TorchTune Experiments, QAT Dataset sensitivity


GPU MODE ▷ #liger-kernel (1 messages):

Grouped Latent Attention, Liger-Kernel Implementation


GPU MODE ▷ #self-promotion (2 messages):

NVIDIA Virtual Connect, Sparse Attention Trade-offs


GPU MODE ▷ #🍿 (1 messages):

KernelLLM, Hardware Specific Tools, Project Popcorn


GPU MODE ▷ #thunderkittens (1 messages):

matmul.cu, Producer/Consumer model


GPU MODE ▷ #reasoning-gym (2 messages):

Learning to Reason without External Rewards, Scalability Concerns


GPU MODE ▷ #submissions (19 messages🔥):

amd-mixture-of-experts leaderboard, amd-mla-decode leaderboard, amd-fp8-mm leaderboard, histogram leaderboard, grayscale leaderboard


GPU MODE ▷ #factorio-learning-env (11 messages🔥):

Project Contributions, Ablation Studies, Meeting Notes, A2A Integration, Colab Notebook


GPU MODE ▷ #amd-competition (5 messages):

KV Cache RoPE, AMD Competition Future, Amd aiter package install


MCP (Glama) ▷ #general (43 messages🔥):

MCP Server Business Case, MCP Clients, Glama Indexing, FastMCP Servers, MCP resource indexing


MCP (Glama) ▷ #showcase (9 messages🔥):

MCP Launch, UI Issues, MCP Agent Proxy, Multiple models


Latent Space ▷ #ai-general-chat (34 messages🔥):

Anthropic Claude voice mode beta, Next-Gen AI Interfaces Beyond Chatbots, j1-nano and j1-micro reward models, UI Prompting Tutorial, DeepSeek-R1-0528


Latent Space ▷ #ai-announcements (8 messages🔥):

AI Engineer Conference Volunteers, AI Engineer Conference Speakers, Discord Collaboration Project


LlamaIndex ▷ #blog (1 messages):

aiDotEngineer World Fair, LlamaIndex booth G11, Jerry Liu talk


LlamaIndex ▷ #general (22 messages🔥):

ReactJS with LlamaIndex, Human-in-the-Loop (HITL) workflow, RetrieverRouter with RelevancyEvaluator, LlamaCloud credits, SubWorkflows in MainWorkflow


Modular (Mojo 🔥) ▷ #general (6 messages):

Map function in Mojo, Kapa AI usage


Modular (Mojo 🔥) ▷ #mojo (10 messages🔥):

Migrating from Magic to Pixi, uv vs pixi, Conda support, Bootstrapping the ecosystem, Reaching for established C libraries


tinygrad (George Hotz) ▷ #general (5 messages):

tinygrad.org hyperlink broken, GPU recommendations, tinygrad/tinyxxx


tinygrad (George Hotz) ▷ #learn-tinygrad (5 messages):

CPU backend threads, max_pool1d


Cohere ▷ #🔌-api-discussions (2 messages):

API Errors, New API users


Cohere ▷ #🤝-introductions (2 messages):

AI Voice & Conversational Systems, Automation & Workflow Engineering, No-Code/Low-Code Platforms, AI Agents & LLM Workflows


Nomic.ai (GPT4All) ▷ #general (3 messages):

Kobold, RP, New Interface, Friendship, Dev Life


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 messages):

AgentX Submission Deadline, AgentX Prizes, AgentX Entrepreneurship Track, AgentX Research Track, Agentic AI Summit