Frozen AI News archive

DeepSeek-R1-0528 - Gemini 2.5 Pro-level model, SOTA Open Weights release

**DeepSeek R1-0528** marks a significant upgrade, closing the gap with proprietary models like **Gemini 2.5 Pro** and surpassing benchmarks from **Anthropic**, **Meta**, **NVIDIA**, and **Alibaba**. This Chinese open-weights model leads in several AI benchmarks, driven by reinforcement learning post-training rather than architecture changes, and demonstrates increased reasoning token usage (23K tokens per question). The China-US AI race intensifies as Chinese labs accelerate innovation through transparency and open research culture. Key benchmarks include **AIME 2024**, **LiveCodeBench**, and **GPQA Diamond**.

Canonical issue URL

DeepSeek is all you need.

AI News for 5/28/2025-5/29/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (217 channels, and 4860 messages) for you. Estimated reading time saved (at 200wpm): 456 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

As mentioned yesterday, DeepSeek typically releases papers and benchmarks a day after their model weights, and today it was a benchmarks day.

It's hard to tell but basically it is a big upgrade from DeepSeek R1 and the largest Qwen 3, and roughly at the level of the leading closed models.

Artificial Analysis framed it best, in that China (DeepSeek) has unambiguously taken over the open weights leadership from the US and Europe.

This improvement comes at a cost of extra thinking tokens:

This advancement stems from enhanced thinking depth during the reasoning process: in the AIME test set, the previous model used an average of 12K tokens per question, whereas the new version averages 23K tokens per question.


AI Twitter Recap

DeepSeek R1-0528 and Chinese AI Model Advances (DeepSeek, Qwen, OpenBench, RL, China-US AI race, Architecture, Benchmarks, Open Weights)


AI Tools, Agentic Workflows, and Perplexity Labs


Interpretability, Evaluation, and Open Source Tools (Anthropic, Claude, Neuronpedia, Benchmarks, Transparency)


AI Reddit Recap

/r/LocalLlama Recap

1. DeepSeek-R1-0528 Official Benchmarks and Performance Comparisons

2. Breakout Results and Industry Comparisons for DeepSeek-R1 and R1.1

3. DeepSeek R1.1 and 8B Distill Model Developments and Benchmarks

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1. New Models Storm the Scene, Capabilities Scrutinized

Theme 2. Dev Tools & Frameworks Fuel AI Innovation and Integration

Theme 3. The Balancing Act: Model Safety, Openness, and Control Under Fire

Theme 4. GPU Power & Performance Puzzles Dominate Hardware Discussions

Theme 5. Agentic AI Marches into Real-World Applications, Leaving Old Benchmarks Behind


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


Unsloth AI (Daniel Han) Discord


Cursor Community Discord


OpenAI Discord


LM Studio Discord


OpenRouter (Alex Atallah) Discord


Eleuther Discord


GPU MODE Discord


HuggingFace Discord


aider (Paul Gauthier) Discord


Nous Research AI Discord


Latent Space Discord


Manus.im Discord Discord


Notebook LM Discord


Yannick Kilcher Discord


MCP (Glama) Discord


Modular (Mojo 🔥) Discord


LlamaIndex Discord


LLM Agents (Berkeley MOOC) Discord


Torchtune Discord


Cohere Discord


DSPy Discord


tinygrad (George Hotz) Discord


Nomic.ai (GPT4All) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 messages):

Perplexity Labs launch, Labs Features, Deep Research vs Labs


Perplexity AI ▷ #general (1046 messages🔥🔥🔥):

Opus Pricing, ios 26, Perplexity Labs


Perplexity AI ▷ #sharing (3 messages):

Opera AI Browser, Perplexity AI Search


Perplexity AI ▷ #pplx-api (9 messages🔥):

New search_results Metadata, Perplexity Labs API


LMArena ▷ #general (670 messages🔥🔥🔥):

Veo 3 vs Sora, Arc AGI Leaderboard, XAI integrates Grok into Telegram, Apple's AI search engine, LM Arena UI Changes


LMArena ▷ #announcements (2 messages):

a16z Podcast, LMArena, DeepSeek R1-0528


Unsloth AI (Daniel Han) ▷ #general (564 messages🔥🔥🔥):

DeepSeek-R1-0528, GGUF Quants, Chatterbox TTS, ThunderCompute GPU rental, KTO Uncensoring


Unsloth AI (Daniel Han) ▷ #off-topic (6 messages):

Qwen 3 MoE Lora, Serving Engine for Qwen, FedRag Unique Finetuning


Unsloth AI (Daniel Han) ▷ #help (69 messages🔥🔥):

GGUF saving issues, Qwen 2.5-coder 7b errors, Gemma 3 model inference issues, Unsloth and Flower AI dependency conflicts, Orpheus-tts trainer installation


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

Unsloth Finetuning, Hugging Face Collections


Unsloth AI (Daniel Han) ▷ #research (2 messages):

Kernel optimization, Batch 1 forward pass speed


Cursor Community ▷ #general (409 messages🔥🔥🔥):

Student ID verification help, Cursor Vendor Lock-In, Program Self-Improvement on GoDaddy CPanel, Building Agentic Applications, Claude 4 performance


Cursor Community ▷ #background-agents (9 messages🔥):

Cursor verification stuck, Secrets not injected, DNS issue with Cursor


OpenAI ▷ #ai-discussions (180 messages🔥🔥):

OpenAI Content Policy and Image Generation, Deepseek vs OpenAI Models, OpenAI's Data Retention Policies and Privacy, Sustainability of AI


OpenAI ▷ #gpt-4-discussions (11 messages🔥):

OpenAI chat log retention, FastAPI assistant file search throttling, AI model selection, Resonance analysis on AI


OpenAI ▷ #prompt-engineering (73 messages🔥🔥):

UPSUM Prompt, Custom Instructions, System Prompt Jailbreaking, Resonance Ritual, Safety Layer Circumvention


OpenAI ▷ #api-discussions (73 messages🔥🔥):

UPSUM Chain Prompt, Custom Instructions prompt, Privacy and Style Rules, Cool Prompts to share, Presence diagnostic and self-behavioral analysis


LM Studio ▷ #general (198 messages🔥🔥):

LM Studio install issues, Qwen 3 8B vs distil models, Fine-tuning on Windows, Tool calling with Qwen 30b A3 crashes


LM Studio ▷ #hardware-discussion (132 messages🔥🔥):

GPU Recommendations for AI Coding, AMD GPU Error in LM Studio, Huawei GPU Legitimacy, 5060Ti performance expectations, Integrated graphics on LLMs


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

DeepSeek R1, 100M tokens, Free variant


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

AI Agent Engineering, Memory-Augmented Agents, LLMs & Foundation Models, Full-Stack & Backend Systems, Automation & Agent Ops


OpenRouter (Alex Atallah) ▷ #general (320 messages🔥🔥):

PDF Size Limit on OpenRouter API, Gemini 2.5 Pro Creative Writing Struggles, DeepSeek R1 Release, OpenRouter Provider Application Timeline, Embeddings Implementation on OpenRouter


Eleuther ▷ #general (89 messages🔥🔥):

Grokking the Bible, Kye Gomez Rabbit Hole, Emergent Misalignment on Qwen2.5, Thinking tokens in R1 distillation


Eleuther ▷ #research (43 messages🔥):

Multimodal LLM, RL Alignment, Web Agents, Quantum Field Theory (QFT), Noise Injection


Eleuther ▷ #interpretability-general (5 messages):

Anthropic Circuit Tracer release, Neuronpedia Circuit Tracing integration, Attribution graphs


Eleuther ▷ #gpt-neox-dev (5 messages):

GPT-NeoX, ARM CPUs, Isambard cluster


GPU MODE ▷ #general (2 messages):

complex problems solved in pytorch/tflow, Gigabyte AORUS RTX 3080 GAMING BOX setup on Debian Linux


GPU MODE ▷ #triton (4 messages):

num_stages in autotune vs tl.range, Triton monthly meetups


GPU MODE ▷ #cuda (3 messages):

Shared memory access, Bank conflicts, Swizzling


GPU MODE ▷ #torch (32 messages🔥):

Constraining Tensors Value, AOT and Triton issues, FP4 on 5090, Triton and 5090 Issues, Debugging Torch Compilation Hangs


GPU MODE ▷ #cool-links (2 messages):

Grouped Latent Attention, VLMs for Video Games


GPU MODE ▷ #beginner (11 messages🔥):

Identity_py option, ROCm kernel, Triton Performance on AMD, Beginner resources to start learning, GPUMODE resource-stream


GPU MODE ▷ #liger-kernel (3 messages):

Liger-Kernel, Checkstyle errors, Commit formatting, Formatting standards, PR hygiene


GPU MODE ▷ #self-promotion (1 messages):

PTX Instructions in Mojo, Custom tanh function, Bfloat16 Validation, Inline PTX Assembly


GPU MODE ▷ #reasoning-gym (3 messages):

Self-Distillation, DeepSeek-R1-0528, Osmosis-Structure-0.6B


GPU MODE ▷ #submissions (40 messages🔥):

AMD MI300 performance, amd-fp8-mm leaderboard, amd-mixture-of-experts leaderboard, amd-mla-decode leaderboard, grayscale leaderboard


GPU MODE ▷ #factorio-learning-env (6 messages):

FLE Colab Notebook, FLE Gym Compatibility, FLE positioning paper


GPU MODE ▷ #amd-competition (12 messages🔥):

Competition problems, Submission limits, Code review


GPU MODE ▷ #cutlass (11 messages🔥):

Cutlass Fused Kernels, Transformer Models, MoE Kernel Fusion, L1 Alignment on PyTorch Tensors, Cache Control


HuggingFace ▷ #general (85 messages🔥🔥):

LLMs in Software Engineering, DeepSeek R1 Model, Hugging Face Space setup, Custom models for UVR, Chatterbox-tts installation issues


HuggingFace ▷ #today-im-learning (7 messages):

ML Beginner Path, Fine-tuning LLMs Advice, Customer Service Chatbot Project


HuggingFace ▷ #i-made-this (5 messages):

A2A, Model Context Protocol, VerbalCodeAI, pdf2txt converter


HuggingFace ▷ #NLP (7 messages):

Diffusion-LM, GitHub repo


HuggingFace ▷ #smol-course (2 messages):

GitHub-hosted course, Self-paced learning


HuggingFace ▷ #agents-course (11 messages🔥):

Gemma vs GPT-4o-mini, smolagents prompting, agent tool usage, Agent Course Onboarding, Agent Course Costs


aider (Paul Gauthier) ▷ #general (96 messages🔥🔥):

DeepSeek R1, Claude Code, Benchmarking DeepSeek-R1-0528, Sonnet 4 tool calling, aider clone for small models


aider (Paul Gauthier) ▷ #questions-and-tips (4 messages):

earning $100k in a week, multiple lint-cmd in aider conf, subprocess.py error, aider benchmark broken


Nous Research AI ▷ #general (79 messages🔥🔥):

Open Weights, Grok, EleutherAI, Axolotl, Deepseek's R1


Nous Research AI ▷ #ask-about-llms (5 messages):

RL Bot Release, Linux terminal simulator prompt


Nous Research AI ▷ #interesting-links (4 messages):

Chinese Models, BFL Model, OS Models


Latent Space ▷ #ai-general-chat (82 messages🔥🔥):

Reed Hastings joins Anthropic, n8n vs New Workflow Tool, Quantized 70B Llama, Sonnet 4 and Opus 4, Claude Code vs Cursor


Latent Space ▷ #ai-announcements (5 messages):

Autonomous SWE Agents, Factory AI, Browser-based AI Design, SWE-Bench Obsolescence


Manus.im Discord ▷ #general (83 messages🔥🔥):

Manus instability, Connecting tasks to GitHub repositories, Claude Sonnet 4.0, Veo 3, AI Studio


Notebook LM ▷ #use-cases (6 messages):

NotebookLM, NLM potential, NLM limitations, NLM Pro tiers, NLM podcast settings


Notebook LM ▷ #general (57 messages🔥🔥):

Custom Test Simulator, Smart Flashcard System, Selenium Integration, Audio Overviews Length, Podcast Voices


Yannick Kilcher ▷ #general (35 messages🔥):

DeepSeek scaling, Embedding Forward Pass, LLM Choice, Gemini Diffusion, GFlownets


Yannick Kilcher ▷ #paper-discussion (4 messages):

Paper Discussion, KNN, Matteo, Work crunch


Yannick Kilcher ▷ #agents (1 messages):

NeurIPS videos, Simons Institute YouTube channel


Yannick Kilcher ▷ #ml-news (14 messages🔥):

R2 vs O4 benchmark, FrontierMath Fraud, Astrocytes importance, R1-0528 stats


MCP (Glama) ▷ #general (14 messages🔥):

Awesome MCP Servers PR, MonetizedMCP Launch, OAuth2.1 Authentication for MCP Servers, Remote MCP Server Demo


MCP (Glama) ▷ #showcase (11 messages🔥):

mcp-ui-bridge porting, Multi-Chat MCP Server, Financial Analysis Agent, VerbalCodeAI, *arrs MCP servers


Modular (Mojo 🔥) ▷ #general (8 messages🔥):

Modverse 48, Modular blog, Level Advancement


Modular (Mojo 🔥) ▷ #mojo (7 messages):

Mojo C libraries, Mojo tree structure, Mojo GUI UI and FFI


LlamaIndex ▷ #blog (2 messages):

LlamaIndex Agents in Finance Workshop, LlamaCloud agentic strategies, Agentic Retrieval > Naive RAG


LlamaIndex ▷ #general (8 messages🔥):

Exception Handling in Workflows, Nested Asyncio Tasks, LLM-Powered Agents, Multi-Agent Systems, Model Context Protocol (MCP)


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 messages):

AgentX Submission, Entrepreneurship Track, Research Track, Agentic AI Summit


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (6 messages):

Kaggle project submissions, Submitting Perplexity outputs, Article language, Adding certificate to LinkedIn


Torchtune ▷ #general (7 messages):

Sanity Checks, Convergence of Loss Curves, Qwen 0.5b


Cohere ▷ #💬-general (2 messages):

CMD-R Model Update, Local Models, HF Weights


Cohere ▷ #🔌-api-discussions (2 messages):

Cohere OpenAI Cline VS Code


Cohere ▷ #🤝-introductions (2 messages):

AI Automation, No-Code/Low-Code Development, AI Agents & LLM Workflows, Voice AI Solutions


DSPy ▷ #show-and-tell (2 messages):

DSPy MCP tutorial, streamable HTTP, HuggingFace Spaces


DSPy ▷ #general (3 messages):

DSPy 3, Latent Space Podcast, Conference Bookings


tinygrad (George Hotz) ▷ #general (2 messages):

Whisper Bounty, Draft PR


tinygrad (George Hotz) ▷ #learn-tinygrad (3 messages):

types.FunctionType documentation, dynamic function construction


Nomic.ai (GPT4All) ▷ #announcements (1 messages):

Tableau CEO joins Nomic talk, New Fundraising, New Models


Nomic.ai (GPT4All) ▷ #general (2 messages):

VOID Pirate Captain Introduction, LocalDocs with Norus Hermes 2 Mistral DPO Model, AI mini PC