Frozen AI News archive

not much happened today

**DeepSeek R1-0528** release brings major improvements in reasoning, hallucination reduction, JSON output, and function calling, matching or surpassing closed models like **OpenAI o3** and **Gemini 2.5 Pro** on benchmarks such as **Artificial Analysis Intelligence Index**, **LiveBench**, and **GPQA Diamond**. The model ranks #2 globally in open weights intelligence, surpassing **Meta AI**, **Anthropic**, and **xAI**. Open weights and technical transparency have fueled rapid adoption across platforms like **Ollama** and **Hugging Face**. Chinese AI labs including **DeepSeek**, **Alibaba**, **ByteDance**, and **Xiaomi** now match or surpass US labs in model releases and intelligence, driven by open weights strategies. Reinforcement learning post-training is critical for intelligence gains, mirroring trends seen at **OpenAI**. Optimized quantization techniques (1-bit, 4-bit) and local inference enable efficient experimentation on consumer hardware. New benchmarks like **LisanBench** test knowledge, planning, memory, and long-context reasoning, with **OpenAI o3** and **Claude Opus 4** leading. Discussions highlight concerns about benchmark contamination and overemphasis on RL-tuned gains.

Canonical issue URL

A quiet weekend.

AI News for 6/2/2025-6/3/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (218 channels, and 9059 messages) for you. Estimated reading time saved (at 200wpm): 852 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Sorry this arrived late but it was a pretty quiet day. AIE sold out so we launched the AI Engineer online track and you can bookmark the keynotes and MCP livestream track now for the YouTube algorithm to do its thing, as it is likely to be the biggest stream in AIE history.

https://www.youtube.com/watch?v=z4zXicOAF28


AI Twitter Recap

1. Foundation Model Advances: DeepSeek R1-0528, Benchmarks, and Open Weights Leadership

2. Model Evaluation, Reasoning, Benchmarking, and RL

3. Multimodal AI, Agents, and Tooling

4. AI Infrastructure, Scaling, and Hardware

5. AI Agents, Memory, and Workflow Orchestration

6. Memes, Humor, and Community Vibes


AI Reddit Recap

/r/LocalLlama Recap

1. New Model Quantization Techniques and Local Model Performance

2. Commentary on Open Source AI Ecosystem Competition

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. OpenAI/Claude Next-Gen Model Release Rumors And Public Messaging

2. Hands-on Experiences With Recent Large Language Models and AI Tools

3. Concerns And Evidence About ChatGPT Data Privacy And Persistence


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1. Breakthroughs in Language Models and Performance Optimization

Theme 2. The Agentic Frontier: Building Smarter, More Reliable AI Assistants

Theme 3. Hardware Hustle and Kernel Kung Fu: Pushing Computational Boundaries

Theme 4. Revolutionizing Developer Toolkits and Integration Ecosystems

Theme 5. Fueling the Future: Innovations in Datasets and Training Data


Discord: High level Discord summaries

Perplexity AI Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


LM Studio Discord


Cursor Community Discord


HuggingFace Discord


GPU MODE Discord


OpenRouter (Alex Atallah) Discord


Manus.im Discord Discord


Eleuther Discord


aider (Paul Gauthier) Discord


Modular (Mojo 🔥) Discord


MCP (Glama) Discord


Notebook LM Discord


Nous Research AI Discord


Latent Space Discord


Torchtune Discord


DSPy Discord


LlamaIndex Discord


Yannick Kilcher Discord


LLM Agents (Berkeley MOOC) Discord


tinygrad (George Hotz) Discord


Nomic.ai (GPT4All) Discord


Cohere Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1241 messages🔥🔥🔥):

Unlimited access caveats, Adguard vs Surfshark, Perplexity Pro Limits, OpenAI T3 Chat Code, O3 Pro Model


Perplexity AI ▷ #sharing (1 messages):

Danube Water Levels, German Ship Locks, Live Dashboards


Perplexity AI ▷ #pplx-api (18 messages🔥):

Perplexity PDF Generation, Perplexity Labs new features, Perplexity API with Manus, Sonar API post, Sonar-reasoning-pro responses


OpenAI ▷ #ai-discussions (1134 messages🔥🔥🔥):

Decentralized AI architectures, AI and Japanese Translation, Gemini 2.5 Pro, Cursor vs Windsurf, OpenAI O3


OpenAI ▷ #gpt-4-discussions (223 messages🔥🔥):

GPT's File Upload Limit, Emergent Stability Claims, Recursive System Dangers, Chakra Compatibility Readings, Emotional Relationships with AI


OpenAI ▷ #prompt-engineering (7 messages):

Agent Swarms Token Usage, NLP Memory Constraints, Attention Management with RAG


OpenAI ▷ #api-discussions (7 messages):

Agent Swarms Token Usage, Memory Constraints in NLP, RAG for Prompting, Attention Management


Unsloth AI (Daniel Han) ▷ #general (688 messages🔥🔥🔥):

GRPO for TTS, Hyperbolic GPU Prices, Double BOS Tokens, Frieren TTS, Unsloth Dynamic Quantization


Unsloth AI (Daniel Han) ▷ #help (555 messages🔥🔥🔥):

Transformers patch fix, Qwen 2.5, Orpheus model data and datasets, VLLM and GRPO with Mistral 7B, Data Type model loading


Unsloth AI (Daniel Han) ▷ #showcase (9 messages🔥):

GRPO Article, LLM Scribe Tool


Unsloth AI (Daniel Han) ▷ #research (1 messages):

System Prompt Learning, LLMs learn problem-solving strategies, Open-source plugin in optillm, Karpathy's idea


LMArena ▷ #general (884 messages🔥🔥🔥):

Claude 4 Opus Problems, O3 Pro Release, DeepThink Context Window, Gemini 2.5 Pro vs GPT-4.5 Hallucinations, AI Generated Images Commercial Use


LMArena ▷ #announcements (1 messages):

Leaderboard Update, Staff AMA This Friday


LM Studio ▷ #general (236 messages🔥🔥):

NomicAI ModernBert Embedding, LM Studio and LiteLLM Integration, Llama 4 Scout Multimodal Support, Prompt Lookup Decoding, DeepSeek R1 vs. Qwen 8B


LM Studio ▷ #hardware-discussion (396 messages🔥🔥):

AMD RX 9080 XT, Deepseek R1 Distill Llama 70B, cheap used Mi50 32gb, Strix Halo 395, Ryzen 5600G


Cursor Community ▷ #general (620 messages🔥🔥🔥):

Cursor update UI, Claude 4 Documentation Inclination, Legacy Rules Removal, O3 Pro Student Discount, TheGalaxyStars org


Cursor Community ▷ #background-agents (10 messages🔥):

Background Agent Secrets, Devcontainers Setup, Background Agent Token Usage, Jules code model


HuggingFace ▷ #general (301 messages🔥🔥):

Consistent character generation workflows, API problems, Evaluation framework that's task agnostic, Reliability and replicability in long-running agentic tasks, Llama 4 10m context and future of big context models


HuggingFace ▷ #today-im-learning (7 messages):

MCP Sentiment Analysis, Gradio, Docker Streamlit, Speculation Decoding


HuggingFace ▷ #cool-finds (6 messages):

Empathy, Connection, Mental Health Support


HuggingFace ▷ #i-made-this (14 messages🔥):

Creator Reacted Badge, Flast Video Platform, AI Demo Directory, AERIS Cognitive Reasoning, Handwriting Fine-Tuning


HuggingFace ▷ #computer-vision (2 messages):

OpenAI pricing plans, AI-based Exam Proctoring


HuggingFace ▷ #NLP (3 messages):

Lunaris Codex, Hugging Face Caching, Fine-tuning embedding models


HuggingFace ▷ #smol-course (4 messages):

PR Request Permissions, GAIA agent issues, Smolagents in Gradio


HuggingFace ▷ #agents-course (19 messages🔥):

Agent Course Deadline, Local vs Conda Environment, Ollama Installation, API Quota Exceeded, LangGraph Assignment Difficulty


GPU MODE ▷ #general (8 messages🔥):

Fastvideo paper, Job advice for big tech


GPU MODE ▷ #triton (7 messages):

Triton Kernel Optimization, Code Reuse Triton, Triton Versioning for AMD and NVIDIA Leaderboards


GPU MODE ▷ #cuda (43 messages🔥):

CUDA Matrix indexing, SIMT vs SIMD, CUDA Matmul, H100 Persistent Mode, Async Copies


GPU MODE ▷ #torch (1 messages):

cuda.tunable, on-the-fly recording, tuning


GPU MODE ▷ #announcements (1 messages):

FastVideo, video diffusion models, accelerate video diffusion


GPU MODE ▷ #cool-links (1 messages):

System Prompt Learning, LLMs Learn Problem-Solving, Open Source plugin in optillm


GPU MODE ▷ #beginner (16 messages🔥):

CUDA correctness tools, GPU Puzzles


GPU MODE ▷ #pmpp-book (2 messages):

CUDA warp execution, Active warps per block, Divergent branches in CUDA


GPU MODE ▷ #irl-meetup (3 messages):

AI Engineer World's Fair, AI/ML infra at Microsoft, DINO3D, AI4Science, Robotics


GPU MODE ▷ #triton-puzzles (2 messages):

``


GPU MODE ▷ #self-promotion (4 messages):

GPU performance, atomic addition, custom hardware, tensor cores, gemv implementation


GPU MODE ▷ #🍿 (6 messages):

Ludwig CLI design, Parallel Sampling, Data Labeling Tool


GPU MODE ▷ #reasoning-gym (8 messages🔥):

Reasoning Gym Bugs, Nvidia and Reasoning Gym, Reasoning Gym Paper, OOD generalization


GPU MODE ▷ #general (11 messages🔥):

gpumode.com, GPU programming competition, leaderboard


GPU MODE ▷ #submissions (163 messages🔥🔥):

amd-mla-decode, amd-mixture-of-experts, amd-fp8-mm, grayscale, conv2d


GPU MODE ▷ #factorio-learning-env (7 messages):

Dockerized Factorio Server Activation, External Memory Systems Integration, Mem0 AI for RAG, Factorio Learning Environment PR #158


GPU MODE ▷ #amd-competition (1 messages):

wildman_yasei: As far as I know, the only the first problem is with PDF.


GPU MODE ▷ #cutlass (4 messages):

CuTE Examples, TiledMMA partitioning, Grouped GEMM Kernel


GPU MODE ▷ #mojo (2 messages):

PyTorch custom operators using Mojo, Modular nightly releases, Call Mojo from Python


OpenRouter (Alex Atallah) ▷ #app-showcase (5 messages):

AI Agent Engineering, LLMs & Foundation Models, Google Sheets for model/prompt eval, LLM Scribe tool


OpenRouter (Alex Atallah) ▷ #general (250 messages🔥🔥):

REST API sk-or-v1 keys, Submitting end-user IDs, DeepSeek v3 free rate limit, DeepSeek provider rankings, Chess Data & LLMs


Manus.im Discord ▷ #general (156 messages🔥🔥):

Manus AI student perks, School environment, OpenManus affiliation, Deploying Manus-generated sites


Eleuther ▷ #general (27 messages🔥):

Local 70B Model Graphing, DIY Model Storage Needs, Hugging Face Datasets, Vast.ai Pricing, HDD vs. SSD Bottleneck


Eleuther ▷ #research (102 messages🔥🔥):

Incentivizing agent reasoning, Token dropping in MoEs, Continual learning for RL LLM agents, Noise activations for continual learning, Variable length sequence infilling using discrete diffusion


Eleuther ▷ #interpretability-general (2 messages):

Low Dimensional Manifolds, Data Generating Process, Quotient Out Regularities


Eleuther ▷ #lm-thunderdome (24 messages🔥):

Hugging Face chunked prefill, RWKV model addition, lm-evaluation-harness bugs, lm-evaluation-harness documentation, max_seq_lengths cmdline argument


aider (Paul Gauthier) ▷ #general (69 messages🔥🔥):

AiderBench Dependencies, DeepSeek Agent, Opus Testing, Mac M3 Performance, Decentralized Inference Network


aider (Paul Gauthier) ▷ #questions-and-tips (28 messages🔥):

Gemini Model Issues, Aider's Automatic Conversation Summaries, Using Aider with Multiple Repos, SCM Files for HTML/CSS, Best Local Model Tips


Modular (Mojo 🔥) ▷ #general (3 messages):

Modular Hackathon, GPU programming workshop, Mojo kernels, MAX Graph model architectures, PyTorch custom ops


Modular (Mojo 🔥) ▷ #mojo (77 messages🔥🔥):

Type Checking in Mojo, Mojo and godbolt.org, Copyable and ExplicitlyCopyable Traits, Profiling Mojo, C Bindings Generator for Mojo


MCP (Glama) ▷ #general (73 messages🔥🔥):

MCP with Claude Desktop, MCP Transports, Dynamic Tool Registration in MCP, MCP Client Implementations, Elicitations support in MCP Clients


MCP (Glama) ▷ #showcase (3 messages):

Aura with AppAgentX, Android phone control by voice agent, MCP servers connecting to MCP knowledge store


Notebook LM ▷ #use-cases (10 messages🔥):

Language settings for Audio Overviews, NotebookLM chat API integration, Using NotebookLM to record lectures, Audio Overview length limitations


Notebook LM ▷ #general (63 messages🔥🔥):

Video Uploads, Metadata Embeddings, Pro Subscription Audio Podcast Limits, NotebookLM Availability Outside US, Cancelling Pro Features


Nous Research AI ▷ #general (63 messages🔥🔥):

DeepHermes-3 Discord Integration, Demis Hassabis AGI Prediction, Prompt Lookup Decoding for Speedups, Generalized Overfitting in AI, Claude Model Depreciation


Nous Research AI ▷ #research-papers (1 messages):

MINDcraft, LLM agents collaboration, MineCollab


Nous Research AI ▷ #interesting-links (6 messages):

System Prompt Learning, LLM Scribe Tool, Robotics Mouse


Nous Research AI ▷ #research-papers (1 messages):

MINDcraft platform, LLMs adaptive collaboration, embodied reasoning tasks, MineCollab benchmark, natural language communication


Latent Space ▷ #ai-general-chat (53 messages🔥):

Jason's Nitter Post, EleutherAI's Common-Pile Dataset and Comma 0.1 Model, Kontext Chat for Image Editing, NYT licenses content to Amazon for AI training, Karpathy's Guide to Using ChatGPT Versions


Latent Space ▷ #ai-announcements (14 messages🔥):

AIE World's Fair 2025, Live Production AI Bot Collaboration, Bug Reporting System for AIE


Torchtune ▷ #general (2 messages):

TPS benchmarks, Regression testing, Performance Metrics


Torchtune ▷ #dev (45 messages🔥):

LLaMA-3 70B Fine-Tuning, DP vs TP Performance, FP8 vs BF16 Comparison, Compile impact on TPS, Loss Parallel implementation


DSPy ▷ #general (45 messages🔥):

Claude Code analysis, DSPy talks at AI Engineering and Databricks DAIS, DSPy 3.0 release in June, DSPy and DARPA's Advanced Research Concepts lab, Agentic orchestration


LlamaIndex ▷ #announcements (1 messages):

Gradio Agents x MCP Hackathon


LlamaIndex ▷ #blog (3 messages):

E-Library-Agent, Gradio Agents & MCP Hackathon, Scaling Agents in Finance workshop


LlamaIndex ▷ #general (29 messages🔥):

Nested Workflow Event Streaming, Disabling Streaming During Document Indexing, Migrating from OpenAI LLM to vLLM, Tool Visibility Bug in llama-index-llms-google-genai, AI Powered Web Browser


Yannick Kilcher ▷ #general (5 messages):

AdamW Optimizer, SFT Training, Probability and statistics


Yannick Kilcher ▷ #paper-discussion (12 messages🔥):

RLVR Measurement, FP8 Training Stability, SwiGLU Activation, Smooth-SwiGLU, LLM Baseline Evaluations


Yannick Kilcher ▷ #agents (1 messages):

GitHub MCP Vulnerability, Invariant Labs Post-Mortem Report, GitHub Security Risks


Yannick Kilcher ▷ #ml-news (11 messages🔥):

xAI's Grok, Google AI Edge Repo


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (28 messages🔥):

AgentX submission, Technical appendix submission, Certificate declaration form, Trailblazer certificate, Next MOOC dates


tinygrad (George Hotz) ▷ #general (12 messages🔥):

AI-generated CUDA-C kernels beating PyTorch, tinygrad meeting #73, unsigned firmware on 7900XTX, multihost changes & p2p transfers


tinygrad (George Hotz) ▷ #learn-tinygrad (5 messages):

UOp class, UOp trees, Ops.UNIQUE


Nomic.ai (GPT4All) ▷ #general (13 messages🔥):

GPT4All Extension, Intel Compute, AI models, Model Context Protocol


Cohere ▷ #🔌-api-discussions (1 messages):

Azure AI Inference SDK, Cohere input types


Cohere ▷ #💡-projects (1 messages):

Cohere Spanish Recipes DPO Dataset, New Open Source Projects


Cohere ▷ #🤝-introductions (2 messages):

Agentic Frameworks, LLM Grounding